Crawl-By-Example runs a crawl that classifies the processed pages by subject and finds the best pages according to examples provided by the operator. It is a plugin for the Heritrix crawler and was developed as part of the Google Summer of Code 2006 (GSoC06) program.

License

GNU Library or Lesser General Public License version 2.0 (LGPLv2)

Additional Project Details

Languages

English

Intended Audience

Advanced End Users, Developers, Science/Research

User Interface

Web-based

Programming Language

Java

Related Categories

Java Search Engines, Java Information Analysis Software

Registered

2007-02-12