Crawl-By-Example runs a crawl, which classifies the processed pages by subjects and finds the best pages according to examples provided by the operator. Crawl-By-Example is a plugin to the Heritrix crawler, and was done as a part of GSoC06 program.
License
GNU Library or Lesser General Public License version 2.0 (LGPLv2)Follow Crawl-By-Example (Heritrix plugin)
Other Useful Business Software
Data management solutions for confident marketing
Verify, deduplicate, manipulate, and assign records automatically to keep your CRM data accurate, complete, and ready for business.
Rate This Project
Login To Rate This Project
User Reviews
Be the first to post a review of Crawl-By-Example (Heritrix plugin)!