We have a collection of more than 1 million open source products, ranging from enterprise products to
small libraries, across all platforms. We aggregate information from all open source repositories.
Search and find the best fit for your needs. Check out the projects section.
Harvest is a web indexing package. Originally designed for distributed indexing, it can form a powerful system for indexing both large and small web sites. It now also includes Harvest-NG, a highly efficient, modular, Perl-based web crawler.
A mirror of WebHarvest (https://sourceforge.net/projects/web-harvest) with some code modifications of my own, such as a complete port to HttpComponents 4.2.3. I would be glad if the WebHarvest maintainers merged my branch and took care of my additions.