Displaying 1 to 7 from 7 results

Arachnode.net

  •    CSharp

An open source .NET web crawler written in C# using SQL 2005/2008. Arachnode.net is a complete and comprehensive .NET web crawler for downloading, indexing and storing Internet content including e-mail addresses, files, hyperlinks, images, and Web pages.

YaCy - Decentralized Web Search

  •    Java

YaCy (read "ya see") is a free distributed search engine, built on principles of peer-to-peer (P2P) networks. It is distributed on several hundred computers so-called YaCy-peers. Each YaCy-peer independently crawls through the Internet, analyzes and indexes found web pages, and stores indexing results in a common database which is shared with other YaCy-peers using principles of P2P networks.

Business Data - web information retrivial

  •    

We try to develop an opensource website crawler to retrieve business and marketing data from web sites or search engines.

arachnode.net

  •    DotNet

http://arachnode.net 2.6 release +lucene.net




Squzer - Distributed Web Crawler

  •    Python

Squzer is the Declum's open-source, extensible, scale, multithreaded and quality web crawler project entirely written in the Python language.

NWebCrawler

  •    

This is a web crawler program written in C#.

Data Extracting SDK

  •    

Data Extracting SDK can help you to extract information from the web resources in a simple way.