Displaying 1 to 3 from 3 results

slinky - web crawler just for links

  •    Javascript

Slinky is a web crawler, but just for the links between webpages. Slinky is intended to be used to visualize the routes and structure behind a website by collecting hyperlinks. If you decide to print out the source code and drop it down a flight of stairs, you may not be disappointed either.

supercrawler - A web crawler

  •    Javascript

Supercrawler is a Node.js web crawler. It is designed to be highly configurable and easy to use. When Supercrawler successfully crawls a page (which could be an image, a text document or any other file), it will fire your custom content-type handlers. Define your own custom handlers to parse pages, save data and do anything else you need.

robotparser-scala - robotparser-scala implements a parser for the robots.txt file format in Scala.

  •    Scala

robotparser-scala implements a parser for the robots.txt file format in Scala. And then, you have RobotsTxt instance. By default, character encoding is UTF-8.