
toapi - Every web site provides APIs.

  •    Python

Toapi gives you the ability to make every web site provide APIs. Version v2.0.0 is a complete rewrite.

node-Tor - Javascript implementation of the Tor (or Tor like) anonymizer project (The Onion Router)

  •    Javascript

For a quick look, see the demo video on Peersm: download and stream anonymously inside your browser over a serverless, anonymous P2P network compatible with torrents. Check out torrent-live for a more general presentation and to get the dynamic blocklist.

.net HTTP Data Extractor

  •    LINQ

The nWeb Data Extractor Library supports extracting data from the HTML of an HTTP response. It converts the response HTML to XML, then lets the user extract the desired data from the generated XML file.
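The two-step idea described above (convert HTML to XML, then query the XML) can be sketched in Python using only the standard library. This is not the nWeb library's API; the `HtmlToXml` class, the `html_to_xml` helper, the `document` wrapper element and the sample markup are all assumptions made for illustration, and the converter only copes with reasonably well-formed HTML.

```python
from html.parser import HTMLParser
import xml.etree.ElementTree as ET

# Elements that never take a closing tag in HTML (partial list, for the sketch).
VOID_TAGS = {"br", "hr", "img", "input", "link", "meta"}


class HtmlToXml(HTMLParser):
    """Hypothetical converter: builds an ElementTree from HTML."""

    def __init__(self):
        super().__init__()
        self.root = ET.Element("document")  # assumed wrapper element
        self.stack = [self.root]

    def handle_starttag(self, tag, attrs):
        # Attributes without a value (e.g. <input disabled>) arrive as None.
        attrib = {k: v or "" for k, v in attrs}
        element = ET.SubElement(self.stack[-1], tag, attrib)
        if tag not in VOID_TAGS:
            self.stack.append(element)

    def handle_endtag(self, tag):
        # Pop back to the matching open tag; stray closing tags are ignored.
        for i in range(len(self.stack) - 1, 0, -1):
            if self.stack[i].tag == tag:
                del self.stack[i:]
                break

    def handle_data(self, data):
        current = self.stack[-1]
        if len(current):
            # Text that follows a child element is stored as that child's tail.
            last = current[-1]
            last.tail = (last.tail or "") + data
        else:
            current.text = (current.text or "") + data


def html_to_xml(html):
    parser = HtmlToXml()
    parser.feed(html)
    parser.close()
    return parser.root


# Step 1: convert the response HTML (sample markup, not real data) to XML.
html = ('<html><body><ul><li class="price">19.99</li>'
        '<li class="name">Widget</li></ul><br></body></html>')
root = html_to_xml(html)
xml_text = ET.tostring(root, encoding="unicode")

# Step 2: extract the desired data from the generated XML.
prices = [li.text for li in root.findall(".//li[@class='price']")]
```

Once the markup is held as XML, extraction reduces to ordinary XML queries; `findall` here uses the limited XPath subset that `xml.etree.ElementTree` supports.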

hacker-news-digest - :newspaper: A responsive interface of Hacker News with summaries and illustrations

  •    Python

This service extracts summaries and illustrations from Hacker News articles for people who want to get the most out of Hacker News while spending less time deciding which articles to read and which to skip.




node-krawler - Fast and lightweight web crawler with built-in cheerio, xml and json parser.

  •    Javascript

mikeal/request is used for fetching web pages, so any option from that package can be passed to Krawler's constructor. After Krawler emits the 'data' event, it automatically continues to the next URL, whether or not the result has been processed. If you would like full control over result handling, turn on the custom callback option; you can then control the program flow by invoking your callback. Don't forget to call it in every case, otherwise the queue will get stuck.
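The warning about always invoking the callback can be illustrated with a small Python analogy. This is not Krawler's API (Krawler is a Node.js package); the `CallbackQueue` class, the `done` callback and the example URLs are hypothetical, and no network requests are made. The sketch only shows why a queue that advances on callback stalls when one code path forgets to call it.

```python
from collections import deque


class CallbackQueue:
    """Hypothetical queue: moves to the next URL only when done() is called."""

    def __init__(self, urls, handler):
        self.pending = deque(urls)
        self.handler = handler
        self.processed = []

    def start(self):
        self._next()

    def _next(self):
        if not self.pending:
            return
        url = self.pending.popleft()
        called = False

        def done():
            nonlocal called
            if called:  # ignore a second call for the same URL
                return
            called = True
            self.processed.append(url)
            self._next()

        self.handler(url, done)


def careful(url, done):
    try:
        if "bad" in url:
            raise ValueError("simulated fetch failure")
    except ValueError:
        pass  # handle the error here
    finally:
        done()  # always release the queue, even on failure


def forgetful(url, done):
    if "bad" in url:
        return  # early return without done(): the queue stalls here
    done()


urls = ["http://example.com/a", "http://example.com/bad", "http://example.com/c"]

ok = CallbackQueue(urls, careful)
ok.start()

stuck = CallbackQueue(urls, forgetful)
stuck.start()
```

With `careful`, all three URLs end up in `ok.processed`. With `forgetful`, processing stops at the failing URL and the last one stays in `stuck.pending`, which is the stuck-queue situation the description warns about.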

snapshooter - Simple crawler for Single Page Applications

  •    CoffeeScript

Simple crawler for Single Page Applications. Snapshooter will load a URL, wait for the JavaScript to render, and save the result as plain HTML.

XML-Parser - A Node.js XML DOM, Parser & Stringifier.

  •    Javascript

Parse XML, HTML and more with a very tolerant XML parser and convert it into a DOM. The three components (DOM, parser and stringifier) are available separately as their own modules.

graphquery - GraphQuery is a query language and execution engine tied to any backend service.

  •    Go

GraphQuery is a query language and execution engine tied to any backend service. It is back-end language independent. GraphQuery is an easy-to-use query language with built-in XPath/CSS/Regex/JSONPath selectors and a range of built-in text processing functions. Its minimalist syntax lets you get any data structure you want.


node-bot - Fast, real-time extraction of web page information (HTML, text, etc.) using node-dom, based on given criteria (example: retrieve the price of a product in real time)

  •    Javascript

Real-time extraction of web page information (HTML, text, etc.) based on given criteria. It can be used as a server or an API, in which case parameters are passed in the URL, or directly as an independent Node.js module.

Mechanize

  •    CSharp

Stateful programmatic web browsing, based on Python-Mechanize, which is based on Andy Lester’s Perl module WWW::Mechanize.

html-query - A fluent and functional approach to querying HTML

  •    Go

html-query is a Go package that provides a fluent and functional interface for querying the HTML DOM. It is based on golang.org/x/net/html. A large part of html-query is automatically generated from the HTML spec. The spec is in HTML format, so the generator parses it using html-query itself.
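To give a feel for what a "fluent and functional" query interface looks like, here is a minimal Python sketch. It does not reproduce html-query's Go API; the `Query` class, its `descendants`, `where` and `map` methods, and the `has_class` predicate are hypothetical names, and the sample markup is well-formed so that the standard library's XML parser can read it.

```python
import xml.etree.ElementTree as ET


class Query:
    """Hypothetical fluent wrapper around a list of elements."""

    def __init__(self, nodes):
        self.nodes = list(nodes)

    def descendants(self, tag):
        # Every descendant with the given tag, excluding the node itself.
        return Query(d for n in self.nodes for d in n.iter(tag) if d is not n)

    def where(self, predicate):
        # Keep only the nodes for which the predicate returns True.
        return Query(n for n in self.nodes if predicate(n))

    def map(self, fn):
        # Leave the fluent chain and return plain Python values.
        return [fn(n) for n in self.nodes]


def has_class(name):
    """Build a predicate that checks the element's class attribute."""
    return lambda node: name in node.get("class", "").split()


doc = ET.fromstring(
    '<html><body>'
    '<div class="post"><a href="/a">First</a></div>'
    '<div class="ad"><a href="/x">Buy</a></div>'
    '<div class="post"><a href="/b">Second</a></div>'
    '</body></html>'
)

links = (
    Query([doc])
    .descendants("div")
    .where(has_class("post"))
    .descendants("a")
    .map(lambda a: (a.get("href"), a.text))
)
```

Each step returns a new `Query`, so selections chain left to right, and predicates such as `has_class("post")` are ordinary functions that can be composed or reused.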