
cheerio - Fast, flexible, and lean implementation of core jQuery designed specifically for the server

  •    Javascript

❤ Familiar syntax: Cheerio implements a subset of core jQuery. Cheerio removes all the DOM inconsistencies and browser cruft from the jQuery library, revealing its truly gorgeous API. ϟ Blazingly fast: Cheerio works with a very simple, consistent DOM model. As a result, parsing, manipulating, and rendering are incredibly efficient. Preliminary end-to-end benchmarks suggest that cheerio is about 8x faster than JSDOM.

parse5 - HTML parsing/serialization toolset for Node

  •    Javascript

HTML parsing/serialization toolset for Node.js. WHATWG HTML Living Standard (aka HTML5)-compliant. parse5 provides nearly everything you may need when dealing with HTML. It's the fastest spec-compliant HTML parser for Node to date. It parses HTML the way the latest version of your browser does. It has proven itself reliable in such projects as jsdom, Angular2, Polymer and many more.

ti-htmlparser2 - Forgiving HTML/XML parser for Titanium SDK

  •    Javascript

This is a titaniumified version of htmlparser2, built using grunt-titaniumifier. A packaged CommonJS module can be found on the Releases page.

htmlparser-to-html - Converts the JSON that the htmlparser/htmlparser2 package produces back to HTML

  •    Javascript

Converts the JSON that htmlparser (and probably htmlparser2) produces back to HTML. Useful if you're doing some sort of transformation.
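To make the round trip concrete, here is a minimal sketch of turning a parsed node tree back into an HTML string. It is not the htmlparser-to-html API: `toHtml`, `escapeAttr`, and `VOID_TAGS` are hypothetical names, and the node shape (`type`, `name`, `attribs`, `children`, `data`) is an assumption based on the DOM that htmlparser-style handlers typically produce.

```javascript
// Hypothetical helper, NOT the htmlparser-to-html API.
// Assumed node shape: { type, name, attribs, children, data }.
const VOID_TAGS = new Set([
  'area', 'base', 'br', 'col', 'embed', 'hr', 'img',
  'input', 'link', 'meta', 'source', 'track', 'wbr',
]);

// Escape the characters that would break a double-quoted attribute value.
function escapeAttr(value) {
  return String(value).replace(/&/g, '&amp;').replace(/"/g, '&quot;');
}

// Recursively serialize an array of nodes to an HTML string.
function toHtml(nodes) {
  return nodes.map((node) => {
    switch (node.type) {
      case 'text':
        return node.data; // text is emitted exactly as stored
      case 'comment':
        return `<!--${node.data}-->`;
      case 'directive':
        return `<${node.data}>`; // e.g. data = "!DOCTYPE html"
      case 'tag':
      case 'script':
      case 'style': {
        const attrs = Object.entries(node.attribs || {})
          .map(([key, value]) => ` ${key}="${escapeAttr(value)}"`)
          .join('');
        // Void elements have no children and no closing tag.
        if (VOID_TAGS.has(node.name)) return `<${node.name}${attrs}>`;
        return `<${node.name}${attrs}>${toHtml(node.children || [])}</${node.name}>`;
      }
      default:
        return ''; // unknown node types are skipped
    }
  }).join('');
}

// A hand-written tree standing in for parser output.
const tree = [
  {
    type: 'tag', name: 'p', attribs: { class: 'note' }, children: [
      { type: 'text', data: 'Hello ' },
      { type: 'tag', name: 'b', attribs: {}, children: [{ type: 'text', data: 'world' }] },
      { type: 'tag', name: 'br', attribs: {} },
    ],
  },
];

console.log(toHtml(tree)); // <p class="note">Hello <b>world</b><br></p>
```

A transformation step would sit between parsing and serializing: walk the tree, change `attribs` or `children` in place, then call the serializer on the result.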




ti-html2as - HTML 2 AttributedString converter for Titanium

  •    Javascript

HTML to Ti.UI.AttributedString parser for Titanium. The module exports a single function that takes an HTML string and a callback to receive an error or Ti.UI.AttributedString object.

semantic-schema-parser

  •    Javascript

A Node.js module that extracts http://schema.org microdata from HTML and converts it into a JSON object. The example creates a file named result.json from a list of URLs. That file contains a text example of the generated JSON object.

SUq - A nodejs Scraping Utility for lazy people. MIT Licensed

  •    Javascript

Here's a simple Node module that will allow you to asynchronously scrape Open Graph tags, microformats, microdata, header tags, images, classic meta, and whatever else you want with minimal effort. You can output the scraped data in the command line, or you can output scraped data as a JSON object. If you don't want the scraped data yet, and still want to fine-tune and grab more data from the HTML, no problem. You can extend suq as much as you want, it doesn't care. Scrape a website and output the data to the command line.
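As a rough illustration of what extracting Open Graph tags involves, the sketch below pulls `og:*` properties out of an HTML string and prints them as JSON. It does not use suq and does not reflect its API: `extractOpenGraph` is a hypothetical function, the sample page is made up, and the regex matching is a deliberately naive stand-in for a real HTML parser (it only handles double-quoted attributes).

```javascript
// Hypothetical helper, NOT part of suq. Regexes are a naive stand-in
// for a real HTML parser and only handle double-quoted attributes.
function extractOpenGraph(html) {
  const result = {};
  const metaTags = html.match(/<meta\b[^>]*>/gi) || [];

  for (const tag of metaTags) {
    // Collect the tag's attributes regardless of their order.
    const attrs = {};
    const attrPattern = /([\w:-]+)\s*=\s*"([^"]*)"/g;
    let match;
    while ((match = attrPattern.exec(tag)) !== null) {
      attrs[match[1].toLowerCase()] = match[2];
    }

    // Keep only <meta property="og:..." content="..."> entries.
    if (attrs.property && attrs.property.startsWith('og:') && 'content' in attrs) {
      result[attrs.property.slice(3)] = attrs.content;
    }
  }
  return result;
}

// Made-up sample page used only for this illustration.
const page = `<html><head>
<meta charset="utf-8">
<meta property="og:title" content="Example page">
<meta content="https://example.com/a.png" property="og:image">
<meta name="description" content="Not Open Graph">
</head></html>`;

console.log(JSON.stringify(extractOpenGraph(page), null, 2));
```

In a real scraper the HTML would come from an HTTP response and be handed to a proper parser; the point here is only the filtering of `meta` elements by their `property` prefix.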