Displaying 1 to 20 from 25 results

TagSoup - SAX-compliant parser in Java

  •    Java

TagSoup, a SAX-compliant parser written in Java that, instead of parsing well-formed or valid XML, parses HTML as it is found in the wild: poor, nasty and brutish, though quite often far from short. TagSoup is designed for people who have to process this stuff using some semblance of a rational application design. TagSoup also includes a command-line processor that reads HTML files and can generate either clean HTML or well-formed XML that is a close approximation to XHTML.

oboe.js - A streaming approach to JSON.

  •    Javascript

Oboe.js is an open source Javascript library for loading JSON using streaming, combining the convenience of DOM with the speed and fluidity of SAX. It can parse any JSON as a stream, is small enough to be a micro-library, doesn't have dependencies, and doesn't care which other libraries you need it to speak to.

parse5 - HTML parsing / serialization toolset for Node.js

  •    Javascript

parse5 provides nearly everything you may need when dealing with HTML. It's the fastest spec-compliant HTML parser for Node to date. It parses HTML the way the latest version of your browser does. It has proven itself reliable in such projects as jsdom, Angular2, Polymer and many more.


  •    C

libxml++ is a C++ wrapper for the libxml XML parser library.


  •    C++

Xerces-C++ is a validating XML parser written in a portable subset of C++. Xerces-C++ makes it easy to give your application the ability to read and write XML data.


  •    C++

Arabica is an XML and HTML processing toolkit, providing SAX, DOM, XPath, and partial XSLT implementations, written in Standard C++.

Nokogiri - HTML, XML, SAX, and Reader parser with XPath and CSS selector support

  •    Ruby

Nokogiri (?) is an HTML, XML, SAX, DOM parser. Among Nokogiri's many features is the ability to search documents via XPath or CSS3 selectors, XML/HTML builder, XSLT transformer. Nokogiri parses and searches XML/HTML using native libraries (either C or Java, depending on your Ruby), which means it's fast and standards-compliant.


  •    Java

Piccolo is a small, extremely fast XML parser for Java. It implements the SAX 1, SAX 2.0.1, and JAXP 1.1 (SAX parsing only) interfaces as a non-validating parser and attempts to detect all XML well-formedness errors. Piccolo was developed by Yuval Oren.


  •    Python

lxml is a Pythonic binding for the libxml2 and libxslt libraries.


  •    Perl

Perl interface to Gnome libxml2 xml parsing and DOM library.


  •    C#

This is a SAX for .NET parser implementation based on the popular Expat XML parser.

parser-lib - Collection of parsers written in JavaScript

  •    Javascript

The ParserLib CSS parser is a CSS3 SAX-inspired parser written in JavaScript. It handles standard CSS syntax as well as validation (checking of property names and values) although it is not guaranteed to thoroughly validate all possible CSS properties.The CSS parser is built for a number of different JavaScript environments. The most recently released version of the parser can be found in the dist directory when you check out the repository; run npm run build to regenerate them from the latest sources.

node-expat - libexpat XML SAX parser binding for node.js

  •    Javascript

We don't emit an error event because libexpat doesn't use a callback either. Instead, check that parse() returns true. A descriptive string can be obtained via getError() to provide user feedback. Alternatively, use the Parser like a node Stream. write() will emit error events.

web-html-stream - Efficient element matching and processing for HTML5 DOM serialized HTML.

  •    HTML

Efficient streaming element matching and processing for HTML5 DOM serialized HTML. Works with Web Streams as returned by fetch.web-html-stream gets much of its efficiency from leveraging the syntactic regularity of HTML5 and XMLSerializer DOM serialization.

saxpath - XPath evaluator against a SAX XML stream

  •    Javascript

Simple XPath evaluator which runs against a SAX stream. Instantiate a new SaXPath object with new saxpath.SaXPath(saxParser, xpath [, recorder]). Then pipe a stream into it and SaXPath will emit match events on each XPath match. The emitted data is managed by the recorder, which by default recreates on the fly as a (XML) string from the SAX events.

node-elementtree - Port of Python's Element Tree module to Node.js

  •    Javascript

node-elementtree is a Node.js XML parser and serializer based upon the Python ElementTree v1.3 module. For the usage refer to the Python ElementTree library documentation - http://effbot.org/zone/element-index.htm#usage.

osm2obj - Take a readable stream of OSM XML and output a stream of objects

  •    Javascript

Implements a Node Transform Stream. Takes a readable stream of OSM XML and outputs a stream of objects compatible with Overpass OSM JSON. Also reads OsmChange XML and outputs the same format but with an additional property action which is one of create, modify, delete. Uses sax-js to work in both node and the browser. Any attribute that is not a valid OSM XML attribute will be ignored (see WHITELISTS). tag, member, or nd elements without the required attributes will throw an error. The readable side of the stream is in objectMode.

node-opds-parser - OPDS Catalog Feed Parser for node

  •    Javascript

Parses OPDS Catalog Feed to JSON using Isaac Schlueter's sax parser. Roughly 45,000 entries/s on 2.4 GHz Core i5 MacBook Pro.

node-halfstreamxml - converts a stream of XML to a stream of objects

  •    CoffeeScript

This module converts streams of XML to streams of JS objects. Unlike other SAX modules, it doesn't give you every tag and text node seperately, but lets you specify tag names you want to filter for. When this module encounters a tag with that name, its content will be collected until the closing tag was reached, and then you will receive the tag with all its attributes and children in one object. To compile the .coffee files to Javascript, use cake build. You will need Coffeescript for that.

node-xml-lite - Pure javascript XML SAX parser for Node.js

  •    Javascript

Pure javascript XML ANSI/Unicode SAX parser for Node.js

We have large collection of open source products. Follow the tags from Tag Cloud >>

Open source products are scattered around the web. Please provide information about the open source projects you own / you use. Add Projects.