Bigtablesearch - An experimental library for full-text searching on Python App Engine applications

  •        0

bts (BigTable Search) aims to bring scalable full-text search to Python applications hosted on Google's App Engine. It supports search feature like Stemming and stopword removal, Multi-term queries, Faceted search and lot more.

http://code.google.com/p/bigtablesearch

Tags
Implementation
License
Platform

   

comments powered by Disqus


Related Projects

Lucene


Apache Lucene is a high-performance, full-featured text search engine library written entirely in Java. It is a technology suitable for nearly any application that requires full-text search, especially cross-platform.

Solr


Solr is the popular, blazing fast open source enterprise search platform from the Apache Lucene project. Its major features include powerful full-text search, hit highlighting, faceted search, dynamic clustering, database integration, and rich document (e.g., Word, PDF) handling. Solr is highly scalable, providing distributed search and index replication, and it powers the search and navigation features of many of the world's largest internet sites.

Jackrabbit - Content Repository in Java


Apache Jackrabbit is a Content Repository fully conforming to JCR specification. Jackrabbit content repository is a hierarchical content store with support for structured and unstructured content, full text search, versioning, transactions, observation, and more.

IndexTank - Search Engine powers Reddit


IndexTank search engine powers search in Reddit, Social bookmarking site. IndexTank is acquired by LinkedIn and released the project as open source. It includes features like Variables boosts, Facets, Faceted search, Snippeting, Custom scoring functions, Suggest, and Autocomplete.

Sphinix


Sphinix is free open-source SQL full-text search engine. How do you implement full-text search for that 10+ million row table, keep up with the load, and stay relevant? Sphinx is good at those kinds of riddles.

HBase - Hadoop database


HBase provides support to handle BigTable - billions of rows X millions of columns. It is a scalable, distributed, versioned, column-oriented store modeled after Google's Bigtable and runs on top of HDFS (Hadoop Distributed Filesystem). It features compression, in-memory operation per-column. Data could be replicated between the nodes. HBase is used in Facebook and Twitter.

Nutch


Nutch is open source web-search software. It builds on Lucene Java, adding web-specifics, such as a crawler, a link-graph database, parsers for HTML and other document formats, etc.

OpenCMS


OpenCms from Alkacon Software is a professional, easy to use website content management system. OpenCms helps content managers worldwide to create and maintain beautiful websites fast and efficiently. OpenCms is based on Java and XML technology.

OTRS - Helpdesk platform in Perl


OTRS is the leading open-source service innovation suite and includes Help Desk, an ITIL-compatible IT service management (ITSM) solution, an iPhone App and an underlying technology platform. It provides web interface to administer and manage the tickets. Email notifications will be sent out for new tickets and follow-ups.

PDFBox - Java PDF library


Apache PDFBox is an open source Java PDF library for working with PDF documents. This library allows creation of new PDF documents, manipulation of existing documents and the ability to extract content from documents. It provides support for adding bookmarks, fonts, text extraction, Encryption, PDF printing and lot more.