
Terrier - Information Retrieval Platform


Terrier is a highly flexible, efficient, and effective open source search engine, readily deployable on large-scale collections of documents. Terrier implements state-of-the-art indexing and retrieval functionalities, and provides an ideal platform for the rapid development and evaluation of large-scale retrieval applications. Terrier can index large corpora of documents, and provides multiple indexing strategies, such as multi-pass, single-pass and large-scale MapReduce indexing.

Lemur - Search Engine


The Lemur Project develops search engines, browser toolbars, text analysis tools, and data resources that support research and development of information retrieval and text mining software. The project is best known for its Indri search engine, Lemur Toolbar, and ClueWeb09 dataset.

Poly-ir-toolkit - Information retrieval toolkit for large document collections.


Developed at the Web Exploration and Search Technology Lab at the Polytechnic Institute of NYU, under the advisement of Professor Torsten Suel, PolyIRTK provides tools for indexing and querying large document collections. PolyIRTK aims to serve as a platform for research into new compression/decompression algorithms and new querying and indexing techniques. It implements a number of techniques from recent research literature that improve the efficiency of indexing and querying. PolyIRTK

Java-ir-utils - Information Retrieval Utilities for Java


Small but useful classes for IR tasks in Java

Ir-themis - Information Retrieval framework


Themis is an Information Retrieval (IR) framework for comparing natural language documents. It includes implementations of theoretical retrieval models (currently the Vector Space Model (VSM) and the enhanced Topic-based Vector Space Model (eTVSM)). Themis also implements common algorithms used in the Information Retrieval domain (currently the Porter Stemmer). These algorithms can then be reused when implementing or configuring IR models. The