Displaying 20 to 40 from 41 results

Strus - Full text Search Engine in C++


The open source project strus provides a collection of C++ (C++98) libraries and command line tools for building a full-text search engine. The strus search engine can be build using any key value store database that provides an upper bound seek function for the stored key/value pairs. Currently there exists an implementation based on the LevelDB library.

PyLucene - Python extension for accessing Java Lucene


PyLucene is a Python extension for accessing Java Lucene. Its goal is to allow you to use Lucene's text indexing and searching capabilities from Python. It is API compatible with the latest version of Java Lucene, PyLucene is not a Lucene port but a Python wrapper around Java Lucene. PyLucene embeds a Java VM with Lucene into a Python process.

Ferret - The extensible information retrieval library for ruby.


Ferret is an information retrieval library in the same vein as Apache Lucene. Originally it was a full port of Lucene but it now uses it's own file format and indexing algorithm although it is still very similar in many ways to Lucene. Everything you can do in Lucene you should be able to do in Ferret.

Apache Lucy - Search engine library provides full-text search for dynamic programming languages.


The Apache Lucy search engine library provides full-text search for dynamic programming languages.




LucenePlusPlus - C++ port of the popular Java Lucene library


Lucene++ is an up to date C++ port of the popular Java Lucene library, a high-performance, full-featured text search engine.

SenseiDB - Distributed, Realtime, Semi-Structured Database from LinkedIn


Sensei is a distributed data system that was built to support many product initiatives at LinkedIn, including the real-time faceted search in LinkedIn Signal and the news feed and tabs on the Homepage. Sensei is both a search engine and a database. It is designed to query and navigate through documents that consist of unstructured text and well-formed and structured metadata. Sensei is both a search engine and a database.

Bobo - Faceted search library based on Lucene


Bobo Browse is an information retrieval technology that provides navigational browsing into a semi-structured dataset. Beyond the result set from queries and selections, Bobo Browse also provides the facets from this point of browsing. It provides support to sort documents on fields that have multiple values. It is stable and used by LinkedIn.

ASPseek


ASPseek is an Internet search engine software developed by SWsoft.ASPseek consists of an indexing robot, a search daemon, and a CGI search frontend. It can index as many as a few million URLs and search for words and phrases, use wildcards, and do a Boolean search. Search results can be limited to time period given, site or Web space (set of sites) and sorted by relevance (PageRank is used) or date.



mnoGoSearch


mnoGoSearch for UNIX consists of a command line indexer and a search program which can be run under Apache Web Server, or any other HTTP server supporting CGI interface. mnoGoSearch for Unix is distributed in sources and can be compiled with a number of databases, depending on user's choice. It is known to work on a wide variety of the modern Unix operating systems including Linux, FreeBSD, Mac OSX, Solaris and others.

Heritrix


Heritrix is the Internet Archive's open-source, extensible, web-scale, archival-quality web crawler project. Heritrix is designed to respect the robots.txt exclusion directives and META robots tags, and collect material at a measured, adaptive pace unlikely to disrupt normal website activity.

Puggle


Puggle is an open-source desktop search engine written exclusively in Java. It provides full text and metadata search over files, folders, music, photos, web pages and more that are stored locally on your computer. Moreover, a portable edition of Puggle is available that you can carry around with you on a portable device, like a USB flash drive or an external hard drive. Puggle will index all the data contained in and allow you to search over them on any computer.

Sphinix - Search server


Sphinix is free open-source SQL full-text search engine. How do you implement full-text search for that 10+ million row table, keep up with the load, and stay relevant? Sphinx is good at those kinds of riddles.

Katta - Lucene and more in the cloud.


Katta is a scalable, failure tolerant, distributed, data storage for real time access. Katta serves large, replicated, indices as shards to serve high loads and very large data sets. These indices can be of different type. Currently implementations are available for Lucene and Hadoop mapfiles.

HubbleDotNet - Open source full-text search engine


HubbleDotNet is an open source full text search database project based on .net framework. Fast, precise, SQL supported and easy to use.

FSSearchIndexFX - A cross platform information retrieval API framework


FSSearchIndexFX is a cross platform Information Retrieval (IR) framework written in C# and supports both Windows and Mac OSX OSes It aims at developers writing or looking for some basic infrastructure API needed to perform IR tasks such as searching and indexing of text content.

Rosetta Engine


Rosetta is a full text engine entirely written in C++ with dotnet binding. It will provide same functionnaly than lucene with more flexibility, speed and scale up.

riot - Go Open Source, Distributed, Simple and efficient Search Engine


Supporting riot, buy me a coffee.Riot is primarily distributed under the terms of the Apache License (Version 2.0), base on wukong.

scout - RESTful search server written in Python, powered by SQLite.


scout is a RESTful search server written in Python. The search is powered by SQLite's full-text search extension, and the web application utilizes the Flask framework.Scout can be installed from PyPI using pip or from source using git. Should you install from PyPI you will run the latest version, whereas installing from git ensures you have the latest changes.

manticore - Manticore Search is a fork of Sphinx


Manticore Search is an open source search server designed to be fast, scalable and with powerful and accurate full-text search capabilities. It is a fork of popular search engine Sphinx.For more details see installation.