Summa
Summa is a fast modular and scalable search engine written in Java. Summa is characterized by:
- Integrated search. Summa can simultaneously access a number of different data and data sources and expose it in a unified interface.
- Modular design. The Summa search system consists of a set of independent modules, which makes it simple and easy to maintain and upgrade.
- Scalable. Summa supports a distributed architecture and can be scaled up or down to handle any amount of data.
- Open standards. Summa is based upon modern web technologies and standards, and don?t include any proprietary codes or elements.
- Failure tolerant. If a single source of data or service should fail, Summa will continue without that specific source.
http://wiki.statsbiblioteket.dk/summa/
comments powered by Disqus
Related Products
Constellio - Enterprise Search engine
Constellio Open Source Enterprise Search is based on Apache Solr and using Google Search Appliances connectors architecture, it allows, with a single click, to find all relevant content in your organization (Web, email, ECM, CRM etc.).
Solr
Solr is the popular, blazing fast open source enterprise search platform from the Apache Lucene project. Its major features include powerful full-text search, hit highlighting, faceted search, dynamic clustering, database integration, and rich document (e.g., Word, PDF) handling. Solr is highly scalable, providing distributed search and index replication, and it powers the search and navigation features of many of the world's largest internet sites.
IndexTank - Search Engine powers Reddit
IndexTank search engine powers search in Reddit, Social bookmarking site. IndexTank is acquired by LinkedIn and released the project as open source. It includes features like Variables boosts, Facets, Faceted search, Snippeting, Custom scoring functions, Suggest, and Autocomplete.
SenseiDB - Search engine used in LinkedIn
Sensei is a distributed data system that was built to support many product initiatives at LinkedIn, including the real-time faceted search in LinkedIn Signal and the news feed and tabs on the Homepage. Sensei is both a search engine and a database. It is designed to query and navigate through documents that consist of unstructured text and well-formed and structured metadata.
Xapian - Search Engine Library
Xapian is an Open Source Search Engine Library. It is written in C++, with bindings to allow use from Perl, Python, PHP, Java, Tcl, C# and Ruby. Xapian is a highly adaptable toolkit which allows developers to easily add advanced indexing and search facilities to their own applications. It supports the Probabilistic Information Retrieval model and also supports a rich set of boolean query operators.
Bobo - Faceted search library based on Lucene
Bobo Browse is an information retrieval technology that provides navigational browsing into a semi-structured dataset. Beyond the result set from queries and selections, Bobo Browse also provides the facets from this point of browsing. It provides support to sort documents on fields that have multiple values. It is stable and used by LinkedIn.
Blacklight
Blacklight is an open source OPAC (online public access catalog). It is ruby-on-rails based discovery interface (a.k.a. “next-generation catalog”) especially optimized for heterogeneous collections. It could be used as a library catalog, as a front end for a digital repository, or as a single-search interface to aggregate digital content that would otherwise be siloed. Blacklight uses Solr, an enterprise-scale index for its search engine.
compass
Compass is a real time searchengine. It is built on top of lucene. It is transactional, distributed, supports Spring MVC, integrates with Hibernate.
Sphinix
Sphinix is free open-source SQL full-text search engine. How do you implement full-text search for that 10+ million row table, keep up with the load, and stay relevant? Sphinx is good at those kinds of riddles.
ElasticSearch
ElasticSearch is an Open Source (Apache 2 license), distributed, RESTful Search Engine built for the cloud.