Displaying 1 to 20 from 22 results

Yioop - Open Source Search Engine Software

  •    PHP

Yioop is an open source, PHP search engine capable of crawling, index, and providing search results for hundred of millions of pages on relatively low end hardware. It can index a variety of text formats HTML, RSS, PDF, RTF, DOC and images GIF, JPEG, PNG, etc. It can import data from ARC, WARC, Media-Wiki, Open Directory RDF. It is easily localized to many languages. It has built-in support for new feeds, discussion groups, blogs, and wikis. It also supports mixing indexes to create mash ups.

Xapian - Search Engine Library

  •    C++

Xapian is an Open Source Search Engine Library. It is written in C++, with bindings to allow use from Perl, Python, PHP, Java, Tcl, C# and Ruby. Xapian is a highly adaptable toolkit which allows developers to easily add advanced indexing and search facilities to their own applications. It supports the Probabilistic Information Retrieval model and also supports a rich set of boolean query operators.

Jumper - Collaborative search engine in PHP

  •    PHP

Jumper 2.0 is a collaborative community search platform that revolutionizes search by crowdsourcing knowledge management powered by a shared bookmarking engine. It is easily and quickly deployed into a community of practice that benefits users with complex and specialized search requirements. Jumper delivers universal search of any databases, flat files, fileshares, content systems, web pages, blogs and wikis, even people - through one simple search box.




Sphinix - Search server

  •    C++

Sphinix is free open-source SQL full-text search engine. How do you implement full-text search for that 10+ million row table, keep up with the load, and stay relevant? Sphinx is good at those kinds of riddles.

golucene - Go (Golang) port of Apache Lucene

  •    Go

Is Lucy faster than Lucene? It's written in C, after all. That depends. As of this writing, Lucy launches faster than Lucene thanks to tighter integration with the system IO cache, but Lucene is faster in terms of raw indexing and search throughput once it gets going. These differences reflect the distinct priorities of the most active developers within the Lucy and Lucene communities more than anything else.

Zoekt - Fast trigram based code search

  •    Go

This is a fast text search engine, intended for use with source code. (Pronunciation: roughly as you would pronounce "zooked" in English)


riot - Go Open Source, Distributed, Simple and efficient Search Engine

  •    Go

Supporting riot, buy me a coffee.Riot is primarily distributed under the terms of the Apache License (Version 2.0), base on wukong.

MHTextSearch - A fast full-text search library for Objective-C

  •    Objective-C

A fast & minimal embedded full-text indexing library, written in Objective-C, built on top of Objective-LevelDB.By far, the easiest way to integrate this library in your project is by using CocoaPods.

flow-indexer - Flow-Indexer indexes flows found in chunked log files from bro,nfdump,syslog, or pcap files

  •    Go

The deciding factor for how to partition the databases is how many unique ips you see per day. I suggest starting with monthly indexes. If the indexing performance takes a huge hit by the end of the month, switch to daily indexes. the indexall command will expand file_glob and index any log file that matches.

Lens - 🔍 Lens is an opt-in search engine and data collection tool to aid content discovery of the distributed web

  •    Go

Lens is an opt-in search engine and data collection tool to aid content discovery of the distributed web. Initially integrated with TEMPORAL, Lens will allow users to optionally have the data they upload be searched and indexed and be awarded with RTC for participating in the data collection process. Users can then search for "keywords" of content, such as "document" or "api". Lens will then use this keyword to retrieve all content which matched. Searching through Lens will be facilitated through the TEMPORAL web interface. Optionally, we will have a service independent from TEMPORAL which users can submit content to have it be indexed. This however, is not compensated with RTC. In order to receive the RTC, you must participate through Lens indexing within the TEMPORAL web interface.

boogeyman - An extra search package

  •    Go

A simple program that help you get search results from multi search engines instead of just from google. Return top result of each search engines.

blast - Blast is a full text search and indexing server. It is written in Go built on top of Bleve.

  •    Go

Blast is a full text search and indexing server written in Go built on top of the Bleve. It provides functions through gRPC (HTTP/2 + Protocol Buffers) or traditional RESTful API (HTTP/1.1 + JSON). Blast uses Raft consensus algorithm to achieve consensus across all the instances of the nodes, ensuring that every change made to the system is made to a quorum of nodes, or none at all. Blast makes it easy for programmers to develop search applications with advanced features. Blast requires Bleve and Bleve Extensions. Some Bleve Extensions requires C/C++ libraries. The following sections are instructions for satisfying dependencies on particular platforms.

doogle - Web search of the people, by the people, for the people with Go.

  •    Go

Web search of the people, by the people, for the people with Go. doogle is a Proof of Concept software of decentralized search engine based on gRPC written in Go.

cyberd - Knowledge consensus computer in Go

  •    Go

To run a full-node or validator in the latest public testnet of the cyberd follow the guide. If you have any problems with or questions about search, please contact us through a GitHub issue.

distributed-go - Source code for examples in Distributed Computing with Go

  •    Go

This repository contains the source code for my book Distributed Computing with Go. The mission of Distributed computing with Go is to make reasoning about concurrency and parallelism, effortless and provide the reader with the confidence to design and implement such programs in Go. We will start by digging into the core concepts behind goroutines and channels, the two fundamental concepts in Go around which the language is built. Next, we will design and build a distributed search engine using Go and Go's standard library.

minsql - Distributed SQL based search engine for log data

  •    Go

Massively Parallel Processing (MPP) log database with a simple HTTP API. For usage instructions, click here. MinSQL stores data in Columnar-Parquet format ordered by time. It is built on top of object storage for persistence. Logs can grow indefinitely across multiple MinIO clusters.