A search engine which can hold 100 trillion lines of log data.
poseidon search-engine big-data map-reducePachyderm is a tool for production data pipelines. If you need to chain together data scraping, ingestion, cleaning, munging, wrangling, processing, modeling, and analysis in a sane way, then Pachyderm is for you. If you have an existing set of scripts which do this in an ad-hoc fashion and you're looking for a way to "productionize" them, Pachyderm can make this easy for you. Install Pachyderm locally or deploy on AWS/GCE/Azure in about 5 minutes.
pachyderm docker analytics big-data containers distributed-systems kubernetes data-science data-analysisTrailDB is a library, implemented in C, which allows you to query series of events at blazing speed. TrailDB is also optimized for speed of development: Use its simple API with your favorite language, in your favorite environment. TrailDB's secret sauce is data compression. It leverages predictability of time-based data to compress your data to a fraction of its original size. In contrast to traditional compression, you can query the encoded data directly, decompressing only the parts you need.
database data-analytics event-data big-data time-series time-series-databaseStatus: In production use at Janelia. See wiki page for outside lab use of DVID. See the DVID Wiki for more information including installation and examples of use.
dataservice image-storage http-service connectomics big-data neuroscience key-valueWarp allows you to convert and analyze (very) large databases with ease at the speed of light. In Warp, you work on a small subset of the data, after which Warp repeats your actions on the entire dataset. Unlike most data analysis apps, you do not have to type any codes in Warp. Effortlessly juggle around data between files and databases by simply dragging-and-dropping! Load CSV files into MySQL or transfer a PostgreSQL table to a RethinkDB table by just dragging one to the other.
rethinkdb mysql postgresql data-analysis big-data sqliteGo client implementation for Hazelcast, the open source in-memory data grid. Go client is implemented using the Hazelcast Open Binary Client Protocol.
hazelcast in-memory datagrid big-data clustering scalability distributed caching imdg hazelcast-client golang-librarySee wiki: Link with CBLAS & LAPACK. This work has been sponsored by Symmetry Investments and Kaleidic Associates.
dlang linear-algebra ndslice numerical-methods matlab octave quantitative-finance big-data numpy native-code high-performance symmetry-investments blas hedgefundA Presto client for the Go programming language. You need a working environment with Go installed and $GOPATH set.
presto prestodb sql big-data
We have large collection of open source products. Follow the tags from
Tag Cloud >>
Open source products are scattered around the web. Please provide information
about the open source projects you own / you use.
Add Projects.