pipeline - PipelineAI: Real-Time Enterprise AI Platform

  •    HTML

Each model is built into a separate Docker image with the appropriate Python, C++, and Java/Scala Runtime Libraries for training or prediction. Use the same Docker Image from Local Laptop to Production to avoid dependency surprises.

Alluxio - Data orchestration for analytics and machine learning in the cloud

  •    Java

Alluxio (formerly known as Tachyon) is a virtual distributed storage system. It bridges the gap between computation frameworks and storage systems, enabling computation applications to connect to numerous storage systems through a common interface.




Trino - A query engine that runs at ludicrous speed

  •    Java

Trino is a highly parallel, distributed query engine built from the ground up for efficient, low-latency analytics. It is an ANSI SQL-compliant query engine that works with BI tools such as R, Tableau, Power BI, Superset, and many others. It natively queries data in Hadoop, S3, Cassandra, MySQL, and many other sources, without the complex, slow, and error-prone process of copying the data.

PyHive - Python interface to Hive and Presto. 🐝

  •    Python

PyHive is a collection of Python DB-API and SQLAlchemy interfaces for Presto and Hive. First install this package to register it with SQLAlchemy (see setup.py).
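
As a rough illustration of what a DB-API interface looks like in use, the sketch below runs a query through the standard cursor calls. It uses the standard library's sqlite3 module as a stand-in driver so that it runs without a Presto or Hive server; `fetch_rows` is a hypothetical helper, not part of PyHive. With PyHive, the connection would presumably come from `pyhive.presto.connect(...)` or `pyhive.hive.connect(...)` instead.

```python
import sqlite3  # stand-in DB-API 2.0 driver so the sketch runs without a server


def fetch_rows(connection, sql):
    """Run a query through the standard DB-API cursor calls and return all rows."""
    cursor = connection.cursor()
    try:
        cursor.execute(sql)
        return cursor.fetchall()
    finally:
        cursor.close()


# Assumption: with PyHive installed and a server available, this connection
# would be created with something like pyhive.presto.connect('localhost').
connection = sqlite3.connect(":memory:")
rows = fetch_rows(connection, "SELECT 1 AS one, 'presto' AS engine")
print(rows)
```

Because the helper relies only on `cursor()`, `execute()`, `fetchall()`, and `close()`, it should work unchanged with any driver that follows the DB-API 2.0 specification.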

Cube.js — Open-Source Analytical API Platform

  •    Javascript

Cube.js is an open-source analytical API platform. It is primarily used to build internal business intelligence tools or add customer-facing analytics to existing applications. Cube.js was designed to work with serverless data warehouses and query engines like Google BigQuery and AWS Athena. A multi-stage querying approach makes it suitable for handling trillions of data points. Most modern RDBMS work with Cube.js as well and can be further tuned for performance.


presto-ethereum - Presto Ethereum Connector -- SQL on Ethereum

  •    Java

Presto is a powerful interactive query engine that can run SQL queries on anything, be it MySQL, HDFS, a local file, or Kafka, as long as a connector to the source exists. This is a Presto connector to Ethereum blockchain data. With this connector, one can get hands-on with Ethereum blockchain analytics without having to learn the nitty-gritty of the JavaScript API.

MLCraft - Low-code business intelligence tool and a data science workflow

  •    Javascript

MLCraft is an open-source low-code business intelligence tool and a data science workflow. MLCraft was designed to query the data from several data warehouses and run machine learning experiments. Cube.js is used as a primary query layer and makes it suitable for handling trillions of data points. It is a full-stack data science platform that provides everything you need to build, manage and automate machine learning

presto-client-node - Distributed query engine Presto client library for node.js

  •    Javascript

Client library for the distributed query engine Presto, for Node.js. To install, add presto-client to your own package.json and run npm install.

presto-hdinsight - Presto on Azure HDInsight

  •    Shell

This connects to the Hive metastore via the Hive connector. On an N-worker-node cluster, you will have N-2 Presto worker nodes and 1 coordinator node. The setup also configures the TPCH connector, so you can run TPCH queries directly. You will see output like the following; note the IP:Port.
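
The node arithmetic above can be sketched as a small function. This is only an illustration: `presto_layout` and its dictionary keys are hypothetical names, and the lower bound of 2 nodes is an assumption made so that the worker count never goes negative.

```python
def presto_layout(cluster_worker_nodes):
    """Return the Presto node counts for an N-worker-node cluster.

    Follows the rule stated above: N-2 Presto workers and 1 coordinator.
    """
    # Assumption: fewer than 2 nodes would give a negative worker count.
    if cluster_worker_nodes < 2:
        raise ValueError("need at least 2 worker nodes")
    return {
        "presto_workers": cluster_worker_nodes - 2,
        "coordinators": 1,
    }


print(presto_layout(6))
```

For a 6-worker-node cluster, this gives 4 Presto worker nodes and 1 coordinator node.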

tpcds-hdinsight - TPCDS benchmark for various engines

  •    Python

Clone this repo. Run TPCDSDataGen.hql with settings.hql file and set the required config variables.

CerbeHub - Simple, Scalable and Futuristic API & Data Hub for Publishing and Consuming Microservices

  •    Javascript

Simple, Scalable and Futuristic API & Data Hub for Publishing and Consuming Microservices

HAMSuites - High Available Micro Service Suites, with Simple, Scalable and Futuristic API & Data Hub for Publishing and Consuming Microservices 💫 Build highly available microservices, including configurable API generation, a performance lab, and more

  •    Javascript

High Available Micro Service Suites, with Simple, Scalable and Futuristic API & Data Hub for Publishing and Consuming Microservices 💫 Build highly available microservices, including configurable API generation, a performance lab, and more

presto-go-client - A Presto client for the Go programming language.

  •    Go

A Presto client for the Go programming language. You need a working environment with Go installed and $GOPATH set.

presto-audit

  •    Java

Presto - Audit log

presto-kubernetes - Running Presto on k8s

  •    

Run a Presto cluster on Kubernetes. Clone this.

tpch-hdinsight - TPCH benchmark for various engines

  •    Python

Clone this repo. Run TPCHDataGen.hql with settings.hql file and set the required config variables.





