PyHive is a collection of Python DB-API and SQLAlchemy interfaces for Presto and Hive.First install this package to register it with SQLAlchemy (see setup.py).
https://github.com/dropbox/PyHiveTags | hive hiveserver2 presto dbapi sqlalchemy |
Implementation | Python |
License | Apache |
Platform | Windows Linux |
Python client for HiveServer2 implementations (e.g., Impala, Hive) for distributed query engines. For higher-level Impala functionality, including a Pandas-like interface over distributed data sets, see the Ibis project.
Ansible playbook that installs a CDH 4.6.0 Hadoop cluster (running on Java 7, supported from CDH 4.4), with HBase, Hive, Presto for analytics, and Ganglia, Smokeping, Fluentd, Elasticsearch and Kibana for monitoring and centralized log indexing. Follow @analytically. Browse the CI build screenshots.
Genie is a federated job orchestration engine developed by Netflix. Genie provides REST-ful APIs to run a variety of big data jobs like Hadoop, Pig, Hive, Spark, Presto, Sqoop and more. It also provides APIs for managing the metadata of many distributed processing clusters and the commands and applications which run on them.See the official website to find documentation about Genie and specific documentation for various releases.
big-data bigdata orchestration configuration configuration-management spring-boot distributed-systems netflixossSpecialised plugins for Hadoop, Big Data & NoSQL technologies, written by a former Clouderan (Cloudera was the first Hadoop Big Data vendor) and modern Hortonworks partner/consultant. Supports a a wide variety of compatible Enterprise Monitoring systems.
nagios-plugins zookeeper hadoop hbase cloudera hbase-client jenkins travis-ci nagios-plugin hortonworks ambari cassandra elasticsearch docker kafka solr redis rabbitmq consul datastaxPresto is a powerful interactive querying engine that enables running SQL queries on anything -- be it MySQL, HDFS, local file, Kafka -- as long as there exist a connector to the source. This is a Presto connector to the Ethereum blockchain data. With this connector, one can get hands on with Ethereum blockchain analytics work without having to know how to play with the nitty gritty Javascript API.
presto prestodb ethereum ethereum-blockchain blockchain sqlPyDal is a pure Python database abstraction layer. It provides a DBAPI 2.0 wrapper for DBAPI 2.0 drivers. Sounds strange, but even drivers that fully conform to the API can differ enough to make building database independent applications difficult.
Flask-SQLAlchemy is an extension for Flask that adds support for SQLAlchemy to your application. It aims to simplify using SQLAlchemy with Flask by providing useful defaults and extra helpers that make it easier to accomplish common tasks.
flask sqlalchemy flask-sqlalchemyPresto is an open source distributed SQL query engine for running interactive analytic queries against data sources of all sizes ranging from gigabytes to petabytes. It allows querying data from relational / nosql databases. A single Presto query can combine data from multiple sources, allowing for analytics across your entire organization. It is developed by Facebook.
query-engine database-tool analytics big-data distributedThis library enables Apache Hive to read and write in JSON format. It includes support for serialization and deserialization (SerDe) as well as JSON conversion UDF. Download the latest binaries (json-serde-X.Y.Z-jar-with-dependencies.jar and json-udf-X.Y.Z-jar-with-dependencies.jar) from congiu.net/hive-json-serde. Choose the correct verson for CDH 4, CDH 5 or Hadoop 2.3. Place the JARs into hive/lib or use ADD JAR in Hive.
Shark is an open source distributed SQL query engine for Hadoop data. It brings state-of-the-art performance and advanced analytics to Hive users. It runs Hive queries up to 100x faster in memory, or 10x on disk. it is a large-scale data warehouse system for Spark designed to be compatible with Apache Hive.
distributed-sql hive big-dataFork from http://code.google.com/p/sqlalchemy-migrate/ to get it working with SQLAlchemy 0.8.Inspired by Ruby on Rails' migrations, Migrate provides a way to deal with database schema changes in SQLAlchemy projects.
The Apache Hive (TM) data warehouse software facilitates querying and managing large datasets residing in distributed storage.
Licensed under a Creative Commons Attribution-ShareAlike 4.0 International License. Versioning and auditing extension for SQLAlchemy.
sqlalchemy rdbms databases orm awesome-list awesome listThis Git remote helper makes Dropbox act like a true Git remote. It maintains all guarantees that are provided by a traditional Git remote while using Dropbox as a backing store. This means that it works correctly even when there are multiple people operating on the repository at once, making it possible to use a Dropbox shared folder as a Git remote for collaboration.Once the helper is installed, using it is as simple as adding a remote like dropbox:///path/to/repo.
Presto Web provides a complete content management system with a packages system that let you build your website as you want (only what you need)
Presto Poll is an excellent addition to any website. It supports several different databases, using SimpleXML, PHP, and flatfiles. It has an administration module where modification of all of the settings is possible. The poll result itself includes both
Presto is a music notation program aimed at making it easy for a composer to rapidly convert a musical sketch (melody, counterpoint, harmonic support and bass) into a complete musical arrangement.
These docker images are tested by hundreds of tools and also used in the full functional test suites of various other GitHub repos. These images are all available pre-built on My DockerHub - https://hub.docker.com/u/harisekhon/.
hadoop hbase cassandra solr solrcloud kafka consul superset zookeeper apache-drill nifi docker-image dockerhub docker rabbitmq-cluster nagios-plugins spark presto rabbitmq
We have large collection of open source products. Follow the tags from
Tag Cloud >>
Open source products are scattered around the web. Please provide information
about the open source projects you own / you use.
Add Projects.