JanusGraph - Distributed graph database


JanusGraph is a highly scalable graph database optimized for storing and querying large graphs with billions of vertices and edges distributed across a multi-machine cluster. JanusGraph is a transactional database that can support thousands of concurrent users, complex traversals, and analytic graph queries.

OpenTSDB - A scalable, distributed Time Series Database.


OpenTSDB is a distributed, scalable Time Series Database (TSDB) written on top of HBase. OpenTSDB was written to address a common need: store, index and serve metrics collected from computer systems (network gear, operating systems, applications) at a large scale, and make this data easily accessible and graphable.
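As a rough illustration of what storing a metric involves, the sketch below formats a single data point as a line for OpenTSDB's telnet-style `put` command (metric name, Unix timestamp, value, then one or more tags). The helper name `format_put`, the metric name, and the tag values are hypothetical; nothing is sent over the network here.

```python
def format_put(metric, timestamp, value, tags):
    """Build one line for OpenTSDB's telnet-style `put` command.

    format_put is a hypothetical helper for illustration only; it is not
    part of any OpenTSDB client library.
    """
    if not tags:
        # OpenTSDB expects at least one tag on every data point.
        raise ValueError("at least one tag is required")
    # Sort the tags so the generated line is deterministic.
    tag_part = " ".join(f"{key}={val}" for key, val in sorted(tags.items()))
    return f"put {metric} {int(timestamp)} {value} {tag_part}"


# Example values are made up for this sketch.
line = format_put("sys.cpu.user", 1356998400, 42.5, {"host": "web01", "cpu": "0"})
print(line)
```

A line like this would typically be written to the TSD's listening socket, one data point per line.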

Apache Trafodion - Webscale SQL-on-Hadoop solution enabling transactional or operational workloads on Apache Hadoop.


Apache Trafodion is a webscale SQL-on-Hadoop solution enabling transactional or operational workloads on Apache Hadoop. Trafodion builds on the scalability, elasticity, and flexibility of Hadoop. Trafodion extends Hadoop to provide guaranteed transactional integrity, enabling new kinds of big data applications to run on Hadoop.

Kundera - JPA 1.0 ORM library for the Cassandra/HBase/MongoDB databases.


A JPA 2.0 compliant Object-Datastore Mapping Library for NoSQL Datastores. The idea behind Kundera is to make working with NoSQL Databases drop-dead simple and fun. Currently it supports Cassandra, MongoDB, HBase and Relational databases.




hbase-docker - HBase running in Docker


This configuration builds a Docker container that runs HBase (with embedded ZooKeeper) on the files inside the container. The approach requires editing the local server's /etc/hosts file to add an entry for the container hostname, because HBase uses hostnames to pass connection data back out of the container (from its internal ZooKeeper).
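The /etc/hosts step can be sketched as follows. This is a minimal illustration that works on the text of a hosts file rather than editing /etc/hosts directly (which needs root privileges); the IP address `172.17.0.2`, the hostname `hbase-docker`, and the helper `ensure_entry` are assumptions made for the example, not values taken from the project.

```python
def ensure_entry(hosts_text, ip, hostname):
    """Return hosts_text with a mapping for hostname, appending one if absent.

    Hypothetical helper: it only transforms a string and never touches
    the real /etc/hosts file.
    """
    for raw in hosts_text.splitlines():
        # Drop trailing comments, then split into "ip name [alias ...]".
        fields = raw.split("#", 1)[0].split()
        if hostname in fields[1:]:
            return hosts_text  # already mapped, nothing to do
    # Make sure the new entry starts on its own line.
    sep = "" if (not hosts_text or hosts_text.endswith("\n")) else "\n"
    return f"{hosts_text}{sep}{ip}\t{hostname}\n"


# Assumed container address and hostname, for illustration only.
sample = "127.0.0.1\tlocalhost\n"
updated = ensure_entry(sample, "172.17.0.2", "hbase-docker")
print(updated, end="")
```

The actual container IP has to be looked up after the container starts, and the result written back to /etc/hosts with appropriate privileges.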

cbass - adding "simple" to HBase


In this example, "packing" and "unpacking" are simply muted, relying on custom serialization being done before calling cbass, so the data is a byte array. Deserialization is done after the value is returned from cbass, since in this case cbass just returns a byte array (i.e. the identity function is used for both). Notice that "pluto" has no columns, which is also fine.
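The idea of using identity functions for packing and unpacking can be sketched in Python. This is not cbass's API (cbass is a Clojure library); `TinyStore`, its methods, and the "mars" row are hypothetical stand-ins that only illustrate the pattern: the caller serializes to bytes beforehand, the store passes bytes through untouched, and the caller deserializes afterwards.

```python
import json


def identity(x):
    """Pass the value through unchanged (pack/unpack are 'muted')."""
    return x


class TinyStore:
    """In-memory stand-in for a table: row key -> {column: stored value}.

    Hypothetical class for illustration; it does not talk to HBase.
    """

    def __init__(self, pack=identity, unpack=identity):
        self.pack = pack
        self.unpack = unpack
        self.rows = {}

    def store(self, row_key, columns=None):
        # A row may be stored with no columns at all.
        row = self.rows.setdefault(row_key, {})
        for name, value in (columns or {}).items():
            row[name] = self.pack(value)

    def find(self, row_key):
        row = self.rows.get(row_key, {})
        return {name: self.unpack(value) for name, value in row.items()}


store = TinyStore()  # identity for both pack and unpack

# Custom serialization happens before the store is called...
payload = json.dumps({"moons": 2}).encode("utf-8")
store.store("mars", {"facts": payload})
store.store("pluto")  # no columns, which is also fine

# ...and deserialization happens after the raw bytes come back.
raw = store.find("mars")["facts"]
decoded = json.loads(raw.decode("utf-8"))
print(decoded, store.find("pluto"))
```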


gimel - PayPal's Big Data Processing Framework


Gimel provides a unified Data API to access data from any storage, such as HDFS, GS, Alluxio, HBase, Aerospike, BigQuery, Druid, Elastic, Teradata, Oracle, MySQL, etc.

hbase-mr-pof - A proof of concept prototype of new HBase + Hadoop Map Reduce integration


A proof of concept prototype of new HBase + Hadoop Map Reduce integration

sparksql-for-hbase - Learn how to use Spark SQL and HSpark connector package to create / query data tables that reside in HBase region servers


Apache HBase is an open source, NoSQL distributed database that runs on top of the Hadoop Distributed File System (HDFS) and is well suited to fast read/write operations on large datasets, with high throughput and low input/output latency. Unlike relational and traditional databases, however, HBase lacks support for SQL scripting, data types, etc., and requires the Java API to achieve equivalent functionality. This journey is intended to give application developers familiar with SQL the ability to access HBase data tables using the same SQL commands. You will quickly learn how to create and query data tables by using Apache Spark SQL and the HSpark connector package. This lets you take advantage of the significant performance gains from using HBase without having to learn the Java APIs traditionally required to access HBase data tables.

cmux - A set of commands for managing CDH clusters using Cloudera Manager REST API.


CMUX is a set of commands for managing CDH clusters using the Cloudera Manager REST API. Some commands require an SSH connection to the managed servers.

hbase-packet-inspector - Analyzes network traffic of HBase RegionServers


hbase-packet-inspector (HPI) is a command-line tool for analyzing the network traffic of HBase RegionServers. HPI reads tcpdump files or captures a live stream of packets from a network interface to extract information about client requests and responses.

hbase-region-inspector - A visual dashboard of HBase region statistics


A visual dashboard of HBase region statistics. Download the executable binary that matches the version of your HBase cluster, add execute permission, and start it with the following command-line arguments.

mango - Core utility library & data connectors designed for simpler usage in Scala


Mango is a core utility library written in Scala for handling JSON formats, concurrency, logging, hashing, cryptography, reflection, etc., as well as making client connections to various endpoints, including HTTP, Telnet, OpenTSDB, Couchbase, ElasticSearch, HBase, and ZooKeeper. Mango has been used extensively at Kakao for years and is open-sourced under the Apache License. Its name follows Googlers' wisdom that core libraries should be named after tropical fruits.