IPython Notebook(s) demonstrating deep learning functionality.IPython Notebook(s) demonstrating scikit-learn functionality.
machine-learning deep-learning data-science big-data aws tensorflow theano caffe scikit-learn kaggle spark mapreduce hadoop matplotlib pandas numpy scipy kerasMobius provides C# language binding to Apache Spark enabling the implementation of Spark driver program and data processing operations in the languages supported in the .NET framework like C# or F#.For more code samples, refer to Mobius\examples directory or Mobius\csharp\Samples directory.
spark apache-spark rdd dataframe dstream dataset streaming mobius kafka-streaming spark-streaming fsharp bigdata mapreduce eventhubs near-real-timeCorral is a MapReduce framework designed to be deployed to serverless platforms, like AWS Lambda. It presents a lightweight alternative to Hadoop MapReduce. Much of the design philosophy was inspired by Yelp's mrjob -- corral retains mrjob's ease-of-use while gaining the type safety and speed of Go. Corral's runtime model consists of stateless, transient executors controlled by a central driver. Currently, the best environment for deployment is AWS Lambda, but corral is modular enough that support for other serverless platforms can be added as support for Go in cloud functions improves.
aws-lambda mapreduce-framework mapreduce serverlessdistributed_computing include mapreduce kvstore etc.
raft mapreduce consistencyA distributed computing platform written in F# and C#. The goal is to have a Peer-to-Peer implementation with automated distribution and replication without a master node.
distributed functional mapreduceAqueduct is a framework for analyzing large data sets by composing small functional building blocks into complex pipeline graphs that are processed as streams.
data-analysis mapreduce streamData Application Platform for Hadoop
unified integration platform dataset mapreduce spark spark-streaming java-8 cdapRequires multiple modules using glob patterns and combines them into a nested object.Returns a promise that resolves to an object containing the required contents of matching globbed files.
dir directory directories file files glob globs map mapreduce multi multiple reduce require treeRun MapReduce in client's browser. An example application can be found inside example/ directory of the source code. The example generates chunks of data constituting person names from an NLTK corpus. The map/reduce prepares a dictionary of alphabets as keys and the number of names starting with the particular alphabet as the value.
mapreduce distributed parallel-processingA Terraform module to create an Amazon Web Services (AWS) Elastic MapReduce (EMR) cluster.
terraform terraform-modules aws amazon-web-services emr mapreduceМасштабируемое машинное обучение и анализ больших данных с Apache Spark
spark mapreduce lectures machine-learning bigdataAsakusa Framework Parent POM
asakusa-framework batch batch-processing hadoop mapreduce data-flow framework big-dataIt's intended to be used as a database for storing metadata in systems that use MQTT as message bus, I'm using it in conjunction with mqtt-smarthome, but I think it could be useful in other MQTT based environments also. You can create and modify documents by publishing JSON payloads to MQTT and receive document changes by simply subscribing to certain topics. You can create views by defining map and reduce functions and filter document ids with MQTT style wildcards.
mqtt database json store mapreduce nosql documents metadata viewsis software for RNA-seq analysis. the website.
rna-seq-analysis emr mapreduce alignments ipython rail-rnaA Simple and Efficient Distributed Multidimensional BI Analysis Engine.
mapreduce multidimensional business-intelligence reports data-analysis olap olap-cubeAn ultra-light-weight HBase ORM library that enables [1] object-oriented access of HBase rows (Data Access Object) [2] reading from and/or writing to HBase tables in Hadoop MapReduce jobs [3] writing high-quality test cases for classes that interact with HBase
hbase orm mapreduce hadoop-mapreduce java-annotations object-mapping junit java-libraries column-family hbase-ormGoCollaborate is an universal framework for stream computing and distributed services management that you can easily program with, build extension on, and on top of which you can create your own high performance distributed applications. GoCollaborate absorbs the best practice experience and improves from the popular distributed computing frameworks including✨Hadoop, ✨Spark, ✨ZooKeeper, ✨Dubbo and ✨Kite that helps to ideally provision the computability for large scale data sets with an easy-to-launch setups.
distributed-framework microservices mapreduce hadoop golang-library mattn
We have large collection of open source products. Follow the tags from
Tag Cloud >>
Open source products are scattered around the web. Please provide information
about the open source projects you own / you use.
Add Projects.