Displaying 1 to 20 from 22 results

chronos - Fault tolerant job scheduler for Mesos which handles dependencies and ISO8601 based schedules

  •    Scala

Chronos is a replacement for cron. It is a distributed and fault-tolerant scheduler that runs on top of Apache Mesos that can be used for job orchestration. It supports custom Mesos executors as well as the default command executor. Thus by default, Chronos executes sh (on most systems bash) scripts. Chronos can be used to interact with systems such as Hadoop (incl. EMR), even if the Mesos agents on which execution happens do not have Hadoop installed. Included wrapper scripts allow transfering files and executing them on a remote machine in the background and using asynchronous callbacks to notify Chronos of job completion or failures. Chronos is also natively able to schedule jobs that run inside Docker containers.

dkron - Dkron - Distributed, fault tolerant job scheduling system http://dkron.io

  •    Go

Dkron is written in Go and leverage the power of distributed key-value stores and serf for providing fault tolerance, reliability and scalability while keeping simple and easily instalable. Dkron is inspired by the google whitepaper Reliable Cron across the Planet and by Airbnb Chronos borrowing the same features from it.

cronsun - A Distributed, Fault-Tolerant Cron-Style Job System.

  •    Go

cronsun is a distributed cron-style job system. It's similar with crontab on stand-alone *nix. The goal of this project is to make it much easier to manage jobs on lots of machines and provides high availability. cronsun is different from Azkaban, Chronos, Airflow.

Akka - Build Concurrent and Scalable Applications

  •    Java

Akka is the platform for the next generation event-driven, scalable and fault-tolerant architectures on the JVM. It helps to write simpler correct concurrent applications using Actors, STM & Transactors. It could scale out on multi-core or multiple nodes using asynchronous message passing. For fault-tolerance it adopts the Let it crash or Embrace failure model to build applications that self-heals, systems that never stop.




ElixirBooks - List of Elixir books

  •    

Official Elixir starting guide that will take you through the language foundations. You will also explore how to build projects with Mix and OTP, and it will introduce you to more advanvced techniques suchs as meta-programming. Adoption is more than programming. Elixir is an exciting new language, but to successfully get your application from start to finish, you’re going to need to know more than just the language. You need the case studies and strategies in this book. Learn the best practices for the whole life of your application, from design and team-building, to managing stakeholders, to deployment and monitoring. Go beyond the syntax and the tools to learn the techniques you need to develop your Elixir application from concept to production.

Ribbon - IPC library with built in software load balancers

  •    Java

Ribbon is a client side IPC library that is battle-tested in cloud. It provides Load balancing, Fault tolerance, Multiple protocol (HTTP, TCP, UDP), support in an asynchronous and reactive model Caching and batching.


PySyncObj - A library for replicating your python class between multiple servers, based on raft protocol

  •    Python

And thats all! Now you can call incCounter on serverA, and check counter value on serverB - they will be synchronized.You can look at batteries implementation, examples and unit-tests for more use-cases. Also there is an API documentation. Feel free to create proposals and/or pull requests with new batteries, features, etc. Join our gitter chat if you have any questions.

distributed-consensus-reading-list - List of academic papers on distributed consensus

  •    

This markdown file contains a list of academic papers (and other works) in the field of distributed consensus. Many of the papers listed below fit into more than one section. However, for simplicity, each paper is listed only in the most relevant section. Where possible, open access links for each paper have been provided. Contributions are welcome. This section lists theoretical results relating to distributed consensus.

Thespian - Python Actor concurrency library

  •    Python

This library provides the framework of an Actor model for use by applications implementing Actors.Alternate fork: https://github.com/kquick/Thespian.

ios - Reliable distributed agreement service for the cloud

  •    Go

Welcome to Ios, a reliable distributed agreement service for cloud applications. Built upon a novel decentralised consensus protocol, Ios provides vital services for your cloud application such as distributed locking, consistent data structures and leader election as well as distributed configuration and coordination.This repository is pre-alpha and under active development. APIs will be broken. This code has not been proven correct and is not ready for production deployment.

abstract_feature_branch - abstract_feature_branch is a gem that enables developers to easily branch by abstraction

  •    Ruby

It is a productivity and fault tolerance enhancing team practice that has been utilized by professional software development teams at large corporations, such as Sears and Groupon. It provides the ability to wrap blocks of code with an abstract feature branch name, and then specify in a configuration file which features to be switched on or off.

circuit-breaker - Circuit breaker for Clojure

  •    Clojure

The above copyright notice and this permission notice shall be included in all copies or substantial portions of the Software.

ganesha - :elephant: PHP implementation of Circuit Breaker pattern.

  •    PHP

Ganesha is PHP implementation of Circuit Breaker pattern which has multi strategies to avoid cascading failures and supports various storages to record statistics. It is one of the very active Circuit Breaker in PHP and production ready: well-tested, well-documented. 💪 You can integrate Ganesha to your existing code base easily as Ganesha provides just simple interface and Guzzle Middleware behaves transparency.

bond-cni - Bond-cni is for fail-over and high availability of networking in cloudnative orchestration

  •    Go

This plugin is recommended to be built with Go 1.7.5 which has been fully tested. Note: In this example configuration above required "ipam" is provided by flannel plugin implicitly.

metronome - Metronome is a distributed and fault-tolerant event scheduler

  •    Go

Metronome is a distributed and fault-tolerant event scheduler. It can be used to trigger remote systems throught events (HTTP, AMQP, KAFKA). Metronome is written in Go and leverage the power of kafka streams to provide fault tolerance, reliability and scalability.

stable-systems-checklist - An opinionated list of attributes and policies that need to be met in order to establish a stable software system

  •    

Below is an opinionated list of attributes and policies that need to be met in order to establish a stable software system. To the extent possible under law, Theo Despoudis has waived all copyright and related or neighboring rights to this work.