Displaying 1 to 2 from 2 results

pachyderm - Reproducible Data Science at Scale!

  •    Go

Pachyderm is a tool for production data pipelines. If you need to chain together data scraping, ingestion, cleaning, munging, wrangling, processing, modeling, and analysis in a sane way, then Pachyderm is for you. If you have an existing set of scripts which do this in an ad-hoc fashion and you're looking for a way to "productionize" them, Pachyderm can make this easy for you. Install Pachyderm locally or deploy on AWS/GCE/Azure in about 5 minutes.

python-pachyderm - Python client for Pachyderm

  •    Python

A python client wrapper for the Pachyderm API. All of the PFS functions used in pachctl are supported (almost) as-is.