Displaying 1 to 20 from 25 results

Hub - Fastest dataset optimization and management for machine and deep learning

  •    Python

Note: the translations of this document may not be up-to-date. For the latest version, please check the README in English. Software 2.0 needs Data 2.0, and Hub delivers it. Most of the time Data Scientists/ML researchers work on data management and preprocessing instead of training models. With Hub, we are fixing this. We store your (even petabyte-scale) datasets as single numpy-like array on the cloud, so you can seamlessly access and work with it from any machine. Hub makes any data type (images, text files, audio, or video) stored in cloud usable as fast as if it were stored on premise. With same dataset view, your team can always be in sync.

metaflow - Build and manage real-life data science projects with ease.

  •    Python

Metaflow is a human-friendly Python/R library that helps scientists and engineers build and manage real-life data science projects. Metaflow was originally developed at Netflix to boost productivity of data scientists who work on a wide variety of projects from classical statistics to state-of-the-art deep learning. For more information, see Metaflow's website and documentation.




evidently - Interactive reports to analyze machine learning models during validation or production monitoring

  •    Jupyter

Interactive reports and JSON profiles to analyze, monitor and debug machine learning models. Evidently helps evaluate machine learning models during validation and monitor them in production. The tool generates interactive visual reports and JSON profiles from pandas DataFrame or csv files. You can use visual reports for ad hoc analysis, debugging and team sharing, and JSON profiles to integrate Evidently in prediction pipelines or with other visualization tools.

amazon-sagemaker-examples - Example 📓 Jupyter notebooks that demonstrate how to build, train, and deploy machine learning models using 🧠 Amazon SageMaker

  •    Jupyter

Example Jupyter notebooks that demonstrate how to build, train, and deploy machine learning models using Amazon SageMaker. Amazon SageMaker is a fully managed service for data science and machine learning (ML) workflows. You can use Amazon SageMaker to simplify the process of building, training, and deploying ML models.

kedro - A Python framework for creating reproducible, maintainable and modular data science code.

  •    Python

Kedro is an open-source Python framework for creating reproducible, maintainable and modular data science code. It borrows concepts from software engineering and applies them to machine-learning code; applied concepts include modularity, separation of concerns and versioning. Our Get Started guide contains full installation instructions, and includes how to set up Python virtual environments.


RUBRIX - Python framework to explore, label, and monitor data for NLP

  •    Python

Rubrix is a production-ready Python framework for exploring, annotating, and managing data in NLP projects. Most annotation tools treat data collection as a one-off activity at the beginning of each project. In real-world projects, data collection is a key activity of the iterative process of ML model development. Once a model goes into production, you want to monitor and analyze its predictions, and collect more data to improve your model over time. Rubrix is designed to close this gap, enabling you to iterate as much as you need.

MLOps - MLOps examples

  •    Python

MLOps empowers data scientists and app developers to help bring ML models to production. MLOps enables you to track / version / audit / certify / re-use every asset in your ML lifecycle and provides orchestration services to streamline managing this lifecycle. Azure ML contains a number of asset management and orchestration services to help you manage the lifecycle of your model training & deployment workflows.

tfx-kubeflow-pipelines - Kubeflow pipelines built on top of Tensorflow TFX library

  •    Jupyter

This repository contains machine learning pipelines based on Tensorflow TFX library. Every pipeline is designed to be published on a Kubernetes/Kubeflow cluster on premise. Further pipelines are welcome via pull request.

kfp-tekton - Kubeflow Pipelines with Tekton backend.

  •    TypeScript

Project bringing Kubeflow Pipelines and Tekton together. The project is driven according to this design doc. The current code allows you run Kubeflow Pipelines with Tekton backend end to end. For more details about the project please follow this detailed blog post. Additionally, look at these slides as well as this deep dive presentation for demos.

great_expectations_action - A GitHub Action that makes it easy to use Great Expectations to validate your data pipelines in your CI workflows

  •    Jupyter

Great Expectations is a leading tool for validating, documenting, and profiling your data to maintain quality and improve communication between teams. In order to configure the GitHub action for your repository, add the following code snippet to your GitHub workflows file. The file should be located under my_repo_name/.github/my_workflow.yml.

tempo - MLOps Python Library

  •    Python

An open source framework to enable data scientists to productionise, test and deploy models with simple workflows that abstract the underlying complexity of scalable MLOps platforms. Tempo provides a unified interface to multiple MLOps projects that enable data scientists to deploy and productionise machine learning systems.

merlin - Kubernetes-friendly ML model management, deployment, and serving.

  •    Go

To install Merlin in your local machine, click Local Development. Go to the docs folder for the full documentation and guides.

aml-compute - GitHub Action that allows you to attach, create and scale Azure Machine Learning compute resources

  •    Python

The actions for creating compute for Azure Machine Learning will allow you to create a new compute target on Azure Machine Learning using GitHub Actions. This repository contains a GitHub Action for creating and connecting to Azure Machine Learning compute resources, so you can later train or deploy machine learning models models remotely. If the compute target exists, it will connect to it, otherwise the action can create a new compute target based on the provided parameters. Currently, the action only supports Azure ML Clusters and AKS Clusters.

aml-deploy - GitHub Action that allows you to deploy machine learning models in Azure Machine Learning

  •    Python

The Deploy Machine Learning Models to Azure action will deploy your model on Azure Machine Learning using GitHub Actions. This repository contains GitHub Action for deploying Machine Learning Models to Azure Machine Learning and creates a real-time endpoint on the model to integrate models in other systems. The endpoint can be hosted either on an Azure Container Instance or on an Azure Kubernetes Service.

aml-registermodel - GitHub Action that allows you to register models to your Azure Machine Learning Workspace

  •    Python

The Register Machine Learning Models with Azure action will deploy your model on Azure Machine Learning using GitHub Actions. This repository contains a GitHub Action for registering Machine Learning Models with Azure Machine Learning model registry for use in deployment and testing. This action is designed to register models that may or may not have been trained using Azure Machine Learning. If they are not trained using Azure Machine Learning, we expect the model to be present in your GitHub Repository.

aml-run - GitHub Action that allows you to submit a run to your Azure Machine Learning Workspace.

  •    Python

The Azure Machine Learning training action will help you train your models on Azure Machine Learning using GitHub Actions. This action is one in a series of actions that can be used to setup an ML Ops process. We suggest getting started with one of our template repositories, which will allow you to create an ML Ops process in less than 5 minutes.

aml-workspace - GitHub Action that allows you to create or connect to your Azure Machine Learning Workspace

  •    Python

The aml-workspace action will login / connect to Azure Machine Learning. This repository contains a GitHub Action for connecting to an Azure Machine Learning workspace. You can later use this context to train your model remotely, deploy your models to endpoints etc. You can also use this action to create a new workspace, if you provide the appropriate parameters.






We have large collection of open source products. Follow the tags from Tag Cloud >>


Open source products are scattered around the web. Please provide information about the open source projects you own / you use. Add Projects.