Model factory is a ML training platform to help engineers to build ML models at scale

Last update: Sep 23, 2022

Related tags

Overview

Model Factory

Machine learning today is powering many businesses today, e.g., search engine, e-commerce, news or feed recommendation. Training high quality ML models is critical to all of these systems.

However, training a model is not trivial. Traditionally, engineers use single devvm to train models. It might be doable if you were only to build a few models. If you are interested in exploring hundreds or even thousands of ideas, repeating the workflow manually will be a painful process.

There are many issues with the above workflow:

Hard to scale
No tracking
No monitor
No end-to-end automation
Not easy to share with others
No centralized model management

The above pain points really slows engineers down when they are developing their ML models. Model factory is a project that targets at addressing the above issues.

Background

There are existing work in the industry which tries to address the above issues as well, e.g., Facebook fblearner, Google Kubeflow.

The key difference between model factory and other projects is that model factory promotes a pure python based authoring experience, while most others uses DAG (Directed Acyclic Graph). The philosophy gives model factory the following advantages:

Easy to learn: there is almost no learning curve. As long as you know how to write python, you know how to use model factory.
More flexible: control flow logic can be easily implemented on it.
Allow communication between nodes: free form communication can be done between operators, which opens up the possibility of building distributed training on top of model factory.

Installation

Please follow the Installation page to deploy model factory in your production or testing environment.

Development Guide

Please follow the Development Guide page to try out your first model factory pipeline.

Model factory is a ML training platform to help engineers to build ML models at scale

Related tags

Overview

Model Factory

Background

Installation

Development Guide

Owner

Arquivos do curso online sobre a estatística voltada para ciência de dados e aprendizado de máquina.

PennyLane is a cross-platform Python library for differentiable programming of quantum computers

Evidently helps analyze machine learning models during validation or production monitoring

Summer: compartmental disease modelling in Python

PySpark ML Bank Churn Prediction

MCML is a toolkit for semi-supervised dimensionality reduction and quantitative analysis of Multi-Class, Multi-Label data

NumPy-based implementation of a multilayer perceptron (MLP)

Machine learning that just works, for effortless production applications

Made in collaboration with Chris George for Art + ML Spring 2019.

Predico Disease Prediction system based on symptoms provided by patient- using Python-Django & Machine Learning

Skoot is a lightweight python library of machine learning transformer classes that interact with scikit-learn and pandas.

MIT-Machine Learning with Python–From Linear Models to Deep Learning

Scikit learn library models to account for data and concept drift.

Tribuo - A Java machine learning library

TorchDrug is a PyTorch-based machine learning toolbox designed for drug discovery

Management of exclusive GPU access for distributed machine learning workloads

Lightning ⚡️ fast forecasting with statistical and econometric models.

30 Days Of Machine Learning Using Pytorch

Python implementation of Weng-Lin Bayesian ranking, a better, license-free alternative to TrueSkill

MasTrade is a trading bot in baselines3,pytorch,gym