Production Grade Machine Learning Service

Last update: Apr 04, 2022

Overview

Production Grade Machine Learning Service

Stack

● Flask as the web framework.

● Redis for a fast loading of the trained model and other data between the workers.

● NGINX as a web server and reverse proxy.

● Gunicorn automatically creates parallel workers/threads according to the capacity of the machine it is running on.

● Celery to support asynchronous time-consuming requests as training and initializing the ML model.

Important Info

● Made to help you scale from a basic Machine Learning project for research purposes to a production grade Machine Learning web service.
● General purpose project, so it assumes that your service needs initialization, training, saving models to the databases for further usage in estimation.
● Based on Docker, so it could be scalable and OS-agnostic.

For the detailed API, use the file ml-service.yml on any swagger editor, and you will see the API definition.

You can find a postman collection of this service in the file MLServiceStructure.postman_collection.json, use it to validate your deployment.

[auth_info]
expiry=XXXX
expiry_time_unit=XXXX

expiry is basically the amount of time in expiry_time_unit for the generated bearer tokens to expire. example:

[auth_info]
expiry=120
expiry_time_unit=seconds

Also Don't forget to create the file ./redis/config.properties , use the following template to add the redis information:

MASTER_USER=XXXXX
REDIS_MASTER_PW=XXXXX
REDIS_CELERY_PW=XXXXX
HOST=redis
END_FILE=true

There are no restrictions about the values of XXXX in this file, you can use your own or use the following example:

MASTER_USER=master_user
REDIS_MASTER_PW=1234pw!@$
REDIS_CELERY_PW=4321wp!@$
HOST=redis
END_FILE=true

Production Grade Machine Learning Service

Related tags

Overview

Production Grade Machine Learning Service

Stack

● Flask as the web framework.

● Redis for a fast loading of the trained model and other data between the workers.

● NGINX as a web server and reverse proxy.

● Gunicorn automatically creates parallel workers/threads according to the capacity of the machine it is running on.

● Celery to support asynchronous time-consuming requests as training and initializing the ML model.

Important Info

For the detailed API, use the file ml-service.yml on any swagger editor, and you will see the API definition.

You can find a postman collection of this service in the file MLServiceStructure.postman_collection.json, use it to validate your deployment.

Owner

Abdullah Zaiter

This repo includes some graph-based CTR prediction models and other representative baselines.

Automated Machine Learning Pipeline for tabular data. Designed for predictive maintenance applications, failure identification, failure prediction, condition monitoring, etc.

Coursera Machine Learning - Python code

MLBox is a powerful Automated Machine Learning python library.

Automatically build ARIMA, SARIMAX, VAR, FB Prophet and XGBoost Models on Time Series data sets with a Single Line of Code. Now updated with Dask to handle millions of rows.

A python fast implementation of the famous SVD algorithm popularized by Simon Funk during Netflix Prize

Predict the output which should give a fair idea about the chances of admission for a student for a particular university

Anomaly Detection and Correlation library

A webpage that utilizes machine learning to extract sentiments from tweets.

Feature-engine is a Python library with multiple transformers to engineer and select features for use in machine learning models.

A collection of neat and practical data science and machine learning projects

pure-predict: Machine learning prediction in pure Python

MasTrade is a trading bot in baselines3,pytorch,gym

Kubeflow is a machine learning (ML) toolkit that is dedicated to making deployments of ML workflows on Kubernetes simple, portable, and scalable.

Bayesian optimization in JAX

Nixtla is an open-source time series forecasting library.

A Python package to preprocess time series

Test symmetries with sklearn decision tree models

A collection of machine learning examples and tutorials.

Data Version Control or DVC is an open-source tool for data science and machine learning projects