Model Training as a CI/CD System

This project demonstrates the machine model training as a CI/CD system in GCP platform. You will see more detailed workflow in the below section, but it is about rebuilding and redeploying (continuous integration) the currently deployed machine learning pipeline based on changes in code. Such changes could happen in the training data, data pre-processing logic, model architecture and training code, custom pipeline components, and so on.

Workflow #1

We create initial code, or we make some changes in the existing codebase for pipeline.
Based on the changes in the step 2, a GitHub action gets triggered to initiate a Cloud Build process.
The Cloud Build runs unit tests to see if those components work without errors.
If there is no error at all, there are two common sub-workflows from this point.
- Cloud Build containerizes the current codebase. This is an optional step. If you have any custom components unchanges, this step might be omitted.
  - The Cloud Build compiles a new pipeline. It creates an updated docker image, and it uploads the new docker image to GCR
- If there is any codes changed in data preprocessing, modeling, training steps, we only have to upload those source files to designated GCS bucket
The final step of the Cloud Build is to execute a pipeline run on Vertex AI

Workflow #2

Workflow in a nutshell

We create initial code, or we make some changes in the existing codebase for modules.
Based on the changes in the step 2, a GitHub action gets triggered to initiate a Cloud Build process.
The Cloud Build runs unit tests to see if those components work without errors.
If there is no error at all, there are two common sub-workflows from this point.
- If there is any codes changed in data preprocessing and models, we only have to upload those source files to designated GCS bucket.
The final step of the Cloud Build is to execute a pipeline run on Vertex AI. Trainer and Transform TFX components will look up the changed modules accordingly.

Acknowledgements

ML-GDE program for providing GCP credits.

Demonstration of the Model Training as a CI/CD System in Vertex AI

Related tags

Overview

Model Training as a CI/CD System

Workflow #1

Workflow #2

Workflow in a nutshell

Acknowledgements

Owner

Chansung Park

This is a classifier which basically predicts whether there is a gun law in a state or not, depending on various things like murder rates etc.

PyTorch Implementation of NCSOFT's FastPitchFormant: Source-filter based Decomposed Modeling for Speech Synthesis

Instantaneous Motion Generation for Robots and Machines.

maximal update parametrization (µP)

Weight estimation in CT by multi atlas techniques

📚 Papermill is a tool for parameterizing, executing, and analyzing Jupyter Notebooks.

General Virtual Sketching Framework for Vector Line Art (SIGGRAPH 2021)

Mask-invariant Face Recognition through Template-level Knowledge Distillation

Official Implementation of "Transformers Can Do Bayesian Inference"

Diverse Image Generation via Self-Conditioned GANs

Real-Time and Accurate Full-Body Multi-Person Pose Estimation&Tracking System

DEEPAGÉ: Answering Questions in Portuguese about the Brazilian Environment

Deep generative modeling for time-stamped heterogeneous data, enabling high-fidelity models for a large variety of spatio-temporal domains.

Official Pytorch implementation of the paper: "Locally Shifted Attention With Early Global Integration"

The official implementation of the CVPR 2021 paper FAPIS: a Few-shot Anchor-free Part-based Instance Segmenter

A self-supervised 3D representation learning framework named viewpoint bottleneck.

This repository contains the code for: RerrFact model for SciVer shared task

EMNLP 2021: Single-dataset Experts for Multi-dataset Question-Answering

AITom is an open-source platform for AI driven cellular electron cryo-tomography analysis.

An official source code for "Augmentation-Free Self-Supervised Learning on Graphs"