Vertex AI: Serverless framework for MLOPs (ESP / ENG)

Español

Qué es esto?

Este repo contiene un pipeline end-to-end diseñado usando el SDK de Kubeflow Pipelines (KFP). En el contexto del uso de Vertex AI como solución, la idea es construir una arquitectura de machine learning lo más automatizada posible, integrando algunos de los principales servicios de Google Cloud Platform (GCP) tales como BigQuery (data warehousing), Google Cloud Storage (almacenamiento de objetos) y Container Registry (repositorio de imágenes de Docker).

Cómo lo corro?

Primero, ejecutar la notebook pipeline_setup.ipynb. Contiene la configuración de la infraestructura que será utilizada: se crean datasets en BigQuery y buckets en GCS y se instalan las librerías necesarias. Además se crean imágenes de Docker y se pushean a Container Registry para los jobs de tuneo de hiperparámetros.
Segundo, dentro de la carpeta components se encuentra la notebook components_definition.ipynb que deberá ejecutarse para generar los .yamls que serán invocados en la notebook principal de ejecución.
Por último, seguir los pasos indicados en pipeline_run.ipynb. Algunos parámetros como la cantidad de trials de hiperparámetros o los tipos de máquina deseadas para algunos pasos pueden ser fácilmente modificables.

TO-DO

Agregar costo estimado y permisos necesarios.

English

What is this?

This repo contains an end-to-end pipeline built with the Kubeflow Pipelines (KFP) SDK. With Vertex AI as the main solution, the idea is to build a machine learning architecture that is as automated as possible, integrating some of the main Google Cloud Platform (GCP) services, such as BigQuery (data warehousing), Google Cloud Storage (object storage) and Container Registry (Docker image repository).

How do I run it?

First, run pipeline_setup.ipynb. It sets up the infrastructure the pipeline uses: it creates the BigQuery datasets and GCS buckets and installs the required libraries. It also builds the Docker images used by the hyperparameter tuning jobs and pushes them to Container Registry.
Second, run components_definition.ipynb, located in the components folder, to generate the .yaml files that the main execution notebook invokes.
Last, follow the steps in pipeline_run.ipynb. Some parameters, such as the number of hyperparameter tuning trials or the machine types used for certain steps, can be easily modified.
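
As a rough illustration of the last step, the adjustable run parameters could be grouped and validated before a run is submitted. This is only a sketch, not the code in pipeline_run.ipynb: the class RunParams, the function submit, every field name, the default values and the display name are assumptions made for the example. The only library calls used are aiplatform.init and aiplatform.PipelineJob from the Vertex AI SDK, imported lazily so the helpers also work where the SDK is not installed.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class RunParams:
    """Hypothetical container for the values a user may want to tweak."""

    project_id: str
    region: str
    pipeline_root: str                      # gs:// path for pipeline artifacts
    max_trial_count: int = 10               # total hyperparameter tuning trials
    parallel_trial_count: int = 2           # trials run at the same time
    training_machine_type: str = "n1-standard-4"

    def validate(self) -> None:
        """Fail early on values that would make the run invalid."""
        if not self.pipeline_root.startswith("gs://"):
            raise ValueError("pipeline_root must be a gs:// path")
        if self.max_trial_count < 1:
            raise ValueError("max_trial_count must be at least 1")
        if not 1 <= self.parallel_trial_count <= self.max_trial_count:
            raise ValueError(
                "parallel_trial_count must be between 1 and max_trial_count"
            )

    def parameter_values(self) -> dict:
        """Return the values handed to the pipeline itself.

        The keys are assumptions; they must match the parameter names
        declared by the actual pipeline function.
        """
        self.validate()
        return {
            "max_trial_count": self.max_trial_count,
            "parallel_trial_count": self.parallel_trial_count,
            "training_machine_type": self.training_machine_type,
        }


def submit(params: RunParams, template_path: str) -> None:
    """Submit a compiled pipeline definition to Vertex AI Pipelines."""
    # Lazy import: requires the google-cloud-aiplatform package.
    from google.cloud import aiplatform

    aiplatform.init(project=params.project_id, location=params.region)
    job = aiplatform.PipelineJob(
        display_name="example-run",         # hypothetical name
        template_path=template_path,        # compiled pipeline file
        pipeline_root=params.pipeline_root,
        parameter_values=params.parameter_values(),
    )
    job.run()                               # blocks until the run finishes
```

With this layout, changing the number of trials or a machine type means editing a single RunParams instance, and invalid combinations are rejected before any cloud resources are used.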

To-do

Add estimated cost and required permissions (roles).

Owner

Hernán Escudero
