This is a repository for a semantic segmentation inference API built with the GluonCV computer vision toolkit.

Last update: Nov 24, 2022

Overview

BMW Semantic Segmentation GPU/CPU Inference API


The training GUI for the semantic segmentation workflow, also based on the GluonCV toolkit, will be published soon.

A sample inference model is provided with this repository for testing purposes.

This repository can be deployed using docker.

Note: To use the sample inference model provided with this repository, clone the repository with git clone rather than downloading it as a ZIP archive. The ZIP download contains only the Git LFS pointer, not the actual model file stored on Git LFS.
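To confirm that a clone contains real model weights rather than Git LFS pointers, a small check along the following lines may help. It is only a sketch: it relies on the fact that Git LFS pointer files start with a fixed "version" line, and the "models" directory and the ".params" extension are taken from the model structure described in this README. The function name is hypothetical.

```python
from pathlib import Path

# First line of every Git LFS pointer file, per the Git LFS pointer format.
LFS_POINTER_PREFIX = b"version https://git-lfs.github.com/spec/v1"


def is_lfs_pointer(path):
    """Return True if the file at `path` looks like a Git LFS pointer, not real data."""
    with open(path, "rb") as handle:
        head = handle.read(len(LFS_POINTER_PREFIX))
    return head == LFS_POINTER_PREFIX


if __name__ == "__main__":
    # Assumed location: run from the repository root so that "models" resolves.
    for params in Path("models").rglob("*.params"):
        status = "LFS pointer (run `git lfs pull`)" if is_lfs_pointer(params) else "ok"
        print(f"{params}: {status}")
```

If a file is reported as a pointer, running git lfs pull inside the clone should fetch the actual content.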

Prerequisites

Ubuntu 18.04 or 20.04 LTS
Windows 10 Pro with Hyper-V enabled and Docker Desktop
NVIDIA Drivers (410.x or higher)
Docker CE latest stable release
NVIDIA Docker 2
Git LFS (Large File Storage): see the official installation instructions

Note: The Windows deployment supports only the CPU version, so the NVIDIA drivers and NVIDIA Docker are not required.

Check for prerequisites

To check if you have docker-ce installed:

docker --version

To check if you have nvidia-docker2 installed:

dpkg -l | grep nvidia-docker2

To check your NVIDIA driver version, open a terminal and run the command nvidia-smi
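The checks above can also be scripted. The sketch below only looks up executables on PATH and runs a version command; the helper names are hypothetical, and "git-lfs" is assumed to be the name of the Git LFS executable on your system. It does not verify that nvidia-docker2 is installed.

```python
import shutil
import subprocess


def find_tools(names=("docker", "nvidia-smi", "git-lfs")):
    """Map each executable name to its path on PATH, or None if it is missing."""
    return {name: shutil.which(name) for name in names}


def tool_version(command):
    """Run a version command and return its first output line, or None on failure."""
    try:
        result = subprocess.run(command, capture_output=True, text=True, timeout=10)
    except (OSError, subprocess.TimeoutExpired):
        return None
    lines = result.stdout.strip().splitlines()
    return lines[0] if result.returncode == 0 and lines else None


if __name__ == "__main__":
    for name, path in find_tools().items():
        print(f"{name}: {path or 'not found'}")
    print("docker version:", tool_version(["docker", "--version"]))
```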

Install prerequisites

Use the following command to install docker on Ubuntu:

chmod +x install_prerequisites.sh && source install_prerequisites.sh

Install NVIDIA Drivers (410.x or higher) and NVIDIA Docker for GPU by following the official docs

Build The Docker Image

To build the docker environment, run the following command in the project's directory:

For GPU Build:

docker build -t gluoncv_segmentation_inference_api_gpu -f ./GPU/dockerfile .

For CPU Build:

docker build -t gluoncv_segmentation_inference_api_cpu -f ./CPU/dockerfile .

Behind a proxy

For GPU Build:

docker build --build-arg http_proxy='' --build-arg https_proxy='' -t gluoncv_segmentation_inference_api_gpu -f ./GPU/dockerfile .

For CPU Build:

docker build --build-arg http_proxy='' --build-arg https_proxy='' -t gluoncv_segmentation_inference_api_cpu -f ./CPU/dockerfile .

Run the docker container

To run the inference API, go to the API's directory and run the following:

Using Linux based docker:

For GPU:

docker run --gpus '"device=<GPU numbers separated by commas, e.g. 0,1,2>"' -itv $(pwd)/models:/models -p <port-of-your-choice>:4343 gluoncv_segmentation_inference_api_gpu

For CPU:

docker run -itv $(pwd)/models:/models -p <port-of-your-choice>:4343 gluoncv_segmentation_inference_api_cpu

For Windows

docker run -itv ${PWD}/models:/models -p <port-of-your-choice>:4343 gluoncv_segmentation_inference_api_cpu
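The run commands above differ only in the image name, the optional GPU list, and the host port. As an illustration, the sketch below assembles the same argument list in Python. The function name and its parameters are hypothetical; the image names, the /models mount point, and container port 4343 come from the commands above.

```python
import shlex
from pathlib import Path


def docker_run_command(host_port, models_dir="models", gpus=None):
    """Build the `docker run` argument list for the CPU or GPU image.

    gpus: list of GPU indices (e.g. [0, 1]) to use the GPU image, or None for CPU.
    """
    image = "gluoncv_segmentation_inference_api_gpu" if gpus else "gluoncv_segmentation_inference_api_cpu"
    command = ["docker", "run"]
    if gpus:
        devices = ",".join(str(index) for index in gpus)
        # The device list is passed wrapped in literal double quotes: "device=0,1"
        command += ["--gpus", f'"device={devices}"']
    command += [
        "-it",
        "-v", f"{Path(models_dir).resolve()}:/models",  # host models folder -> /models
        "-p", f"{int(host_port)}:4343",  # host port -> container port 4343
        image,
    ]
    return command


if __name__ == "__main__":
    # Print shell-quoted commands for inspection; nothing is executed here.
    print(shlex.join(docker_run_command(8080)))
    print(shlex.join(docker_run_command(8080, gpus=[0, 1])))
```

The list form can be handed to subprocess.run directly, which avoids the shell-specific differences between $(pwd) and ${PWD}.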

API Endpoints

To see all available endpoints, open your favorite browser and navigate to:

http://<machine_URL>:<Docker_host_port>/docs

The 'predict_batch' endpoint is not shown in the Swagger UI. Input as a list of files is not yet supported.

Endpoints summary

/load (GET)

Loads all available models and returns every model with its hashed value. Loaded models are stored and aren't loaded again.

/detect (POST)

Performs inference on the specified model and image, and returns a JSON file

/get_labels (POST)

Returns all of the specified model labels with their hashed values

/models (GET)

Lists all available models

/models/{model_name}/load (GET)

Loads the specified model. Loaded models are stored and aren't loaded again

/models/{model_name}/predict (POST)

Performs inference on the specified model and image, and returns a JSON file (exactly like /detect)

/models/{model_name}/predict_image (POST)

Performs inference on the specified model and image, and returns the image with transparent segments overlaid on it

/models/{model_name}/inference (POST)

Performs inference on the specified model and image, and returns only the segments (as an image)

/models/{model_name}/labels (GET)

Returns all of the specified model labels

/models/{model_name}/config (GET)

Returns the specified model's configuration
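As a rough illustration of how a client might call these endpoints, here is a minimal sketch using only the Python standard library. The helper names are hypothetical, and the multipart form field name "input_data" is an assumption that this README does not confirm; check the /docs page of your running instance for the actual request schema.

```python
import json
import mimetypes
import uuid
from pathlib import Path
from urllib.parse import quote
from urllib.request import Request, urlopen


def endpoint(base_url, *parts):
    """Join a base URL and path segments, percent-encoding each segment."""
    return "/".join([base_url.rstrip("/")] + [quote(str(p), safe="") for p in parts])


def get_json(url, timeout=30):
    """Send a GET request and decode the JSON response body."""
    with urlopen(url, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))


def encode_multipart(field_name, filename, payload):
    """Encode one file as a multipart/form-data body; returns (body, content_type)."""
    boundary = uuid.uuid4().hex
    mime = mimetypes.guess_type(filename)[0] or "application/octet-stream"
    head = (
        f"--{boundary}\r\n"
        f'Content-Disposition: form-data; name="{field_name}"; filename="{filename}"\r\n'
        f"Content-Type: {mime}\r\n\r\n"
    ).encode("utf-8")
    tail = f"\r\n--{boundary}--\r\n".encode("utf-8")
    return head + payload + tail, f"multipart/form-data; boundary={boundary}"


def predict(base_url, model_name, image_path, field_name="input_data", timeout=60):
    """POST an image to /models/{model_name}/predict and return the decoded JSON.

    field_name is an assumption; the real form field may be named differently.
    """
    image = Path(image_path)
    body, content_type = encode_multipart(field_name, image.name, image.read_bytes())
    request = Request(
        endpoint(base_url, "models", model_name, "predict"),
        data=body,
        headers={"Content-Type": content_type},
        method="POST",
    )
    with urlopen(request, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))
```

With a container listening on, say, host port 8080, get_json(endpoint("http://localhost:8080", "models")) would list the available models, and predict("http://localhost:8080", "<model_name>", "image.jpg") would request a JSON prediction. The host, port, and file name here are placeholders.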

Model structure

The folder "models" contains sub-folders of all the models to be loaded.

You can copy the model sub-folder generated after training (the training GUI will be published soon) into the "models" folder of your inference repository, and you're all set to run inference.

The model sub-folder should contain the following:

model_best.params
palette.txt
configuration.json

If you don't have your own palette, you can generate a random one by running the command below in the project's repository, then copy the resulting palette.txt to your model directory:

python3 generate_random_palette.py

The configuration.json file should look like the following:

{
    "inference_engine_name" : "gluonsegmentation",
    "backbone": "resnet101",
    "batch-size": 4,
    "checkname": "bmwtest",
    "classes": 3,
    "classesname": [
        "background",
        "pad",
        "circle"
    ],
    "network": "fcn",
    "type":"segmentation",
    "epochs": 10,
    "lr": 0.001,
    "momentum": 0.9,
    "num_workers": 4,
    "weight-decay": 0.0001
}
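Before starting the container, it can be useful to sanity-check a model sub-folder against the structure described above. The sketch below checks for the three expected files and parses configuration.json. The comparison between "classes" and the length of "classesname" is an assumption based on the sample configuration, not a rule this README states, and the function name is hypothetical.

```python
import json
from pathlib import Path

# File names listed in the "Model structure" section.
REQUIRED_FILES = ("model_best.params", "palette.txt", "configuration.json")


def check_model_folder(folder):
    """Return a list of problems found in a model sub-folder (empty means none found)."""
    folder = Path(folder)
    problems = [f"missing file: {name}" for name in REQUIRED_FILES if not (folder / name).is_file()]
    config_path = folder / "configuration.json"
    if config_path.is_file():
        try:
            config = json.loads(config_path.read_text(encoding="utf-8"))
        except json.JSONDecodeError as error:
            return problems + [f"configuration.json is not valid JSON: {error}"]
        names = config.get("classesname", [])
        # Assumed consistency rule: one name per class, as in the sample configuration.
        if config.get("classes") != len(names):
            problems.append(
                f"'classes' is {config.get('classes')} but 'classesname' lists {len(names)} names"
            )
    return problems


if __name__ == "__main__":
    # Assumed layout: run from the repository root, one sub-folder per model.
    models_root = Path("models")
    if models_root.is_dir():
        for model_dir in sorted(p for p in models_root.iterdir() if p.is_dir()):
            issues = check_model_folder(model_dir)
            print(f"{model_dir.name}: {'ok' if not issues else '; '.join(issues)}")
```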

Acknowledgements

Roy Anwar, Beirut, Lebanon
Hadi Koubeissy, inmind.ai, Beirut, Lebanon
