CVNets: A library for training computer vision networks

This repository contains the source code for training computer vision models. Specifically, it contains the source code of the MobileViT paper for the following tasks:

Image classification on the ImageNet dataset
Object detection using SSD
Semantic segmentation using Deeplabv3

Note: Any image classification backbone can be used with object detection and semantic segmentation models

Training can be done with two samplers:

Standard distributed sampler
Mulit-scale distributed sampler

We recommend to use multi-scale sampler as it improves generalization capability and leads to better performance. See MobileViT for details.

Installation

CVNets can be installed in the local python environment using the below command:

    git clone [email protected]:apple/ml-cvnets.git
    cd ml-cvnets
    pip install -r requirements.txt
    pip install --editable .

We recommend to use Python 3.6+ and PyTorch (version >= v1.8.0) with conda environment. For setting-up python environment with conda, see here.

Getting Started

General instructions for training and evaluation different models are given here.
Examples for a training and evaluating a specific model are provided in the examples folder. Right now, we support following models.
For converting PyTorch models to CoreML, see README-pytorch-to-coreml.md.

Citation

If you find our work useful, please cite the following paper:

@article{mehta2021mobilevit,
  title={MobileViT: Light-weight, General-purpose, and Mobile-friendly Vision Transformer},
  author={Mehta, Sachin and Rastegari, Mohammad},
  journal={arXiv preprint arXiv:2110.02178},
  year={2021}
}

CVNets: A library for training computer vision networks

Related tags

Overview

CVNets: A library for training computer vision networks

Installation

Getting Started

Citation

Owner

Apple

Speckle-free Holography with Partially Coherent Light Sources and Camera-in-the-loop Calibration

CLOOB training (JAX) and inference (JAX and PyTorch)

This program presents convolutional kernel density estimation, a method used to detect intercritical epilpetic spikes (IEDs)

Temporal Segment Networks (TSN) in PyTorch

All the code and files related to the MI-Lab of UE19CS305 course in sem 5

Collection of Docker images for ML/DL and video processing projects

Pre-Training Graph Neural Networks for Cold-Start Users and Items Representation.

Python scripts for performing object detection with the 1000 labels of the ImageNet dataset in ONNX.

CS50x-AI - Artificial Intelligence with Python from Harvard University

Team nan solution repository for FPT data-centric competition. Data augmentation, Albumentation, Mosaic, Visualization, KNN application

An Api for Emotion recognition.

[ WSDM '22 ] On Sampling Collaborative Filtering Datasets

Lightweight, Portable, Flexible Distributed/Mobile Deep Learning with Dynamic, Mutation-aware Dataflow Dep Scheduler; for Python, R, Julia, Scala, Go, Javascript and more

WatermarkRemoval-WDNet-WACV2021

EASY - Ensemble Augmented-Shot Y-shaped Learning: State-Of-The-Art Few-Shot Classification with Simple Ingredients.

A Simple and Versatile Framework for Object Detection and Instance Recognition

Fast EMD for Python: a wrapper for Pele and Werman's C++ implementation of the Earth Mover's Distance metric

Planner_backend - Academic planner application designed for students and counselors.

Semantic-aware Grad-GAN for Virtual-to-Real Urban Scene Adaption

Tutorial: Introduction to Graph Machine Learning, with Jupyter notebooks