Detectorch - detectron for PyTorch

Last update: Dec 23, 2022

Overview

(Disclaimer: this is a work in progress and does not offer all the functionality of Detectron. Currently only inference and evaluation are supported, not training.) (News: FPN and ResNet-101 are now supported!)

This code lets you use some of the Detectron object detection models from Facebook AI Research with PyTorch.

It currently supports:

Fast R-CNN
Faster R-CNN
Mask R-CNN

ResNet-50 and ResNet-101 backbones are supported, with or without FPN. The pre-trained models from caffe2 can be imported and used in PyTorch.
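As a rough illustration of what importing a caffe2 model involves, the sketch below reads a Detectron weights file into a plain dictionary. It is not this repo's loader: the function name load_detectron_blobs is hypothetical, and the file layout (a pickle with a top-level "blobs" key and optimizer entries named "<name>_momentum") is an assumption about Detectron checkpoints. Reading a real file also requires NumPy to be installed.

```python
import pickle


def load_detectron_blobs(path):
    """Read a Detectron .pkl weights file and return a name -> array dict."""
    with open(path, "rb") as f:
        # Detectron checkpoints were written under Python 2; latin1 decoding
        # lets Python 3 unpickle the byte strings and arrays they contain.
        data = pickle.load(f, encoding="latin1")
    # Assumption: the weights sit under a top-level "blobs" key when present.
    blobs = data["blobs"] if "blobs" in data else data
    # Assumption: optimizer state is stored as "<name>_momentum"; skip it.
    return {k: v for k, v in blobs.items() if not k.endswith("_momentum")}
```

Each returned array could then be wrapped with torch.from_numpy and copied into the matching PyTorch parameter. The name mapping between caffe2 blobs and PyTorch layers is model-specific and is not shown here.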

Example Mask R-CNN with ResNet-101 and FPN.

Evaluation

Both bounding box evaluation and instance segmentation evaluation were tested, yielding the same results as the Detectron caffe2 models. The results below were computed using the PyTorch code:

Model                         box AP   mask AP   model id
fast_rcnn_R-50-C4_2x          35.6     -         36224046
fast_rcnn_R-50-FPN_2x         36.8     -         36225249
e2e_faster_rcnn_R-50-C4_2x    36.5     -         35857281
e2e_faster_rcnn_R-50-FPN_2x   37.9     -         35857389
e2e_mask_rcnn_R-50-C4_2x      37.8     32.8      35858828
e2e_mask_rcnn_R-50-FPN_2x     38.6     34.5      35859007
e2e_mask_rcnn_R-101-FPN_2x    40.9     36.4      35861858
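The AP columns are assumed here to follow the standard COCO protocol, which matches detections to ground truth by intersection over union (IoU) and averages precision over IoU thresholds from 0.50 to 0.95. The sketch below shows only the box IoU step, for boxes in (x1, y1, x2, y2) corner format. The actual evaluation is done by the Coco API, not by this function, and box_iou is a name chosen for illustration.

```python
def box_iou(a, b):
    """IoU of two boxes given as (x1, y1, x2, y2) with x2 > x1 and y2 > y1."""
    # Corners of the intersection rectangle
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    # Clamp to zero when the boxes do not overlap
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    return inter / (area_a + area_b - inter)
```

For example, two 10x10 boxes offset horizontally by 5 overlap in a 5x10 region, so box_iou((0, 0, 10, 10), (5, 0, 15, 10)) is 50 / 150, or one third. That pair would count as a match at none of the COCO thresholds, since the lowest is 0.50.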

Training

Training code is experimental. See train_fast.py for training Fast R-CNN. It appears to work, but it is slow.

Installation

First, clone the repo with git clone --recursive https://github.com/ignacio-rocco/detectorch so that the Coco API is cloned as well.

The code works with PyTorch 0.3.1 or PyTorch 0.4 (master) under Python 3. Anaconda is recommended. Other required packages:

torchvision (conda install torchvision -c soumith)
opencv (conda install -c conda-forge opencv)
cython (conda install cython)
matplotlib (conda install matplotlib)
scikit-image (conda install scikit-image)
ninja (conda install ninja) (required for PyTorch 0.4 only)
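Before building the extensions, it can help to confirm that the packages listed above are visible to the active Python environment. The following is a minimal sketch using only the standard library. The import names in REQUIRED (for example cv2 for opencv and skimage for scikit-image) are the usual ones but are stated here as assumptions, and the helper names are hypothetical.

```python
import importlib.util
import shutil

# Package name -> Python import name (assumed; adjust if your install differs)
REQUIRED = {
    "torch": "torch",
    "torchvision": "torchvision",
    "opencv": "cv2",
    "cython": "Cython",
    "matplotlib": "matplotlib",
    "scikit-image": "skimage",
}


def missing_packages(required=REQUIRED):
    """Return the sorted names of packages whose module cannot be found."""
    return sorted(
        pkg for pkg, mod in required.items()
        if importlib.util.find_spec(mod) is None
    )


def has_ninja():
    """ninja is a build tool, so look for the executable on PATH instead."""
    return shutil.which("ninja") is not None
```

Calling missing_packages() returns an empty list when everything is importable. has_ninja() only matters for PyTorch 0.4.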

Additionally, you need to build the Coco API and RoIAlign layer. See below.

Compiling the Coco API

If you cloned this repo with git clone --recursive, you should also have the cocoapi in lib/cocoapi. Compile it with:

cd lib/cocoapi/PythonAPI
make install

Compiling RoIAlign

The RoIAlign layer was converted from the caffe2 version. There are two implementations, one for each supported PyTorch version:

PyTorch 0.4: RoIAlign using the ATen library (lib/cppcuda). It is JIT-compiled when loaded.
PyTorch 0.3.1: RoIAlign using TH/THC and cffi (lib/cppcuda_cffi). Needs to be compiled with:

cd lib/cppcuda_cffi
./make.sh
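The choice between the two RoIAlign implementations can be summarized as a small lookup on the PyTorch version string. This is only a sketch of the rule described above, not code from the repo. The function name roialign_backend is hypothetical, and versions other than 0.3 and 0.4 are rejected because the text does not cover them.

```python
def roialign_backend(torch_version):
    """Return the RoIAlign source directory for a PyTorch version string."""
    # Keep only major.minor, e.g. "0.3.1" -> (0, 3)
    major, minor = (int(p) for p in torch_version.split(".")[:2])
    if (major, minor) == (0, 4):
        return "lib/cppcuda"       # ATen version, JIT-compiled when loaded
    if (major, minor) == (0, 3):
        return "lib/cppcuda_cffi"  # TH/THC + cffi, built with ./make.sh
    raise ValueError("unsupported PyTorch version: " + torch_version)
```

In practice the argument would come from torch.__version__, so roialign_backend("0.3.1") points at the directory that needs the manual ./make.sh step.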

Quick Start

Check the demo notebook.

Owner

Ignacio Rocco
