EfficientDet: Scalable and Efficient Object Detection, in PyTorch

A PyTorch implementation of EfficientDet from the 2019 paper by Mingxing Tan Ruoming Pang Quoc V. Le Google Research, Brain Team. The official and original: comming soon.

Fun with Demo:

python demo.py --weight ./checkpoint_VOC_efficientdet-d1_97.pth --threshold 0.6 --iou_threshold 0.5 --cam --score

Recent Update
Benchmarking
Installation
Installation
Prerequisites
Datasets
Train
Evaluate
Performance
Demo
Future Work
Reference

Recent Update

[06/01/2020] Support both DistributedDataParallel and DataParallel, change augmentation, eval_voc
[17/12/2019] Add Fast normalized fusion, Augmentation with Ratio, Change RetinaHead, Fix Support EfficientDet-D0->D7
[7/12/2019] Support EfficientDet-D0, EfficientDet-D1, EfficientDet-D2, EfficientDet-D3, EfficientDet-D4,... . Support change gradient accumulation steps, AdamW.

Benchmarking

We benchmark our code thoroughly on three datasets: pascal voc and coco, using family efficientnet different network architectures: EfficientDet-D0->7. Below are the results:

1). PASCAL VOC 2007 (Train/Test: 07trainval/07test, scale=600, ROI Align)

model	mAP
[EfficientDet-D0(with Weight)](https://drive.google.com/file/d/1r7MAyBfG5OK_9F_cU8yActUWxTHOuOpL/view?usp=sharing	62.16

Installation

Install PyTorch by selecting your environment on the website and running the appropriate command.
Clone this repository and install package prerequisites below.
Then download the dataset by following the instructions below.
Note: For training, we currently support VOC and COCO, and aim to add ImageNet support soon.

prerequisites

Python 3.6+
PyTorch 1.3+
Torchvision 0.4.0+ (We need high version because Torchvision support nms now.)
requirements.txt

Datasets

To make things easy, we provide bash scripts to handle the dataset downloads and setup for you. We also provide simple dataset loaders that inherit torch.utils.data.Dataset, making them fully compatible with the torchvision.datasets API.

VOC Dataset

PASCAL VOC: Visual Object Classes

Download VOC2007 + VOC2012 trainval & test

# specify a directory for dataset to be downloaded into, else default is ~/data/
sh datasets/scripts/VOC2007.sh
sh datasets/scripts/VOC2012.sh

COCO

Microsoft COCO: Common Objects in Context

Download COCO 2017

# specify a directory for dataset to be downloaded into, else default is ~/data/
sh datasets/scripts/COCO2017.sh

Training EfficientDet

To train EfficientDet using the train script simply specify the parameters listed in train.py as a flag or manually change them.

python train.py --network effcientdet-d0  # Example

With VOC Dataset:

# DataParallel
python train.py --dataset VOC --dataset_root /root/data/VOCdevkit/ --network effcientdet-d0 --batch_size 32 
# DistributedDataParallel with backend nccl
python train.py --dataset VOC --dataset_root /root/data/VOCdevkit/ --network effcientdet-d0 --batch_size 32 --multiprocessing-distributed

With COCO Dataset:

# DataParallel
python train.py --dataset COCO --dataset_root ~/data/coco/ --network effcientdet-d0 --batch_size 32
# DistributedDataParallel with backend nccl
python train.py --dataset COCO --dataset_root ~/data/coco/ --network effcientdet-d0 --batch_size 32 --multiprocessing-distributed

Evaluation

To evaluate a trained network:

With VOC Dataset:

python eval_voc.py --dataset_root ~/data/VOCdevkit --weight ./checkpoint_VOC_efficientdet-d0_261.pth

With COCO Dataset comming soon.

Demo

python demo.py --threshold 0.5 --iou_threshold 0.5 --score --weight checkpoint_VOC_efficientdet-d1_34.pth --file_name demo.png

Output:

Webcam Demo

You can use a webcam in a real-time demo by running:

python demo.py --threshold 0.5 --iou_threshold 0.5 --cam --score --weight checkpoint_VOC_efficientdet-d1_34.pth

Performance

TODO

We have accumulated the following to-do list, which we hope to complete in the near future

Still to come:
- EfficientDet-[D0-7]
- GPU-Parallel
- NMS
- Soft-NMS
- Pretrained model
- Demo
- Model zoo
- TorchScript
- Mobile
- C++ Onnx

Authors

Toan Dao Minh

Note: Unfortunately, this is just a hobby of ours and not a full-time job, so we'll do our best to keep things up to date, but no guarantees. That being said, thanks to everyone for your continued help and feedback as it is really appreciated. We will try to address everything as soon as possible.

References

tanmingxing, rpang, qvl, et al. "EfficientDet: Scalable and Efficient Object Detection." EfficientDet.
A list of other great EfficientDet ports that were sources of inspiration:

Citation

@article{efficientdetpytoan,
    Author = {Toan Dao Minh},
    Title = {A Pytorch Implementation of EfficientDet Object Detection},
    Journal = {github.com/toandaominh1997/EfficientDet.Pytorch},
    Year = {2019}
}

Implementation EfficientDet: Scalable and Efficient Object Detection in PyTorch

Related tags

Overview

EfficientDet: Scalable and Efficient Object Detection, in PyTorch

Fun with Demo:

Table of Contents

Recent Update

Benchmarking

Installation

prerequisites

Datasets

VOC Dataset

Download VOC2007 + VOC2012 trainval & test

COCO

Download COCO 2017

Training EfficientDet

Evaluation

Demo

Webcam Demo

Performance

TODO

Authors

References

Citation

Owner

tonne

GND-Nets (Graph Neural Diffusion Networks) in TensorFlow.

BERTMap: A BERT-Based Ontology Alignment System

Code for a seq2seq architecture with Bahdanau attention designed to map stereotactic EEG data from human brains to spectrograms, using the PyTorch Lightning.

Pytorch implementation of SenFormer: Efficient Self-Ensemble Framework for Semantic Segmentation

[CVPR22] Official codebase of Semantic Segmentation by Early Region Proxy.

Intel® Neural Compressor is an open-source Python library running on Intel CPUs and GPUs

The "breathing k-means" algorithm with datasets and example notebooks

[Preprint] "Chasing Sparsity in Vision Transformers: An End-to-End Exploration" by Tianlong Chen, Yu Cheng, Zhe Gan, Lu Yuan, Lei Zhang, Zhangyang Wang

An open source AutoML toolkit for automate machine learning lifecycle, including feature engineering, neural architecture search, model compression and hyper-parameter tuning.

Code and dataset for ACL2018 paper "Exploiting Document Knowledge for Aspect-level Sentiment Classification"

[ICML 2021] Break-It-Fix-It: Learning to Repair Programs from Unlabeled Data

DeepFill v1/v2 with Contextual Attention and Gated Convolution, CVPR 2018, and ICCV 2019 Oral

This is a Pytorch implementation of paper: DropEdge: Towards Deep Graph Convolutional Networks on Node Classification

Official page of Struct-MDC (RA-L'22 with IROS'22 option); Depth completion from Visual-SLAM using point & line features

Implements the training, testing and editing tools for "Pluralistic Image Completion"

The implementation of the algorithm in the paper "Safe Deep Semi-Supervised Learning for Unseen-Class Unlabeled Data" published in ICML 2020.

The pytorch implementation of SOKD (BMVC2021).

An essential implementation of BYOL in PyTorch + PyTorch Lightning

A dead simple python wrapper for darknet that works with OpenCV 4.1, CUDA 10.1

Image-to-Image Translation with Conditional Adversarial Networks (Pix2pix) implementation in keras