An implementation of RetinaNet in PyTorch.

Last update: Jan 04, 2023

Overview

RetinaNet

An implementation of RetinaNet in PyTorch.

Installation
Training
Evaluation
Todo
Credits

Installation

Install PyTorch and torchvision.
For faster data augmentation, install pillow-simd:

pip uninstall -y pillow
pip install pillow-simd

Training

COCO 2017

First, install pycocotools:

git clone https://github.com/pdollar/coco/
cd coco/PythonAPI
make
python setup.py install
cd ../..
rm -r coco

Then download COCO 2017 into ./datasets/COCO/:

cd datasets
mkdir COCO
cd COCO

If your using wget:

wget http://images.cocodataset.org/zips/train2017.zip &&
wget http://images.cocodataset.org/zips/val2017.zip &&
wget http://images.cocodataset.org/annotations/annotations_trainval2017.zip

If your using aria2c (recommended on for higher bandwidth connections and for allowing resumption of the download. Tune the number of max concurrent downloads (-j) and max connections per server (-x) as needed:

aria2c -x 10 -j 10 http://images.cocodataset.org/zips/train2017.zip &&
aria2c -x 10 -j 10 http://images.cocodataset.org/zips/val2017.zip &&
aria2c -x 10 -j 10 http://images.cocodataset.org/annotations/annotations_trainval2017.zip

unzip *.zip
rm *.zip

Then just run:

python train_coco.py

Pascal VOC

cd datasets
mkdir VOC
cd VOC

wget http://host.robots.ox.ac.uk/pascal/VOC/voc2007/VOCtrainval_06-Nov-2007.tar &&
wget http://host.robots.ox.ac.uk/pascal/VOC/voc2012/VOCtrainval_11-May-2012.tar &&
wget http://host.robots.ox.ac.uk/pascal/VOC/voc2007/VOCtest_06-Nov-2007.tar

aria2c -x 10 -j 10 http://host.robots.ox.ac.uk/pascal/VOC/voc2007/VOCtrainval_06-Nov-2007.tar &&
aria2c -x 10 -j 10 http://host.robots.ox.ac.uk/pascal/VOC/voc2012/VOCtrainval_11-May-2012.tar &&
aria2c -x 10 -j 10 http://host.robots.ox.ac.uk/pascal/VOC/voc2007/VOCtest_06-Nov-2007.tar

tar xf *.tar
rm *.tar

Then just run:

python train_voc.py

Custom Dataset

Lots to write here. 😉

Evaluation

To evaluate an image on a trained model:

python eval.py [checkpoint_path] [image_path]

This will create an image (output.jpg) with bounding box annotations.

Todo

Finish converting the COCO dataset class to work with batches.
Train COCO 2017 for 90,000 iterations and save a reusable checkpoint.
Try training on Pascal VOC and add download instructions.
Produce bounding box outputs for a few sanity check images.
Upload trained weights to Github releases.
Train on the 🔮 magic proprietary dataset ✨ .

An implementation of RetinaNet in PyTorch.

Related tags

Overview

RetinaNet

Installation

Training

COCO 2017

Pascal VOC

Custom Dataset

Evaluation

Todo

Credits

Owner

Conner Vercellino

Meta graph convolutional neural network-assisted resilient swarm communications

Implementation of OmniNet, Omnidirectional Representations from Transformers, in Pytorch

MinHash, LSH, LSH Forest, Weighted MinHash, HyperLogLog, HyperLogLog++, LSH Ensemble

Computationally Efficient Optimization of Plackett-Luce Ranking Models for Relevance and Fairness

免费获取http代理并生成proxifier配置文件

Official Implementation for Fast Training of Neural Lumigraph Representations using Meta Learning.

[ICCV 2021] Relaxed Transformer Decoders for Direct Action Proposal Generation

Implementation for the IJCAI2021 work "Beyond the Spectrum: Detecting Deepfakes via Re-synthesis"

Unified learning approach for egocentric hand gesture recognition and fingertip detection

MADT: Offline Pre-trained Multi-Agent Decision Transformer

Official PyTorch Implementation of Mask-aware IoU and maYOLACT Detector [BMVC2021]

Learning RGB-D Feature Embeddings for Unseen Object Instance Segmentation

The first public PyTorch implementation of Attentive Recurrent Comparators

Training DALL-E with volunteers from all over the Internet using hivemind and dalle-pytorch (NeurIPS 2021 demo)

PyTorch ,ONNX and TensorRT implementation of YOLOv4

Acoustic mosquito detection code with Bayesian Neural Networks

[ACMMM 2021, Oral] Code release for "Elastic Tactile Simulation Towards Tactile-Visual Perception"

PyTorch EO aims to make Deep Learning for Earth Observation data easy and accessible to real-world cases and research alike.

A large-scale video dataset for the training and evaluation of 3D human pose estimation models

Deep Reinforced Attention Regression for Partial Sketch Based Image Retrieval.