PyTorch implementation of ENet

Last update: Dec 29, 2022

Overview

PyTorch-ENet

PyTorch (v1.1.0) implementation of ENet: A Deep Neural Network Architecture for Real-Time Semantic Segmentation, ported from the lua-torch implementation ENet-training created by the authors.

This implementation has been tested on the CamVid and Cityscapes datasets. Currently, a pre-trained version of the model trained in CamVid and Cityscapes is available here.

Dataset	Classes ¹	Input resolution	Batch size	Epochs	Mean IoU (%)	GPU memory (GiB)	Training time (hours)²
CamVid	11	480x360	10	300	52.1³	4.2	1
Cityscapes	19	1024x512	4	300	59.5⁴	5.4	20

¹ When referring to the number of classes, the void/unlabeled class is always excluded.
² These are just for reference. Implementation, datasets, and hardware changes can lead to very different results. Reference hardware: Nvidia GTX 1070 and an AMD Ryzen 5 3600 3.6GHz. You can also train for 100 epochs or so and get similar mean IoU (± 2%).
³ Test set.
⁴ Validation set.

Installation

Local pip

Python 3 and pip
Set up a virtual environment (optional, but recommended)
Install dependencies using pip: pip install -r requirements.txt

Docker image

Build the image: docker build -t enet .
Run: docker run -it --gpus all --ipc host enet

Usage

Run main.py, the main script file used for training and/or testing the model. The following options are supported:

python main.py [-h] [--mode {train,test,full}] [--resume]
               [--batch-size BATCH_SIZE] [--epochs EPOCHS]
               [--learning-rate LEARNING_RATE] [--lr-decay LR_DECAY]
               [--lr-decay-epochs LR_DECAY_EPOCHS]
               [--weight-decay WEIGHT_DECAY] [--dataset {camvid,cityscapes}]
               [--dataset-dir DATASET_DIR] [--height HEIGHT] [--width WIDTH]
               [--weighing {enet,mfb,none}] [--with-unlabeled]
               [--workers WORKERS] [--print-step] [--imshow-batch]
               [--device DEVICE] [--name NAME] [--save-dir SAVE_DIR]

For help on the optional arguments run: python main.py -h

Examples: Training

python main.py -m train --save-dir save/folder/ --name model_name --dataset name --dataset-dir path/root_directory/

Examples: Resuming training

python main.py -m train --resume True --save-dir save/folder/ --name model_name --dataset name --dataset-dir path/root_directory/

Examples: Testing

python main.py -m test --save-dir save/folder/ --name model_name --dataset name --dataset-dir path/root_directory/

Project structure

Folders

data: Contains instructions on how to download the datasets and the code that handles data loading.
metric: Evaluation-related metrics.
models: ENet model definition.
save: By default, main.py will save models in this folder. The pre-trained models can also be found here.

Files

args.py: Contains all command-line options.
main.py: Main script file used for training and/or testing the model.
test.py: Defines the Test class which is responsible for testing the model.
train.py: Defines the Train class which is responsible for training the model.
transforms.py: Defines image transformations to convert an RGB image encoding classes to a torch.LongTensor and vice versa.

PyTorch implementation of ENet

Related tags

Overview

PyTorch-ENet

Installation

Local pip

Docker image

Usage

Examples: Training

Examples: Resuming training

Examples: Testing

Project structure

Folders

Files

Owner

David Silva

[ICCV'2021] "SSH: A Self-Supervised Framework for Image Harmonization", Yifan Jiang, He Zhang, Jianming Zhang, Yilin Wang, Zhe Lin, Kalyan Sunkavalli, Simon Chen, Sohrab Amirghodsi, Sarah Kong, Zhangyang Wang

ZeroGen: Efficient Zero-shot Learning via Dataset Generation

DECAF: Deep Extreme Classification with Label Features

Code for Active Learning at The ImageNet Scale.

Base pretrained models and datasets in pytorch (MNIST, SVHN, CIFAR10, CIFAR100, STL10, AlexNet, VGG16, VGG19, ResNet, Inception, SqueezeNet)

Code from Daniel Lemire, A Better Alternative to Piecewise Linear Time Series Segmentation

This is the code for "HyperNeRF: A Higher-Dimensional Representation for Topologically Varying Neural Radiance Fields".

Code for reproducing our paper: LMSOC: An Approach for Socially Sensitive Pretraining

A novel framework to automatically learn high-quality scanning of non-planar, complex anisotropic appearance.

Accurate 3D Face Reconstruction with Weakly-Supervised Learning: From Single Image to Image Set (CVPRW 2019). A PyTorch implementation.

Bilinear attention networks for visual question answering

ReGAN: Sequence GAN using RE[INFORCE|LAX|BAR] based PG estimators

List some popular DeepFake models e.g. DeepFake, FaceSwap-MarekKowal, IPGAN, FaceShifter, FaceSwap-Nirkin, FSGAN, SimSwap, CihaNet, etc.

MMdet2-based reposity about lightweight detection model: Nanodet, PicoDet.

TensorFlow implementation of Barlow Twins (Barlow Twins: Self-Supervised Learning via Redundancy Reduction)

Codebase for Attentive Neural Hawkes Process (A-NHP) and Attentive Neural Datalog Through Time (A-NDTT)

Metrics to evaluate quality and efficacy of synthetic datasets.

GUPNet - Geometry Uncertainty Projection Network for Monocular 3D Object Detection

A python library for implementing a recommender system