ICNet for Real-Time Semantic Segmentation on High-Resolution Images, ECCV2018

Last update: Dec 31, 2022

Overview

ICNet for Real-Time Semantic Segmentation on High-Resolution Images

by Hengshuang Zhao, Xiaojuan Qi, Xiaoyong Shen, Jianping Shi, and Jiaya Jia. Details are on the project page.

Introduction

Based on PSPNet, this repository is built for evaluating ICNet. For installation, please follow the instructions in the PSPNet repository (which supports CUDA 7.0/7.5 + cuDNN v4).

Usage

Clone the repository recursively:

git clone --recursive https://github.com/hszhao/ICNet.git

Build Caffe and matcaffe:

cd $ICNET_ROOT/PSPNet
cp Makefile.config.example Makefile.config
vim Makefile.config
make -j8 && make matcaffe
cd ..

Evaluation mIoU:
- Evaluation code is in folder 'evaluation'.
- Download the trained models and put them in folder 'evaluation/model':
  - icnet_cityscapes_train_30k.caffemodel: GoogleDrive
    (31M, md5: c7038630c4b6c869afaaadd811bdb539; trained on the training set for 30k iterations)
  - icnet_cityscapes_trainval_90k.caffemodel: GoogleDrive
    (31M, md5: 4f4dd9eecd465dd8de7e4cf88ba5d5d5; trained on the train+val set for 90k iterations)
- Modify the related paths in 'eval_all.m':
  - The main variables to change are 'data_root' and 'eval_list'. If you use this evaluation code structure, your image list should follow the format of the lists in folder 'evaluation/samplelist'.
```
cd evaluation
vim eval_all.m
```
- Run the evaluation script:
```
./run.sh
```
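
Before running the evaluation, it can help to confirm that the downloaded models match the md5 values listed above. The following is a minimal sketch in Python rather than part of the repository; the helper names `md5_of_file` and `verify_model` are hypothetical, and the paths assume the models were placed in 'evaluation/model' as described.

```python
import hashlib
import os

# Checksums copied from the model list above.
EXPECTED_MD5 = {
    "icnet_cityscapes_train_30k.caffemodel": "c7038630c4b6c869afaaadd811bdb539",
    "icnet_cityscapes_trainval_90k.caffemodel": "4f4dd9eecd465dd8de7e4cf88ba5d5d5",
}


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex md5 digest of a file, reading it in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as f:
        for chunk in iter(lambda: f.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_model(path):
    """Return True if the file at `path` matches the checksum listed for its name."""
    expected = EXPECTED_MD5[os.path.basename(path)]
    return md5_of_file(path) == expected
```

For example, `verify_model("evaluation/model/icnet_cityscapes_train_30k.caffemodel")` should return True for an intact download.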
Evaluation time:
- To measure inference time as accurately as possible, make sure the GPU whose ID is specified in 'test_time.sh' is idle (no other processes running on it).
- Run the timing script:
```
./test_time.sh
```
Results:
- Prediction results are written to folder 'evaluation/mc_result'. The expected scores are:
  - ICNet trained on the training set for 30K iterations, evaluated on the validation set (mIoU/pAcc): 67.7/94.5
  - ICNet trained on the train+val set for 90K iterations, evaluated on the test set (mIoU): 69.5
- Inference-time logs are written to file 'time.log'; expect approximately 33 to 36 ms on a TitanX.
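
For readers unfamiliar with the two reported metrics, the sketch below shows how mIoU and pixel accuracy (pAcc) are commonly computed from a confusion matrix. It is an illustration in Python, not the repository's MATLAB evaluation code, which may differ in detail. The function names are hypothetical, and the ignore label of 255 is an assumption based on the usual Cityscapes convention.

```python
def confusion_matrix(gt, pred, num_classes, ignore_label=255):
    """Count (ground truth, prediction) pairs over flat label sequences.

    Pixels whose ground-truth label equals `ignore_label` are skipped
    (255 is an assumed convention, not taken from the repository).
    """
    conf = [[0] * num_classes for _ in range(num_classes)]
    for g, p in zip(gt, pred):
        if g == ignore_label:
            continue
        conf[g][p] += 1
    return conf


def miou_and_pacc(conf):
    """Return (mean IoU, pixel accuracy) for a square confusion matrix."""
    n = len(conf)
    total = sum(sum(row) for row in conf)
    correct = sum(conf[i][i] for i in range(n))
    ious = []
    for i in range(n):
        tp = conf[i][i]
        gt_count = sum(conf[i])                          # pixels labelled i
        pred_count = sum(conf[r][i] for r in range(n))   # pixels predicted i
        union = gt_count + pred_count - tp
        if union > 0:  # classes absent from both gt and pred are skipped
            ious.append(tp / union)
    miou = sum(ious) / len(ious) if ious else 0.0
    pacc = correct / total if total else 0.0
    return miou, pacc
```

In practice the confusion matrix is accumulated over all evaluation images before the two scores are computed, so that every pixel contributes equally.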
Demo video:
- Video processed by ICNet on the Cityscapes dataset:
  - Alpha blending with a value of 0.5: Video
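
Alpha blending overlays the colorized prediction on the input frame as a weighted sum of the two. The sketch below is a pure-Python illustration on lists of RGB tuples, not the code used to produce the video; the function name `alpha_blend` is hypothetical, and applying alpha to the overlay is an assumption (at 0.5 both choices give the same result).

```python
def alpha_blend(image, overlay, alpha=0.5):
    """Blend two equally sized lists of RGB pixels.

    Each output channel is alpha * overlay + (1 - alpha) * image,
    rounded to the nearest integer.
    """
    if len(image) != len(overlay):
        raise ValueError("image and overlay must have the same number of pixels")
    blended = []
    for px_img, px_ov in zip(image, overlay):
        blended.append(tuple(
            int(round(alpha * o + (1.0 - alpha) * i))
            for i, o in zip(px_img, px_ov)
        ))
    return blended
```

With alpha set to 0.5, a gray pixel (100, 100, 100) blended with an overlay color (200, 0, 50) becomes (150, 50, 75).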

Citation

If ICNet is useful for your research, please consider citing:

@inproceedings{zhao2018icnet,
  title={ICNet for Real-Time Semantic Segmentation on High-Resolution Images},
  author={Zhao, Hengshuang and Qi, Xiaojuan and Shen, Xiaoyong and Shi, Jianping and Jia, Jiaya},
  booktitle={ECCV},
  year={2018}
}

Questions

Please contact '[email protected]'

Owner

Hengshuang Zhao
