ICNet for Real-Time Semantic Segmentation on High-Resolution Images, ECCV2018

Last update: Dec 31, 2022

Overview

ICNet for Real-Time Semantic Segmentation on High-Resolution Images

by Hengshuang Zhao, Xiaojuan Qi, Xiaoyong Shen, Jianping Shi, and Jiaya Jia. Details are on the project page.

Introduction

This repository is built on PSPNet and is intended for evaluating ICNet. For installation, follow the instructions in the PSPNet repository, which supports CUDA 7.0/7.5 with cuDNN v4.

Usage

Clone the repository recursively:

git clone --recursive https://github.com/hszhao/ICNet.git

Build Caffe and matcaffe:

cd $ICNET_ROOT/PSPNet
cp Makefile.config.example Makefile.config
vim Makefile.config
make -j8 && make matcaffe
cd ..

Evaluation mIoU:
- Evaluation code is in folder 'evaluation'.
- Download the trained models and put them in folder 'evaluation/model':
  - icnet_cityscapes_train_30k.caffemodel: GoogleDrive
    
    (31M, md5: c7038630c4b6c869afaaadd811bdb539; trained on the training set for 30k iterations)
  - icnet_cityscapes_trainval_90k.caffemodel: GoogleDrive
    
    (31M, md5: 4f4dd9eecd465dd8de7e4cf88ba5d5d5; trained on the trainval set for 90k iterations)
- Modify the related paths in 'eval_all.m':
  - The main variables to change are 'data_root' and 'eval_list'. If you keep this evaluation code structure, your image list should follow the format of the lists in folder 'evaluation/samplelist'.
```
cd evaluation
vim eval_all.m
```
- Run the evaluation scripts:
```
./run.sh
```
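Before running the evaluation, it may be worth checking the downloaded models against the md5 values listed above. The following is a minimal Python sketch and is not part of the repository: the helper names `md5_of_file` and `check_models` are hypothetical, and the default directory assumes the script is run from `$ICNET_ROOT`.

```python
import hashlib
from pathlib import Path

# Expected checksums, copied from the model list in this README.
EXPECTED_MD5 = {
    "icnet_cityscapes_train_30k.caffemodel": "c7038630c4b6c869afaaadd811bdb539",
    "icnet_cityscapes_trainval_90k.caffemodel": "4f4dd9eecd465dd8de7e4cf88ba5d5d5",
}


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, reading it in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def check_models(model_dir="evaluation/model"):
    """Map each model name to True (match), False (mismatch), or None (missing)."""
    results = {}
    for name, expected in EXPECTED_MD5.items():
        path = Path(model_dir) / name
        results[name] = (md5_of_file(path) == expected) if path.is_file() else None
    return results


if __name__ == "__main__":
    labels = {True: "OK", False: "MISMATCH", None: "missing"}
    for name, status in check_models().items():
        print(f"{name}: {labels[status]}")
```

A mismatch usually points to an incomplete download, so fetching the file again is the first thing to try.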
Evaluation time:
- To measure inference time as accurately as possible, make sure the GPU with the ID specified in script 'test_time.sh' is idle (no other processes running on it).
- Run the timing script:
```
./test_time.sh
```
Results:
- Prediction results are written to folder 'evaluation/mc_result'. The expected scores are:
  - ICNet trained on the training set for 30K iterations, evaluated on the validation set (mIoU/pAcc): 67.7/94.5
  - ICNet trained on the trainval set for 90K iterations, evaluated on the test set (mIoU): 69.5
- Inference time is logged to file 'time.log'; expect approximately 33 to 36 ms on a TitanX.
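For readers unfamiliar with the two reported metrics, the sketch below computes mIoU (mean intersection over union across classes) and pAcc (pixel accuracy) from a confusion matrix using their standard definitions. It is an illustration in Python, not the repository's MATLAB evaluation code. The function names are hypothetical, and the ignore label of 255 is an assumption based on the common Cityscapes convention.

```python
def confusion_matrix(labels, preds, num_classes, ignore_label=255):
    """Count (ground truth, prediction) pairs over flat sequences of class ids.

    Pixels whose ground truth equals ignore_label are skipped (assumed convention).
    """
    matrix = [[0] * num_classes for _ in range(num_classes)]
    for gt, pred in zip(labels, preds):
        if gt == ignore_label:
            continue
        matrix[gt][pred] += 1
    return matrix


def miou_and_pacc(matrix):
    """Return (mIoU, pAcc) as fractions in [0, 1] from a square confusion matrix."""
    n = len(matrix)
    total = sum(sum(row) for row in matrix)
    correct = sum(matrix[i][i] for i in range(n))
    ious = []
    for i in range(n):
        true_positive = matrix[i][i]
        gt_count = sum(matrix[i])                         # pixels labeled i
        pred_count = sum(matrix[r][i] for r in range(n))  # pixels predicted as i
        union = gt_count + pred_count - true_positive
        if union > 0:  # classes absent from both labels and predictions are skipped
            ious.append(true_positive / union)
    miou = sum(ious) / len(ious) if ious else 0.0
    pacc = correct / total if total else 0.0
    return miou, pacc


if __name__ == "__main__":
    # Toy example with two classes and one ignored pixel.
    cm = confusion_matrix([0, 0, 1, 1, 255], [0, 1, 1, 1, 0], num_classes=2)
    miou, pacc = miou_and_pacc(cm)
    print(f"mIoU: {miou * 100:.1f}  pAcc: {pacc * 100:.1f}")
```

The scores above are reported as percentages, so the fractions returned here would be multiplied by 100 for comparison.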
Demo video:
- Video processed by ICNet on the Cityscapes dataset:
  - Alpha blending with value as 0.5: Video
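Alpha blending overlays the colorized prediction on the input frame as a weighted sum of the two. The Python sketch below shows the standard formula on plain RGB tuples. It is an assumption about how such an overlay is typically produced, not the authors' video pipeline, and the function names are hypothetical. With a value of 0.5 the result is the same whichever layer the weight is applied to.

```python
def alpha_blend(image_pixel, color_pixel, alpha=0.5):
    """Blend one RGB pixel: alpha * color + (1 - alpha) * image, rounded to int."""
    return tuple(
        int(round(alpha * c + (1.0 - alpha) * i))
        for i, c in zip(image_pixel, color_pixel)
    )


def blend_image(image, color_mask, alpha=0.5):
    """Blend two images given as equally sized nested lists of RGB tuples."""
    return [
        [alpha_blend(img_px, col_px, alpha) for img_px, col_px in zip(img_row, col_row)]
        for img_row, col_row in zip(image, color_mask)
    ]


if __name__ == "__main__":
    frame = [[(100, 100, 100), (0, 0, 0)]]     # hypothetical 1x2 input frame
    mask = [[(200, 0, 50), (255, 255, 255)]]   # hypothetical colorized prediction
    print(blend_image(frame, mask))
```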

Citation

If ICNet is useful for your research, please consider citing:

@inproceedings{zhao2018icnet,
  title={ICNet for Real-Time Semantic Segmentation on High-Resolution Images},
  author={Zhao, Hengshuang and Qi, Xiaojuan and Shen, Xiaoyong and Shi, Jianping and Jia, Jiaya},
  booktitle={ECCV},
  year={2018}
}

Questions

Please contact '[email protected]'

Owner

Hengshuang Zhao
