Adversarial Learning for Semi-supervised Semantic Segmentation, BMVC 2018

Last update: Dec 19, 2022

Overview

This repo is the PyTorch implementation of the following paper:

Adversarial Learning for Semi-supervised Semantic Segmentation
Wei-Chih Hung, Yi-Hsuan Tsai, Yan-Ting Liou, Yen-Yu Lin, and Ming-Hsuan Yang
Proceedings of the British Machine Vision Conference (BMVC), 2018.

Contact: Wei-Chih Hung (whung8 at ucmerced dot edu)

The code is heavily borrowed from a PyTorch DeepLab implementation (Link). The baseline model is DeepLabv2-ResNet101 without multi-scale training or CRF post-processing, which yields a mean IoU of 73.6% on the VOC2012 validation set.

Please cite our paper if you find it useful for your research.

@inproceedings{Hung_semiseg_2018,
  author = {W.-C. Hung and Y.-H. Tsai and Y.-T. Liou and Y.-Y. Lin and M.-H. Yang},
  booktitle = {Proceedings of the British Machine Vision Conference (BMVC)},
  title = {Adversarial Learning for Semi-supervised Semantic Segmentation},
  year = {2018}
}

Prerequisite

CUDA/CUDNN
PyTorch >= 0.2 (only 0.4 is supported for evaluation; the code will be migrated to 0.4 soon)
python-opencv >= 3.4.0 (3.3 uses extra GPU memory with the multi-threaded data loader)
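
A quick way to check these version constraints is sketched below. The helpers `parse_version` and `version_at_least` are hypothetical names, not part of this repo; they only compare the leading numeric components of dotted version strings and ignore suffixes such as `post2` or `+cu90`.

```python
import re


def parse_version(version):
    """Turn '3.4.0' or '0.4.1.post2' into a tuple of leading integers."""
    parts = []
    for piece in version.split("."):
        match = re.match(r"\d+", piece)
        if match is None:
            break  # stop at the first non-numeric component, e.g. 'post2'
        parts.append(int(match.group()))
    return tuple(parts)


def version_at_least(installed, required):
    """Return True if `installed` is at least `required`."""
    a, b = parse_version(installed), parse_version(required)
    # Pad the shorter tuple with zeros so '0.4' compares like '0.4.0'.
    width = max(len(a), len(b))
    a += (0,) * (width - len(a))
    b += (0,) * (width - len(b))
    return a >= b
```

In practice the installed strings would come from `torch.__version__` and `cv2.__version__`, for example `version_at_least(cv2.__version__, "3.4.0")`.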

Installation

Clone this repo

git clone https://github.com/hfslyc/AdvSemiSeg.git

Place the VOC2012 dataset in AdvSemiSeg/dataset/VOC2012. For training, you will also need the augmented labels (Download). The folder structure should look like this:

AdvSemiSeg/dataset/VOC2012/JPEGImages
                          /SegmentationClassAug
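
Before training, it can help to confirm that both folders are where the scripts expect them. The following is a minimal sketch, not part of the repo; the function name `missing_voc_dirs` is hypothetical, and it checks only that the two folders listed above exist.

```python
from pathlib import Path

# Sub-folders named in the layout above.
REQUIRED_SUBDIRS = ("JPEGImages", "SegmentationClassAug")


def missing_voc_dirs(root):
    """Return the required sub-folders that are absent under `root`."""
    root = Path(root)
    return [name for name in REQUIRED_SUBDIRS if not (root / name).is_dir()]


if __name__ == "__main__":
    missing = missing_voc_dirs("AdvSemiSeg/dataset/VOC2012")
    if missing:
        print("Missing folders:", ", ".join(missing))
    else:
        print("Dataset layout looks complete.")
```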

Testing on VOC2012 validation set with pretrained models

python evaluate_voc.py --pretrained-model semi0.125 --save-dir results

This downloads the model pretrained with 1/8 of the training data and evaluates it on the VOC2012 val set. The colorized images are saved in results/ and the detailed per-class IoU is saved in results/result.txt. The mean IoU should be around 68.8%.

Available --pretrained-model options: semi0.125, semi0.25, semi0.5, advFull.
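
For readers unfamiliar with the metric, the sketch below shows the standard definition of per-class IoU and mean IoU computed from a confusion matrix. It is an illustration only, not a copy of evaluate_voc.py; the function names are hypothetical, and it assumes the usual VOC convention that label 255 marks pixels to ignore.

```python
def confusion_matrix(labels, preds, num_classes, ignore_label=255):
    """Count (ground truth, prediction) pairs over flat label sequences."""
    matrix = [[0] * num_classes for _ in range(num_classes)]
    for gt, pred in zip(labels, preds):
        if gt == ignore_label:
            continue  # assumed VOC convention: 255 marks void pixels
        matrix[gt][pred] += 1
    return matrix


def per_class_iou(matrix):
    """IoU_c = TP_c / (TP_c + FP_c + FN_c); None if class c never occurs."""
    n = len(matrix)
    ious = []
    for c in range(n):
        tp = matrix[c][c]
        fn = sum(matrix[c]) - tp                       # class c pixels missed
        fp = sum(matrix[r][c] for r in range(n)) - tp  # other pixels labeled c
        denom = tp + fp + fn
        ious.append(tp / denom if denom else None)
    return ious


def mean_iou(ious):
    """Average the IoU over the classes that actually occur."""
    valid = [v for v in ious if v is not None]
    if not valid:
        raise ValueError("no class occurs in labels or predictions")
    return sum(valid) / len(valid)


if __name__ == "__main__":
    # Toy example with two classes and one ignored pixel.
    labels = [0, 0, 1, 1, 255]
    preds = [0, 1, 1, 1, 0]
    ious = per_class_iou(confusion_matrix(labels, preds, num_classes=2))
    print(ious, mean_iou(ious))
```

In the toy example, class 0 has one true positive and one missed pixel (IoU 1/2), and class 1 has two true positives and one false positive (IoU 2/3).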

Example visualization results

Training on VOC2012

python train.py --snapshot-dir snapshots \
                --partial-data 0.125 \
                --num-steps 20000 \
                --lambda-adv-pred 0.01 \
                --lambda-semi 0.1 --semi-start 5000 --mask-T 0.2

The parameters correspond to those in Table 5 of the paper.
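
As a rough guide to what these flags control, the sketch below follows the loss formulation in the paper: a cross-entropy term, an adversarial term weighted by --lambda-adv-pred, and a semi-supervised term weighted by --lambda-semi that uses only pixels whose discriminator confidence exceeds --mask-T. All function names are hypothetical, the inputs are plain Python lists rather than tensors, and the assumption that the semi-supervised term switches on at step --semi-start is an interpretation of the flag; train.py may differ in detail.

```python
def confidence_mask(confidence_map, mask_t=0.2):
    """Mark pixels whose discriminator confidence exceeds the threshold."""
    return [[value > mask_t for value in row] for row in confidence_map]


def pseudo_labels(class_probs, mask, ignore_label=255):
    """Argmax class where the mask is True, `ignore_label` elsewhere.

    class_probs[y][x] is a list of per-class probabilities for one pixel.
    """
    labels = []
    for prob_row, mask_row in zip(class_probs, mask):
        row = []
        for probs, keep in zip(prob_row, mask_row):
            if keep:
                row.append(max(range(len(probs)), key=probs.__getitem__))
            else:
                row.append(ignore_label)
        labels.append(row)
    return labels


def semi_weight(step, lambda_semi=0.1, semi_start=5000):
    """Weight of the semi-supervised term (assumed zero before semi_start)."""
    return lambda_semi if step >= semi_start else 0.0


def total_loss(loss_ce, loss_adv, loss_semi, step,
               lambda_adv_pred=0.01, lambda_semi=0.1, semi_start=5000):
    """Combine the three loss terms with the weights from the command line."""
    return (loss_ce
            + lambda_adv_pred * loss_adv
            + semi_weight(step, lambda_semi, semi_start) * loss_semi)
```

With the defaults above, the semi-supervised term contributes nothing for the first 5000 steps, and afterwards only pixels with confidence above 0.2 receive a pseudo label.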

To evaluate a trained model, run the following:

python evaluate_voc.py --restore-from snapshots/VOC_20000.pth \
                       --save-dir results

Changelog

07/24/2018: Update BMVC results

