Adversarial Learning for Semi-supervised Semantic Segmentation, BMVC 2018

Last update: Dec 19, 2022

Overview

Adversarial Learning for Semi-supervised Semantic Segmentation

This repo is the pytorch implementation of the following paper:

Adversarial Learning for Semi-supervised Semantic Segmentation
Wei-Chih Hung, Yi-Hsuan Tsai, Yan-Ting Liou, Yen-Yu Lin, and Ming-Hsuan Yang
Proceedings of the British Machine Vision Conference (BMVC), 2018.

Contact: Wei-Chih Hung (whung8 at ucmerced dot edu)

The code are heavily borrowed from a pytorch DeepLab implementation (Link). The baseline model is DeepLabv2-Resnet101 without multiscale training and CRF post processing, which yields meanIOU 73.6% on the VOC2012 validation set.

Please cite our paper if you find it useful for your research.

@inproceedings{Hung_semiseg_2018,
  author = {W.-C. Hung and Y.-H. Tsai and Y.-T. Liou and Y.-Y. Lin and M.-H. Yang},
  booktitle = {Proceedings of the British Machine Vision Conference (BMVC)},
  title = {Adversarial Learning for Semi-supervised Semantic Segmentation},
  year = {2018}
}

Prerequisite

CUDA/CUDNN
pytorch >= 0.2 (We only support 0.4 for evaluation. Will migrate the code to 0.4 soon.)
python-opencv >=3.4.0 (3.3 will cause extra GPU memory on multithread data loader)

Installation

Clone this repo

git clone https://github.com/hfslyc/AdvSemiSeg.git

Place VOC2012 dataset in AdvSemiSeg/dataset/VOC2012. For training, you will need the augmented labels (Download). The folder structure should be like:

AdvSemiSeg/dataset/VOC2012/JPEGImages
                          /SegmentationClassAug

Testing on VOC2012 validation set with pretrained models

python evaluate_voc.py --pretrained-model semi0.125 --save-dir results

It will download the pretrained model with 1/8 training data and evaluate on the VOC2012 val set. The colorized images will be saved in results/ and the detailed class IOU will be saved in results/result.txt. The mean IOU should be around 68.8%.

Available --pretrained-model options: semi0.125, semi0.25, semi0.5 , advFull.

Example visualization results

Training on VOC2012

python train.py --snapshot-dir snapshots \
                --partial-data 0.125 \
                --num-steps 20000 \
                --lambda-adv-pred 0.01 \
                --lambda-semi 0.1 --semi-start 5000 --mask-T 0.2

The parameters correspond to those in Table 5 of the paper.

To evaluate trained model, execute the following:

python evaluate_voc.py --restore-from snapshots/VOC_20000.pth \
                       --save-dir results

Changelog

07/24/2018: Update BMVC results

Adversarial Learning for Semi-supervised Semantic Segmentation, BMVC 2018

Related tags

Overview

Adversarial Learning for Semi-supervised Semantic Segmentation

Prerequisite

Installation

Testing on VOC2012 validation set with pretrained models

Example visualization results

Training on VOC2012

Changelog

Owner

Wayne Hung

Optimizing Value-at-Risk and Conditional Value-at-Risk of Black Box Functions with Lacing Values (LV)

Official code for "On the Frequency Bias of Generative Models", NeurIPS 2021

On Nonlinear Latent Transformations for GAN-based Image Editing - PyTorch implementation

Pytorch implementation of "ARM: Any-Time Super-Resolution Method"

Improving Factual Completeness and Consistency of Image-to-text Radiology Report Generation

Official repository of "DeepMIH: Deep Invertible Network for Multiple Image Hiding", TPAMI 2022.

Deep Learning Models for Causal Inference

TensorFlow implementation of original paper : https://github.com/hszhao/PSPNet

This is the official code of L2G, Unrolling and Recurrent Unrolling in Learning to Learn Graph Topologies.

How to Learn a Domain Adaptive Event Simulator? ACM MM, 2021

GANTheftAuto is a fork of the Nvidia's GameGAN

CVPR 2022 "Online Convolutional Re-parameterization"

Sound Source Localization for AI Grand Challenge 2021

[3DV 2021] Channel-Wise Attention-Based Network for Self-Supervised Monocular Depth Estimation

Tensorflow implementation of "Learning Deep Features for Discriminative Localization"

StarGAN v2-Tensorflow - Simple Tensorflow implementation of StarGAN v2

Some methods for comparing network representations in deep learning and neuroscience.

Low Complexity Channel estimation with Neural Network Solutions

Simple keras FCN Encoder/Decoder model for MS-COCO (food subset) segmentation

Homepage of paper: Paint Transformer: Feed Forward Neural Painting with Stroke Prediction, ICCV 2021.