ST++: Make Self-training Work Better for Semi-supervised Semantic Segmentation

Last update: Jan 03, 2023

Overview

ST++

This is the official PyTorch implementation of our paper:

ST++: Make Self-training Work Better for Semi-supervised Semantic Segmentation.
Lihe Yang, Wei Zhuo, Lei Qi, Yinghuan Shi and Yang Gao.

Getting Started

Data Preparation

Pre-trained Model

ResNet-50 | ResNet-101 | DeepLabv2-ResNet-101

Dataset

Pascal | Augmented Masks | Cityscapes | Class Mapped Masks

File Organization

├── ./pretrained
    ├── resnet50.pth
    ├── resnet101.pth
    └── deeplabv2_resnet101_coco_pretrained.pth
    
├── [Your Pascal Path]
    ├── JPEGImages
    └── SegmentationClass    # replace the official folder with above augmented masks 
    
├── [Your Cityscapes Path]
    ├── gtFine               # replace the official folder with above class mapped masks 
    └── leftImg8bit

Training and Testing

export semi_setting='pascal/1_8/split_0'

CUDA_VISIBLE_DEVICES=0,1 python -W ignore main.py \
  --dataset pascal --data-root [Your Pascal Path] \
  --batch-size 16 --backbone resnet50 --model deeplabv3plus \
  --labeled-id-path dataset/splits/$semi_setting/labeled.txt \
  --unlabeled-id-path dataset/splits/$semi_setting/unlabeled.txt \
  --pseudo-mask-path outdir/pseudo_masks/$semi_setting \
  --save-path outdir/models/$semi_setting

This script is for our ST framework. To run ST++, add --plus --reliable-id-path outdir/reliable_ids/$semi_setting.

Acknowledgement

The DeepLabv2 MS COCO pre-trained model is borrowed and converted from AdvSemiSeg. The image partitions are borrowed from Context-Aware-Consistency and PseudoSeg. Part of the training hyper-parameters and network structures are adapted from PyTorch-Encoding. The strong data augmentations are borrowed from MoCo v2 and PseudoSeg.

AdvSemiSeg: https://github.com/hfslyc/AdvSemiSeg.
Context-Aware-Consistency: https://github.com/dvlab-research/Context-Aware-Consistency.
PseudoSeg: https://github.com/googleinterns/wss.
PyTorch-Encoding: https://github.com/zhanghang1989/PyTorch-Encoding.
MoCo: https://github.com/facebookresearch/moco.
OpenSelfSup: https://github.com/open-mmlab/OpenSelfSup.

Thanks a lot for their great works!

Citation

If you find this project useful, please consider citing:

@article{yang2021st++,
  title={ST++: Make Self-training Work Better for Semi-supervised Semantic Segmentation},
  author={Yang, Lihe and Zhuo, Wei and Qi, Lei and Shi, Yinghuan and Gao, Yang},
  journal={arXiv preprint arXiv:2106.05095},
  year={2021}
}

ST++: Make Self-training Work Better for Semi-supervised Semantic Segmentation

Related tags

Overview

ST++

Getting Started

Data Preparation

Pre-trained Model

Dataset

File Organization

Training and Testing

Acknowledgement

Citation

Owner

Lihe Yang

Code for "Typilus: Neural Type Hints" PLDI 2020

Python package facilitating the use of Bayesian Deep Learning methods with Variational Inference for PyTorch

Hyperparameter Optimization for TensorFlow, Keras and PyTorch

Code for ICDM2020 full paper: "Sub-graph Contrast for Scalable Self-Supervised Graph Representation Learning"

PyTorch implementation of spectral graph ConvNets, NIPS’16

Monocular 3D pose estimation. OpenVINO. CPU inference or iGPU (OpenCL) inference.

A customisable game where you have to quickly click on black tiles in order of appearance while avoiding clicking on white squares.

The code repository for "RCNet: Reverse Feature Pyramid and Cross-scale Shift Network for Object Detection" (ACM MM'21)

Code release for ICCV 2021 paper "Anticipative Video Transformer"

A Planar RGB-D SLAM which utilizes Manhattan World structure to provide optimal camera pose trajectory while also providing a sparse reconstruction containing points, lines and planes, and a dense surfel-based reconstruction.

:fire: 2D and 3D Face alignment library build using pytorch

Implementation of fast algorithms for Maximum Spanning Tree (MST) parsing that includes fast ArcMax+Reweighting+Tarjan algorithm for single-root dependency parsing.

ENet: A Deep Neural Network Architecture for Real-Time Semantic Segmentation

A Neural Net Training Interface on TensorFlow, with focus on speed + flexibility

Tensorflow python implementation of "Learning High Fidelity Depths of Dressed Humans by Watching Social Media Dance Videos"

Implementation and replication of ProGen, Language Modeling for Protein Generation, in Jax

This repository is the official implementation of the Hybrid Self-Attention NEAT algorithm.

Unofficial pytorch implementation of paper "One-Shot Free-View Neural Talking-Head Synthesis for Video Conferencing"

JAX + dataclasses

Code for Active Learning at The ImageNet Scale.