A Strong Baseline for Image Semantic Segmentation

Introduction

This project is an open source semantic segmentation toolbox based on PyTorch. It is based on the codes of our Tianchi competition in 2021 (https://tianchi.aliyun.com/competition/entrance/531860/introduction).
In the competition, our team won the third place (please see Tianchi_README.md).

Overview

The master branch works with PyTorch 1.6+.The project now supports popular and contemporary semantic segmentation frameworks, e.g. UNet, DeepLabV3+, HR-Net etc.

Requirements

Support

Backbone

ResNet (CVPR'2016)
SeNet (CVPR'2018)
IBN-Net (CVPR'2018)
EfficientNet (CVPR'2020)

Methods

Tricks

Tools

large image inference (cut and merge)
post process (crf/superpixels)

Quick Start

Train a model

python train.py --config_file ${CONFIG_FILE}

CONFIG_FILE: File of training config about model

Examples:
We trained our model in Tianchi competition according to the following script:
Stage 1 (160e)

python train.py --config_file configs/tc_seg/tc_seg_res_unet_r34_ibn_a_160e.yml

Stage 2 (swa 24e)

python train.py --config_file configs/tc_seg/tc_seg_res_unet_r34_ibn_a_swa.yml

Inference with pretrained models

python inference.py --config_file ${CONFIG_FILE}

CONFIG_FILE: File of inference config about model

Predict large image with pretrained models

python predict_demo.py --config_file ${CONFIG_FILE} --rs_img_file ${IMAGE_FILE_PATH} --temp_img_save_path ${TEMP_CUT_PATH} -temp_seg_map_save_path ${TEMP_SAVE_PATH} --save_seg_map_file ${SAVE_SEG_FILE}

CONFIG_FILE: File of inference config about model
IMAGE_FILE_PATH: File of large input image to predict
TEMP_CUT_PATH: Temp folder of small cutting samples
TEMP_SAVE_PATH: Temp folder of predict results of cutting samples
SAVE_SEG_FILE: Predict result of the large image

A Strong Baseline for Image Semantic Segmentation

Related tags

Overview

A Strong Baseline for Image Semantic Segmentation

Introduction

Overview

Requirements

Support

Backbone

Methods

Tricks

Tools

Quick Start

Train a model

Inference with pretrained models

Predict large image with pretrained models

Owner

Clark He

Pmapper is a super-resolution and deconvolution toolkit for python 3.6+

An implementation for the loss function proposed in Decoupled Contrastive Loss paper.

Deep Implicit Moving Least-Squares Functions for 3D Reconstruction

Personal implementation of paper "Approximate Nearest Neighbor Negative Contrastive Learning for Dense Text Retrieval"

VIMPAC: Video Pre-Training via Masked Token Prediction and Contrastive Learning

Tensorflow implementation of Character-Aware Neural Language Models.

The `rtdl` library + The official implementation of the paper

AdaNet is a lightweight TensorFlow-based framework for automatically learning high-quality models with minimal expert intervention

City Surfaces: City-scale Semantic Segmentation of Sidewalk Surfaces

A Differentiable Recipe for Learning Visual Non-Prehensile Planar Manipulation

Code for "NeuralRecon: Real-Time Coherent 3D Reconstruction from Monocular Video", CVPR 2021 oral

An open source object detection toolbox based on PyTorch

HeartRate detector with ArduinoandPython - Use Arduino and Python create a heartrate detector.

Manim is an engine for precise programmatic animations, designed for creating explanatory math videos

Human Pose estimation with TensorFlow framework

RuDOLPH: One Hyper-Modal Transformer can be creative as DALL-E and smart as CLIP

A project that uses optical flow and machine learning to detect aimhacking in video clips.

Trading and Backtesting environment for training reinforcement learning agent or simple rule base algo.

VLGrammar: Grounded Grammar Induction of Vision and Language

Capstone-Project-2 - A game program written in the Python language