A Strong Baseline for Image Semantic Segmentation

Introduction

This project is an open-source semantic segmentation toolbox based on PyTorch. It builds on the code we wrote for a 2021 Tianchi competition (https://tianchi.aliyun.com/competition/entrance/531860/introduction).
Our team won third place in that competition (see Tianchi_README.md).

Overview

The master branch works with PyTorch 1.6+. The project supports popular, contemporary semantic segmentation frameworks such as UNet, DeepLabV3+, and HRNet.

Requirements

Support

Backbone

ResNet (CVPR'2016)
SENet (CVPR'2018)
IBN-Net (ECCV'2018)
EfficientNet (ICML'2019)

Methods

Tricks

Tools

Large-image inference (cut and merge)
Post-processing (CRF / superpixels)
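The cut-and-merge idea behind large-image inference can be sketched as follows. This is a minimal illustration, not the repository's implementation: the function names `tile_windows`, `merge_tiles`, and `predict_large` are hypothetical, images are plain nested lists rather than arrays, and overlapping regions are resolved by letting later tiles overwrite earlier ones.

```python
def tile_windows(height, width, tile, overlap=0):
    """Return (top, left, bottom, right) windows that cover a height x width image."""
    if tile <= 0 or not 0 <= overlap < tile:
        raise ValueError("need tile > 0 and 0 <= overlap < tile")
    stride = tile - overlap

    def starts(size):
        # An image no larger than one tile needs a single window.
        if size <= tile:
            return [0]
        pos = list(range(0, size - tile, stride))
        pos.append(size - tile)  # shift the last window back so it ends at the border
        return pos

    return [(t, l, min(t + tile, height), min(l + tile, width))
            for t in starts(height) for l in starts(width)]


def merge_tiles(height, width, windows, tile_preds):
    """Paste per-tile label maps back into one full-size label map."""
    merged = [[0] * width for _ in range(height)]
    for (top, left, bottom, right), pred in zip(windows, tile_preds):
        for r in range(top, bottom):
            # Later tiles overwrite earlier ones where windows overlap.
            merged[r][left:right] = pred[r - top][: right - left]
    return merged


def predict_large(image, tile, overlap, predict_tile):
    """Cut the image into tiles, run predict_tile on each, and merge the results."""
    height, width = len(image), len(image[0])
    windows = tile_windows(height, width, tile, overlap)
    preds = []
    for top, left, bottom, right in windows:
        crop = [row[left:right] for row in image[top:bottom]]
        preds.append(predict_tile(crop))
    return merge_tiles(height, width, windows, preds)
```

Here `predict_tile` stands in for a model forward pass that returns a label map of the same size as its input. A real pipeline would more likely work on arrays, write tiles to temporary folders, and possibly blend predictions in the overlap instead of overwriting them.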

Quick Start

Train a model

python train.py --config_file ${CONFIG_FILE}

CONFIG_FILE: Path to the training config file for the model

Example:
We trained our Tianchi competition model in two stages with the following commands:
Stage 1 (160 epochs)

python train.py --config_file configs/tc_seg/tc_seg_res_unet_r34_ibn_a_160e.yml

Stage 2 (SWA, 24 epochs)

python train.py --config_file configs/tc_seg/tc_seg_res_unet_r34_ibn_a_swa.yml
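For readers unfamiliar with the second stage, the core of stochastic weight averaging (SWA) is a running mean over the weights of several checkpoints. The sketch below shows only that averaging step, using plain floats in place of tensors. The helper `update_swa` is hypothetical and is not taken from this repository, whose SWA stage is driven by the config file above.

```python
def update_swa(swa_weights, new_weights, n_averaged):
    """Fold one more checkpoint into the running mean of the weights.

    swa_weights: dict of parameter name -> averaged value, or None before the first update
    new_weights: dict of parameter name -> value from the latest checkpoint
    n_averaged:  number of checkpoints already included in swa_weights
    """
    if swa_weights is None:
        return dict(new_weights), 1
    updated = {
        name: swa_weights[name] + (new_weights[name] - swa_weights[name]) / (n_averaged + 1)
        for name in swa_weights
    }
    return updated, n_averaged + 1


# Toy checkpoints standing in for the model state saved after each SWA epoch.
checkpoints = [
    {"w": 1.0, "b": 0.0},
    {"w": 3.0, "b": 3.0},
    {"w": 5.0, "b": 6.0},
]

swa, count = None, 0
for ckpt in checkpoints:
    swa, count = update_swa(swa, ckpt, count)
```

PyTorch 1.6+ provides `torch.optim.swa_utils.AveragedModel` and `torch.optim.swa_utils.update_bn` for the same purpose. Whether this project relies on them is not stated here. After averaging, batch-norm statistics usually need to be recomputed on the training data.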

Inference with pretrained models

python inference.py --config_file ${CONFIG_FILE}

CONFIG_FILE: Path to the inference config file for the model

Predict large image with pretrained models

python predict_demo.py --config_file ${CONFIG_FILE} --rs_img_file ${IMAGE_FILE_PATH} --temp_img_save_path ${TEMP_CUT_PATH} --temp_seg_map_save_path ${TEMP_SAVE_PATH} --save_seg_map_file ${SAVE_SEG_FILE}

CONFIG_FILE: Path to the inference config file for the model
IMAGE_FILE_PATH: Path to the large input image to predict
TEMP_CUT_PATH: Temporary folder for the small tiles cut from the input image
TEMP_SAVE_PATH: Temporary folder for the per-tile prediction results
SAVE_SEG_FILE: Output file for the merged segmentation map of the large image
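When large-image prediction is launched from another Python script, the command can be assembled as an argument list. The sketch below uses the flag names from the command above and assumes every flag takes the double-dash form. The helper `build_predict_command` and all file paths are placeholders chosen for illustration.

```python
import shlex


def build_predict_command(config_file, image_file, temp_cut_dir, temp_seg_dir, out_file):
    """Assemble the predict_demo.py command line as a list of arguments."""
    return [
        "python", "predict_demo.py",
        "--config_file", config_file,
        "--rs_img_file", image_file,
        "--temp_img_save_path", temp_cut_dir,
        "--temp_seg_map_save_path", temp_seg_dir,
        "--save_seg_map_file", out_file,
    ]


# All paths below are placeholders, not files shipped with the project.
cmd = build_predict_command(
    config_file="path/to/inference_config.yml",
    image_file="data/large_image.tif",
    temp_cut_dir="tmp/tiles",
    temp_seg_dir="tmp/tile_preds",
    out_file="output/large_image_seg.png",
)

# Show the command in a shell-safe form (shlex.join needs Python 3.8+).
print(shlex.join(cmd))

# To actually run it from the repository root:
#   import subprocess
#   subprocess.run(cmd, check=True)
```

Passing the arguments as a list avoids shell quoting problems when paths contain spaces.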

Owner

Clark He
