Dynamic Neural Representational Decoders for High-Resolution Semantic Segmentation

Last update: May 28, 2022

Related tags

Deep Learning NRD_decoder

Overview

Dynamic Neural Representational Decoders for High-Resolution Semantic Segmentation

Requirements

This repository needs mmsegmentation

Training

To train the model(s) in the paper, run this command:

python tools/train.py ./configs/NRD/ade20k/NRD_r101_512x512_164k_ade20k.py

The batch size is 16 in this work. Please change the 'samples_per_gpu' in configs/base/datasets/.. accordingly

Evaluation

To evaluate my model at single-scale inference, run:

python tools/eval.py ./configs/NRD/ade20k/NRD_r101_512x512_164k_ade20k.py  {path-to-checkpoint-file}   --eval mIoU

Pre-trained Models

Results

Our model achieves the following performance on :

[Semantic segmentation results]

Model name	datasets	mIoU	mIoU (ms)
NRD-r101	ade20k (val)	44.01	45.62
NRD-x101	ade20k (val)	44.34	46.35
NRD-r101	pascal-context(val)	52.31 (59 classes)	54.1 (59 classes)
NRD-r101	pascal-context(val)	47.5 (60 classes)	40.9 (60 classes)
NRD-r50	Cityscapes (val)	79.8	80.8
NRD-r101	Cityscapes (val)	80.7	82.0

Contributing

The code is mostly taken from mmsegmentation mmsegmentation is released under the Apache 2.0 license.

Dynamic Neural Representational Decoders for High-Resolution Semantic Segmentation

Related tags

Overview

Dynamic Neural Representational Decoders for High-Resolution Semantic Segmentation

Requirements

Training

Evaluation

Pre-trained Models

Results

[Semantic segmentation results]

Contributing

Owner

Wav2Vec for speech recognition, classification, and audio classification

EFENet: Reference-based Video Super-Resolution with Enhanced Flow Estimation

Code, Models and Datasets for OpenViDial Dataset

PiRapGenerator - Make anyone rap the digits of pi

Crosslingual Segmental Language Model

Learning cell communication from spatial graphs of cells

Tensorflow implementation of "Learning Deep Features for Discriminative Localization"

[ ICCV 2021 Oral ] Our method can estimate camera poses and neural radiance fields jointly when the cameras are initialized at random poses in complex scenarios (outside-in scenes, even with less texture or intense noise )

Efficiently Disentangle Causal Representations

Canonical Capsules: Unsupervised Capsules in Canonical Pose (NeurIPS 2021)

Athena is the only tool that you will ever need to optimize your portfolio.

Unofficial implementation of the Involution operation from CVPR 2021

Code for the prototype tool in our paper "CoProtector: Protect Open-Source Code against Unauthorized Training Usage with Data Poisoning".

Unified learning approach for egocentric hand gesture recognition and fingertip detection

clustimage is a python package for unsupervised clustering of images.

PyTorch implementation of the Transformer in Post-LN (Post-LayerNorm) and Pre-LN (Pre-LayerNorm).

The software associated with a paper accepted at EMNLP 2021 titled "Open Knowledge Graphs Canonicalization using Variational Autoencoders".

这是一个yolox-pytorch的源码，可以用于训练自己的模型。

3D Human Pose Machines with Self-supervised Learning

PyTorch implementation of the cross-modality generative model that synthesizes dance from music.