Dynamic Neural Representational Decoders for High-Resolution Semantic Segmentation

Last update: Sep 12, 2022

Related tags

Deep Learning NRD_decoder

Overview

Dynamic Neural Representational Decoders for High-Resolution Semantic Segmentation

Requirements

This repository requires mmsegmentation.

Training

To train the model(s) in the paper, run this command:

python tools/train.py ./configs/NRD/ade20k/NRD_r101_512x512_164k_ade20k.py

This work uses a batch size of 16. Adjust 'samples_per_gpu' in configs/base/datasets/.. to match the number of GPUs you train on.
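As a rough illustration of the batch-size note above, the sketch below derives a per-GPU value from a total batch size. It assumes that the batch size of 16 is the total across all GPUs and that the effective batch size is 'samples_per_gpu' multiplied by the number of GPUs, which is the usual mmsegmentation convention. The helper name `samples_per_gpu_for` and the 8-GPU setup are hypothetical and not part of this repository.

```python
def samples_per_gpu_for(total_batch_size: int, num_gpus: int) -> int:
    """Return the per-GPU batch size that gives the requested total.

    Hypothetical helper for illustration; not part of this repository.
    """
    if num_gpus <= 0:
        raise ValueError("num_gpus must be positive")
    if total_batch_size % num_gpus != 0:
        raise ValueError("total batch size must be divisible by num_gpus")
    return total_batch_size // num_gpus


# Fragment in the style of an mmsegmentation dataset config.
# The 8-GPU setup is an assumption; substitute your own GPU count.
data = dict(samples_per_gpu=samples_per_gpu_for(16, 8))
print(data)
```

With 8 GPUs this gives 2 samples per GPU, and with 4 GPUs it gives 4. A GPU count that does not divide 16 raises an error rather than silently changing the total batch size.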

Evaluation

To evaluate the model with single-scale inference, run:

python tools/eval.py ./configs/NRD/ade20k/NRD_r101_512x512_164k_ade20k.py  {path-to-checkpoint-file}   --eval mIoU
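The '--eval mIoU' option in the command above reports mean intersection-over-union. The following minimal sketch shows how that metric is commonly defined, using flat lists of per-pixel labels. It is not the implementation used by this repository or by mmsegmentation. The function name `mean_iou` and the ignore label of 255 are assumptions made for illustration.

```python
def mean_iou(pred, target, num_classes, ignore_index=255):
    """Mean IoU over classes that occur in the prediction or the ground truth.

    pred and target are flat sequences of integer class labels of equal
    length. Pixels whose target equals ignore_index (assumed to be 255
    here) are skipped. Predictions must lie in [0, num_classes).
    """
    intersect = [0] * num_classes
    union = [0] * num_classes
    for p, t in zip(pred, target):
        if t == ignore_index:
            continue
        if p == t:
            # Pixel counts once toward both intersection and union.
            intersect[t] += 1
            union[t] += 1
        else:
            # A mismatch enlarges the union of both classes involved.
            union[p] += 1
            union[t] += 1
    ious = [i / u for i, u in zip(intersect, union) if u > 0]
    if not ious:
        raise ValueError("no valid pixels to evaluate")
    return sum(ious) / len(ious)


pred = [0, 0, 1, 1]
target = [0, 1, 1, 255]
print(mean_iou(pred, target, num_classes=2))  # 0.5
```

In this toy example each class has one correctly labeled pixel and a union of two pixels, so both per-class IoUs are 0.5. The values in the results table are the same quantity expressed as a percentage.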

Pre-trained Models

Results

Our model achieves the following performance on ADE20K, PASCAL-Context, and Cityscapes:

[Semantic segmentation results]

Model name	Dataset	mIoU (single-scale)	mIoU (multi-scale)
NRD-r101	ADE20K (val)	44.01	45.62
NRD-x101	ADE20K (val)	44.34	46.35
NRD-r101	PASCAL-Context (val)	52.31 (59 classes)	54.1 (59 classes)
NRD-r101	PASCAL-Context (val)	47.5 (60 classes)	40.9 (60 classes)
NRD-r50	Cityscapes (val)	79.8	80.8
NRD-r101	Cityscapes (val)	80.7	82.0

Contributing

The code is mostly taken from mmsegmentation, which is released under the Apache 2.0 license.

Owner

Adelaide Intelligent Machines (AIM) Group
