PyTorch implementation of ShapeConv: Shape-aware Convolutional Layer for RGB-D Indoor Semantic Segmentation.

Last update: Dec 29, 2022

Related tags

Overview

Shape-aware Convolutional Layer (ShapeConv)

PyTorch implementation of ShapeConv: Shape-aware Convolutional Layer for RGB-D Indoor Semantic Segmentation.

Introduction

We design a Shape-aware Convolutional(ShapeConv) layer to explicitly model the shape information for enhancing the RGB-D semantic segmentation accuracy. Specifically, we decompose the depth feature into a shape-component and a value component, after which two learnable weights are introduced to handle the shape and value with differentiation. Extensive experiments on three challenging indoor RGB-D semantic segmentation benchmarks, i.e., NYU-Dv2(-13,-40), SUN RGB-D, and SID, demonstrate the effectiveness of our ShapeConv when employing it over five popular architectures.

Usage

Installation

Requirements

Linux
Python 3.6+
PyTorch 1.7.0 or higher
CUDA 10.0 or higher

We have tested the following versions of OS and softwares:

OS: Ubuntu 16.04.6 LTS
CUDA: 10.0
PyTorch 1.7.0
Python 3.6.9

Install dependencies.

pip install -r requirements.txt

Dataset

Download the offical dataset and convert to a format appropriate for this project. See here.

Or download the converted dataset:

Evaluation

Model

Download trained model and put it in folder ./model_zoo. See all trained models here.
Config

Edit config file in ./config. The config files in ./config correspond to the model files in ./models.
1. Set inference.gpu_id = CUDA_VISIBLE_DEVICES. CUDA_VISIBLE_DEVICES is used to specify which GPUs should be visible to a CUDA application, e.g., inference.gpu_id = "0,1,2,3".
2. Set dataset_root = path_to_dataset. path_to_dataset represents the path of dataset. e.g.,dataset_root = "/home/shape_conv/nyu_v2".
Run
1. Ditributed evaluation, please run:
```
./tools/dist_test.sh config_path checkpoint_path gpu_num
```
- config_path is path of config file;
- checkpoint_pathis path of model file;
- gpu_num is the number of GPUs used, note that gpu_num <= len(inference.gpu_id).
E.g., evaluate shape-conv model on NYU-V2(40 categories), please run:
```
./tools/dist_test.sh configs/nyu/nyu40_deeplabv3plus_resnext101_shape.py model_zoo/nyu40_deeplabv3plus_resnext101_shape.pth 4
```
1. Non-distributed evaluation
```
python tools/test.py config_path checkpoint_path
```

Train

Config

Edit config file in ./config.
1. Set inference.gpu_id = CUDA_VISIBLE_DEVICES.
  
  E.g.,inference.gpu_id = "0,1,2,3".
2. Set dataset_root = path_to_dataset.
  
  E.g.,dataset_root = "/home/shape_conv/nyu_v2".

Run

Ditributed training

./tools/dist_train.sh config_path gpu_num

E.g., train shape-conv model on NYU-V2(40 categories) with 4 GPUs, please run:

./tools/dist_train.sh configs/nyu/nyu40_deeplabv3plus_resnext101_shape.py 4

Non-distributed training

python tools/train.py config_path

Result

For more result, please see model zoo.

NYU-V2(40 categories)

Architecture	Backbone	MS & Flip	Shape Conv	mIOU
DeepLabv3plus	ResNeXt-101	False	False	48.9%
DeepLabv3plus	ResNeXt-101	False	True	50.2%
DeepLabv3plus	ResNeXt-101	True	False	50.3%
DeepLabv3plus	ResNeXt-101	True	True	51.3%

SUN-RGBD

Architecture	Backbone	MS & Flip	Shape Conv	mIOU
DeepLabv3plus	ResNet-101	False	False	46.9%
DeepLabv3plus	ResNet-101	False	True	47.6%
DeepLabv3plus	ResNet-101	True	False	47.6%
DeepLabv3plus	ResNet-101	True	True	48.6%

SID(Stanford Indoor Dataset)

Architecture	Backbone	MS & Flip	Shape Conv	mIOU
DeepLabv3plus	ResNet-101	False	False	54.55%
DeepLabv3plus	ResNet-101	False	True	60.6%

Acknowledgments

This repo was developed based on vedaseg.

PyTorch implementation of ShapeConv: Shape-aware Convolutional Layer for RGB-D Indoor Semantic Segmentation.

Related tags

Overview

Shape-aware Convolutional Layer (ShapeConv)

Introduction

Usage

Installation

Dataset

Evaluation

Train

Result

NYU-V2(40 categories)

SUN-RGBD

SID(Stanford Indoor Dataset)

Acknowledgments

Owner

Hanchao Leng

Python script for performing depth completion from sparse depth and rgb images using the msg_chn_wacv20. model in Tensorflow Lite.

Permute Me Softly: Learning Soft Permutations for Graph Representations

Official Codes for Graph Modularity:Towards Understanding the Cross-Layer Transition of Feature Representations in Deep Neural Networks.

Minecraft Hack Detection With Python

🔀 Visual Room Rearrangement

Facestar dataset. High quality audio-visual recordings of human conversational speech.

Pytorch code for our paper Beyond ImageNet Attack: Towards Crafting Adversarial Examples for Black-box Domains)

Experimental Python implementation of OpenVINO Inference Engine (very slow, limited functionality). All codes are written in Python. Easy to read and modify.

PyTorch implementation of the paper Deep Networks from the Principle of Rate Reduction

Repo for "Benchmarking Robustness of 3D Point Cloud Recognition against Common Corruptions" https://arxiv.org/abs/2201.12296

NR-GAN: Noise Robust Generative Adversarial Networks

Semantic Segmentation in Pytorch. Network include: FCN、FCN_ResNet、SegNet、UNet、BiSeNet、BiSeNetV2、PSPNet、DeepLabv3_plus、 HRNet、DDRNet

YKKDetector For Python

MM1 and MMC Queue Simulation using python - Results and parameters in excel and csv files

Fuzzification helps developers protect the released, binary-only software from attackers who are capable of applying state-of-the-art fuzzing techniques

Official codebase for "B-Pref: Benchmarking Preference-BasedReinforcement Learning" contains scripts to reproduce experiments.

N-Omniglot is a large neuromorphic few-shot learning dataset

Dynamic Attentive Graph Learning for Image Restoration, ICCV2021 [PyTorch Code]

A Tensorflow based library for Time Series Modelling with Gaussian Processes

FasterAI: A library to make smaller and faster models with FastAI.