Robust Instance Segmentation through Reasoning about Multi-Object Occlusion [CVPR 2021]

Last update: Jun 27, 2022

Related tags

Overview

Robust Instance Segmentation through Reasoning about Multi-Object Occlusion [CVPR 2021]

Abstract

Analyzing complex scenes with DNN is a challenging task, particularly when images contain multiple objects that partially occlude each other. Existing approaches to image analysis mostly process objects independently and do not take into account the relative occlusion of nearby objects. We propose a deep network for multi-object instance segmentation that is robust to occlusion and can be trained from bounding box supervision only.

We also introduce an Occlusion Challenge dataset generated from real-world segmented objects with accurate annotations and propose a taxonomy of occlusion scenarios that pose a particular challenge for computer vision.

NOTICE

dataset links and model will be released in a few days. Update: 18 June

Requirments

The code uses Python 3.6 and it is tested on PyTorch GPU version 1.2, with CUDA-10.0 and cuDNN-7.5.

Installation

Clone the repository with:

git clone https://github.com/XD7479/Multi-Object-Occlusion.git
cd Multi-Object-Occlusion

Install requirments:

pip install -r requirements.txt

Datasets

Download the KINS dataset here and the Occlusion Challenge dataset here.
Enter the project folder and make links for the datasets:

ln -s  kins
ln -s  occ_challenge

Download the pre-trained model here.
Make links for the pre-trained model:

ln -s  models

Check the configuration file configs.py for the dataset and backbone you're using:

dataset_eval = 'occ_challenge'      # kins, occ_challenge
nn_type = 'resnext'             # vgg, resnext

Run the evaluation code with:

python3 eval_meanIoU.py

Segmentation Demo

Citation

@misc{yuan2021robust,
      title={Robust Instance Segmentation through Reasoning about Multi-Object Occlusion}, 
      author={Xiaoding Yuan and Adam Kortylewski and Yihong Sun and Alan Yuille},
      booktitle = {Proceedings IEEE Conf. on Computer Vision and Pattern Recognition (CVPR)},
      month = jun,
      year = {2021},
      month_numeric = {6}
}

Contact

If you have any questions you can contact Xiaoding Yuan by [email protected].

Robust Instance Segmentation through Reasoning about Multi-Object Occlusion [CVPR 2021]

Related tags

Overview

Robust Instance Segmentation through Reasoning about Multi-Object Occlusion [CVPR 2021]

Abstract

NOTICE

Requirments

Installation

Datasets

Segmentation Demo

Citation

Contact

Owner

Irene Yuan

Bayesian Meta-Learning Through Variational Gaussian Processes

Code and Experiments for ACL-IJCNLP 2021 Paper Mind Your Outliers! Investigating the Negative Impact of Outliers on Active Learning for Visual Question Answering.

CowHerd is a partially-observed reinforcement learning environment

StyleGAN2 with adaptive discriminator augmentation (ADA) - Official TensorFlow implementation

Official Repo for Ground-aware Monocular 3D Object Detection for Autonomous Driving

A smaller subset of 10 easily classified classes from Imagenet, and a little more French

i-SpaSP: Structured Neural Pruning via Sparse Signal Recovery

Meta-learning for NLP

You Only Look One-level Feature (YOLOF), CVPR2021, Detectron2

C3DPO - Canonical 3D Pose Networks for Non-rigid Structure From Motion.

(NeurIPS 2020) Wasserstein Distances for Stereo Disparity Estimation

On Effective Scheduling of Model-based Reinforcement Learning

Transformer in Computer Vision

A Momentumized, Adaptive, Dual Averaged Gradient Method for Stochastic Optimization

Predict bus arrival time using VertexAI and Nvidia's Jetson Nano

Code from the paper "High-Performance Brain-to-Text Communication via Handwriting"

PyElastica is the Python implementation of Elastica, an open-source software for the simulation of assemblies of slender, one-dimensional structures using Cosserat Rod theory.

Large-scale open domain KNOwledge grounded conVERsation system based on PaddlePaddle

Official PyTorch implementation of the paper: Improving Graph Neural Network Expressivity via Subgraph Isomorphism Counting.

Out-of-boundary View Synthesis towards Full-frame Video Stabilization