Refer-it-in-RGBD

This is the repository for our CVPR 2021 paper 'Refer-it-in-RGBD: A Bottom-up Approach for 3D Visual Grounding in RGBD Images'.

Paper - ArXiv - pdf (abs)
Project page: https://unclemedm.github.io/Refer-it-in-RGBD/

Introduction

We present a novel task of 3D visual grounding in single-view RGB-D images, where the referred objects are often only partially scanned. In contrast to previous works that directly generate object proposals for grounding in 3D scenes, we propose a bottom-up approach that gradually aggregates information, effectively addressing the challenge posed by partial scans. Our approach first fuses the language and visual features at the bottom level to generate a heatmap that coarsely localizes the relevant regions in the RGB-D image. It then performs an adaptive search based on the heatmap and carries out object-level matching with another visio-linguistic fusion to finally ground the referred object. We evaluate the proposed method against state-of-the-art methods on both the RGB-D images extracted from the ScanRefer dataset and our newly collected SUN-Refer dataset. Experiments show that our method outperforms the previous methods by a large margin (by 11.1% and 11.2% Acc@0.5) on both datasets.
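For readers unfamiliar with the accuracy metric quoted above: in 3D visual grounding, accuracy at an IoU threshold is commonly taken to be the fraction of referring expressions whose predicted 3D box overlaps the ground-truth box with an IoU of at least that threshold. The snippet below is only a minimal sketch of that idea, not the evaluation code of this repository. It assumes axis-aligned boxes given as `(xmin, ymin, zmin, xmax, ymax, zmax)`, whereas real boxes may be oriented, and the names `iou_3d` and `accuracy_at_iou` are hypothetical.

```python
from typing import Sequence

# Assumed box layout (not taken from this repository):
# (xmin, ymin, zmin, xmax, ymax, zmax), axis-aligned.
Box = Sequence[float]


def box_volume(box: Box) -> float:
    """Volume of an axis-aligned box; degenerate boxes have volume 0."""
    volume = 1.0
    for axis in range(3):
        volume *= max(0.0, box[axis + 3] - box[axis])
    return volume


def iou_3d(a: Box, b: Box) -> float:
    """Intersection over union of two axis-aligned 3D boxes."""
    intersection = 1.0
    for axis in range(3):
        lo = max(a[axis], b[axis])
        hi = min(a[axis + 3], b[axis + 3])
        if hi <= lo:  # no overlap along this axis
            return 0.0
        intersection *= hi - lo
    union = box_volume(a) + box_volume(b) - intersection
    return intersection / union if union > 0 else 0.0


def accuracy_at_iou(preds: Sequence[Box], gts: Sequence[Box],
                    threshold: float = 0.5) -> float:
    """Fraction of predictions whose IoU with the matching ground truth
    is at least `threshold` (the >= comparison is an assumption)."""
    if len(preds) != len(gts):
        raise ValueError("preds and gts must have the same length")
    if not preds:
        return 0.0
    hits = sum(1 for p, g in zip(preds, gts) if iou_3d(p, g) >= threshold)
    return hits / len(preds)
```

With one prediction that matches its ground truth exactly and one that overlaps it only partially (IoU of 1/3), `accuracy_at_iou` at a threshold of 0.5 would count one hit out of two.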

Dataset

Download SUNREFER_v2 dataset
The SUNREFER dataset contains 38,495 referring expressions corresponding to 7,699 objects from the SUNRGBD dataset. Here is one example from the SUNREFER dataset:

Owner

Haolin Liu
