Last update: Dec 29, 2022

🌈 ERASOR (RA-L'21 with ICRA Option)

Official page of "ERASOR: Egocentric Ratio of Pseudo Occupancy-based Dynamic Object Removal for Static 3D Point Cloud Map Building", which was accepted by RA-L with the ICRA'21 option [Demo Video].

We provide all contents, including:

Source code of ERASOR
All outputs of the State-of-the-arts
Visualization
Calculation code of Preservation Rate/Rejection Rate

Enjoy our code! :)

Contact: Hyungtae Lim ([email protected])

Advisor: Hyun Myung ([email protected])

Test Env.
Requirements
How to Run ERASOR
Calculate PR/RR
Benchmark
Run Your Own Code
Visualization of All the State-of-the-arts
Citation

Test Env.

The code has been tested successfully on

Ubuntu 18.04 LTS
ROS Melodic

Requirements

ROS Setting

Install ROS on your machine.
jsk-visualization is also required to visualize the Scan Ratio Test (SRT) status.

sudo apt-get install ros-melodic-jsk-recognition
sudo apt-get install ros-melodic-jsk-common-msgs
sudo apt-get install ros-melodic-jsk-rviz-plugins

Building Our Package

Next, compile this package. We use catkin tools:

mkdir -p ~/catkin_ws/src
cd ~/catkin_ws/src
git clone https://github.com/LimHyungTae/ERASOR.Official.git
cd .. && catkin build erasor

Python Setting

Our PR/RR metric calculation code is implemented in Python 2.7.
To run the Python code, the following packages are necessary: pypcd, tqdm, scikit-learn, and tabulate.

pip install pypcd
pip install tqdm
pip install scikit-learn
pip install tabulate

Prepared dataset

Download the preprocessed KITTI data, which is encoded as rosbag files.
The download might take five minutes or so. All rosbags together require about 2.3 GB of storage space.

wget https://urserver.kaist.ac.kr/publicdata/erasor/rosbag/00_4390_to_4530_w_interval_2_node.bag
wget https://urserver.kaist.ac.kr/publicdata/erasor/rosbag/01_150_to_250_w_interval_1_node.bag
wget https://urserver.kaist.ac.kr/publicdata/erasor/rosbag/02_860_to_950_w_interval_2_node.bag
wget https://urserver.kaist.ac.kr/publicdata/erasor/rosbag/05_2350_to_2670_w_interval_2_node.bag
wget https://urserver.kaist.ac.kr/publicdata/erasor/rosbag/07_630_to_820_w_interval_2_node.bag
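
The same downloads can also be scripted. The following is a minimal Python 3 sketch, not part of the repository: the helper names rosbag_urls and download_missing are hypothetical, and only the URLs are taken from the commands above. Files that already exist in the destination folder are skipped, so a partially downloaded file would need to be deleted by hand first.

```python
import os
import urllib.request

# Base URL and file names copied from the wget commands above.
BASE_URL = "https://urserver.kaist.ac.kr/publicdata/erasor/rosbag/"
ROSBAGS = [
    "00_4390_to_4530_w_interval_2_node.bag",
    "01_150_to_250_w_interval_1_node.bag",
    "02_860_to_950_w_interval_2_node.bag",
    "05_2350_to_2670_w_interval_2_node.bag",
    "07_630_to_820_w_interval_2_node.bag",
]


def rosbag_urls():
    """Return the full download URL of every preprocessed rosbag."""
    return [BASE_URL + name for name in ROSBAGS]


def download_missing(dest_dir="."):
    """Download each rosbag into dest_dir unless a file with that name exists.

    Hypothetical helper: returns the list of paths that were fetched.
    """
    os.makedirs(dest_dir, exist_ok=True)
    fetched = []
    for name, url in zip(ROSBAGS, rosbag_urls()):
        target = os.path.join(dest_dir, name)
        if os.path.exists(target):
            continue  # already present, skip the download
        urllib.request.urlretrieve(url, target)
        fetched.append(target)
    return fetched
```

Calling download_missing("rosbags") would place the five files in a rosbags folder; nothing is downloaded until the function is called.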

Description of Preprocessed Rosbag Files

Please note that each rosbag consists of node messages. Refer to msg/node.msg.
For convenience, the label of each point is stored in its intensity field.
We treat the following classes as dynamic classes:

# 252: "moving-car"
# 253: "moving-bicyclist"
# 254: "moving-person"
# 255: "moving-motorcyclist"
# 256: "moving-on-rails"
# 257: "moving-bus"
# 258: "moving-truck"
# 259: "moving-other-vehicle"

Please refer to std::vector DYNAMIC_CLASSES in our code :).
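
To illustrate how the labels stored in the intensity field can be used, here is a minimal Python 3 sketch that splits a list of points into static and dynamic ones. It is not the repository's C++ code: the function split_static_dynamic and the (x, y, z, intensity) tuple layout are assumptions made for this example, and only the class IDs come from the list above.

```python
# Dynamic class IDs copied from the list above.
DYNAMIC_CLASSES = frozenset([252, 253, 254, 255, 256, 257, 258, 259])


def split_static_dynamic(points):
    """Split (x, y, z, intensity) tuples into static and dynamic point lists.

    Assumption: the intensity value holds the class label as a number,
    so rounding it to an integer recovers the label.
    """
    static, dynamic = [], []
    for x, y, z, intensity in points:
        label = int(round(intensity))
        if label in DYNAMIC_CLASSES:
            dynamic.append((x, y, z))
        else:
            static.append((x, y, z))
    return static, dynamic
```

For example, a point with intensity 252.0 ("moving-car") ends up in the dynamic list, while a point with any label outside the list above is kept as static.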

How to Run ERASOR

We will explain how to run our code on seq 05 of the KITTI dataset as an example.

Step 1. Build naive map

Set the following parameters in launch/mapgen.launch.
- target_rosbag: The name of target rosbag, e.g. 05_2350_to_2670_w_interval_2_node.bag
- save_path: The path where the naively accumulated map is saved.
Launch mapgen.launch and play the corresponding rosbag in another terminal as follows:

roscore # (Optional)
roslaunch erasor mapgen.launch
rosbag play 05_2350_to_2670_w_interval_2_node.bag

Then, the dense map and the voxelized map are automatically saved at the save path. Note that the dense map is used to fill in the corresponding labels (HERE). The voxelized map serves as the naively accumulated map that is the input of step 2.

Step 2. Run ERASOR

Set the following parameters in config/seq_05.yaml.
- initial_map_path: The path of naively accumulated map
- save_path: The path where the filtered static map is saved.
Run each of the following commands in its own terminal.

roscore # (Optional)
roslaunch erasor run_erasor.launch target_seq:="05"
rosbag play 05_2350_to_2670_w_interval_2_node.bag

IMPORTANT: After ERASOR has finished running, run the following command in another terminal to save the static map as a pcd file.
"0.2" denotes the voxelization size.

rostopic pub /saveflag std_msgs/Float32 "data: 0.2"

A message is then printed in the terminal.

The results will be saved under the save_path folder, i.e. $save_path$/05_result.pcd.

Calculate PR/RR

You can check our results directly.

First, download all pcd materials.

wget https://urserver.kaist.ac.kr/publicdata/erasor/erasor_paper_pcds.zip
unzip erasor_paper_pcds.zip

Then, run the analysis code as follows:

python analysis.py --gt $GT_PCD_PATH$ --est $EST_PCD_PATH$

E.g.,

python analysis.py --gt /home/shapelim/erasor_paper_pcds/gt/05_voxel_0_2.pcd --est /home/shapelim/erasor_paper_pcds/estimate/05_ERASOR.pcd

NOTE: To estimate PR/RR precisely, it is better to use the denser pcd file generated in the mapgen.launch procedure.
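
For readers who want to see what the two metrics measure, here is a minimal Python 3 sketch. It is not the repository's analysis.py. It assumes the voxel-wise definitions from the ERASOR paper: PR is the share of static voxels of the ground-truth map that are kept in the estimated map, and RR is one minus the share of dynamic voxels that remain. The function names, the tuple layouts, and the handling of empty sets are hypothetical choices for this example.

```python
import math

# Dynamic class IDs as listed in the rosbag description.
DYNAMIC_CLASSES = frozenset(range(252, 260))


def voxel_key(x, y, z, voxel_size):
    """Map a 3D coordinate to the integer index of the voxel that contains it."""
    return (
        math.floor(x / voxel_size),
        math.floor(y / voxel_size),
        math.floor(z / voxel_size),
    )


def pr_rr(gt_points, est_points, voxel_size=0.2):
    """Return (PR, RR) in percent.

    gt_points:  iterable of (x, y, z, label) from the labeled ground-truth map.
    est_points: iterable of (x, y, z) from the estimated static map.
    Assumption: a voxel counts as preserved if the estimate has any point in it.
    """
    static_voxels, dynamic_voxels = set(), set()
    for x, y, z, label in gt_points:
        key = voxel_key(x, y, z, voxel_size)
        if int(label) in DYNAMIC_CLASSES:
            dynamic_voxels.add(key)
        else:
            static_voxels.add(key)

    est_voxels = {voxel_key(x, y, z, voxel_size) for x, y, z in est_points}

    preserved_static = len(static_voxels & est_voxels)
    remaining_dynamic = len(dynamic_voxels & est_voxels)

    # Guard against empty sets so the sketch never divides by zero.
    pr = 100.0 * preserved_static / len(static_voxels) if static_voxels else 0.0
    rr = 100.0 * (1.0 - remaining_dynamic / len(dynamic_voxels)) if dynamic_voxels else 100.0
    return pr, rr
```

With two static and two dynamic ground-truth voxels, an estimate that keeps one of each yields PR = 50 and RR = 50. A perfect result keeps every static voxel and no dynamic voxel, giving PR = 100 and RR = 100.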

Benchmark

The error metrics differ slightly from those in the paper:

Seq.	PR [%]	RR [%]
00	91.72	97.00
01	91.93	94.63
02	81.08	99.11
05	86.98	97.88
07	92.00	98.33

But we provide all pcd files, so don't worry! See the Visualization of All the State-of-the-arts section.

Run Your Own Code

⚠️ TBU: The code is already in this repository, yet the explanation is incomplete.

Visualization of All the State-of-the-arts

First, download all pcd materials.

wget https://urserver.kaist.ac.kr/publicdata/erasor/erasor_paper_pcds.zip
unzip erasor_paper_pcds.zip

Set the following parameters in config/viz_params.yaml.
- abs_dir: The absolute path of the pcd directory
- seq: Target sequence (00, 01, 02, 05, or 07)
After setting the parameters, launch the following command:

roslaunch erasor compare_results.launch

Then you can inspect all pcd results that are already parsed into static points and dynamic points.
Citation

If you use our code or method in your work, please consider citing the following:

@article{lim2021erasor,
title={ERASOR: Egocentric Ratio of Pseudo Occupancy-Based Dynamic Object Removal for Static 3D Point Cloud Map Building},
author={Lim, Hyungtae and Hwang, Sungwon and Myung, Hyun},
journal={IEEE Robotics and Automation Letters},
volume={6},
number={2},
pages={2272--2279},
year={2021},
publisher={IEEE}
}
