[arXiv'22] Panoptic NeRF: 3D-to-2D Label Transfer for Panoptic Urban Scene Segmentation

Last update: Dec 16, 2022

Related tags

Overview

Panoptic NeRF

Project Page | Paper | Dataset

Panoptic NeRF: 3D-to-2D Label Transfer for Panoptic Urban Scene Segmentation
Xiao Fu*, Shangzhan zhang*, Tianrun Chen, Yichong Lu, Lanyun Zhu, Xiaowei Zhou, Andreas Geiger, Yiyi Liao
arXiv 2022

Installation

Create a virtual environment via conda.

conda create -n panopticnerf python=3.7
conda activate panopticnerf

Install torch and torchvision.

conda install pytorch torchvision torchaudio cudatoolkit=11.3 -c pytorch

Install requirements.
```
pip install -r requirements.txt
```

Data Preparation

We evaluate our model on KITTI-360. Here we show the structure of a test dataset as follow. You can download it from here and then put it into $ROOT (RGBs should query the KITTI-360 website).

├── KITTI-360
  ├── 2013_05_28_drive_0000_sync
    ├── image_00
    ├── image_01
  ├── bbx_intersection
    ├── *_00.npz
    ├── *_01.npz
  ├── calibration
    ├── calib_cam_to_pose.txt
    ├── perspective.txt
  ├── data_3d_bboxes
  ├── data_poses
    ├── cam0_to_world.txt
    ├── poses.txt
  ├── pspnet
  ├── sgm
  ├── visible_id

file	Intro
`image_00/01`	stereo RGB images
`pspnet`	2D pseudo ground truth
`sgm`	weak stereo depth supervision
`visible_id`	per-frame bounding primitive IDs
`data_poses`	system poses in a global Euclidean coordinate
`calibration`	extrinsics and intrinsics of the perspective cameras
`bbx_intersection`	ray-mesh intersections, containing depths between hitting points and camera origin, semantic label IDs and bounding primitive IDs

Generate ray-mesh intersections (bbx_intersection/*.npz). The red dots and blue dots indicate where the rays hit into and out of the meshes, respectively. For the given test scene, START=3353, NUM=64.

# image_00
python mesh_intersection.py intersection_start_frame ${START} intersection_frames ${NUM} use_stereo False
# image_01
python mesh_intersection.py intersection_start_frame ${START} intersection_frames ${NUM} use_stereo True

Evaluate the origin of a scene (center_pose) and the distance from the origin to the furthest bounding primitive (dist_min). Then accordingly modify the .yaml file.
```
python recenter_pose.py recenter_start_frame ${START} recenter_frames ${NUM}
```

Training and Visualization

We provide the training code. Replace resume False with resume True to load the pretained model.

python train_net.py --cfg_file configs/panopticnerf_test.yaml pretrain nerf gpus '1,' use_stereo True use_pspnet True use_depth True pseudo_filter True weight_th 0.05 resume False

Render semantic map, panoptic map and depth map in a single forward pass, which takes around 10s per-frame on a single 3090 GPU. Please make sure to maximize the GPU memory utilization by increasing the size of the chunk to reduce inference time. Replace use_stereo False with use_stereo True to render the right views.
```
python run.py --type visualize --cfg_file configs/panopticnerf_test.yaml use_stereo False
```
Visualize novel view appearance & label synthesis. Before rendering, select a frame and generate corresponding ray-mesh intersections with respect to its novel spiral poses by enabling spiral poses==True in lib.datasets.kitti360.panopticnerf.py.

Evaluation

├── KITTI-360
  ├── gt_2d_semantics
  ├── gt_2d_panoptics
  ├── lidar_depth

Download the corresponding pretrained model and put it to $ROOT/data/trained_model/panopticnerf/panopticnerf_test/latest.pth.
We provide some semantic & panoptic GTs and LiDAR point clouds for evaluation. The details of evaluation metrics can be found in the paper.
Eval mean intersection-over-union (mIoU)

python run.py --type eval_miou --cfg_file configs/panopticnerf_test.yaml use_stereo False

Eval panoptic quality (PQ)

sh eval_pq_test.sh

Eval depth with 0-100m LiDAR point clouds, where the far depth can be adjusted to evaluate the closer scene.

python run.py --type eval_depth --cfg_file configs/panopticnerf_test.yaml use_stereo False max_depth 100.

Eval Multi-view Consistency (MC)

python eval_consistency.py --cfg_file configs/panopticnerf_test.yaml use_stereo False consistency_thres 0.1

News

12/04/2022 Code released.
29/03/2022 Repo created. Code will come soon.

Citation

@article{fu2022panoptic,
  title={Panoptic NeRF: 3D-to-2D Label Transfer for Panoptic Urban Scene Segmentation},
  author={Fu, Xiao and Zhang, Shangzhan and Chen, Tianrun and Lu, Yichong and Zhu, Lanyun and Zhou, Xiaowei and Geiger, Andreas and Liao, Yiyi},
  journal={arXiv preprint arXiv:2203.15224},
  year={2022}
}

[arXiv'22] Panoptic NeRF: 3D-to-2D Label Transfer for Panoptic Urban Scene Segmentation

Related tags

Overview

Panoptic NeRF

Project Page | Paper | Dataset

Installation

Data Preparation

Training and Visualization

Evaluation

News

Citation

Owner

Xiao Fu

Code for our EMNLP 2021 paper “Heterogeneous Graph Neural Networks for Keyphrase Generation”

Source code and Dataset creation for the paper "Neural Symbolic Regression That Scales"

FairyTailor: Multimodal Generative Framework for Storytelling

Studying Python release adoptions by looking at PyPI downloads

EasyMocap is an open-source toolbox for markerless human motion capture from RGB videos.

Official repository for ABC-GAN

blind SQLIpy sebuah alat injeksi sql yang menggunakan waktu sql untuk mendapatkan sebuah server database.

Pop-Out Motion: 3D-Aware Image Deformation via Learning the Shape Laplacian (CVPR 2022)

DecoupledNet is semantic segmentation system which using heterogeneous annotations

RL-driven agent playing tic-tac-toe on starknet against challengers.

Joint parameterization and fitting of stroke clusters

A collection of scripts I developed for personal and working projects.

NVIDIA Deep Learning Examples for Tensor Cores

A 10000+ hours dataset for Chinese speech recognition

Official code for our CVPR '22 paper "Dataset Distillation by Matching Training Trajectories"

Artstation-Artistic-face-HQ Dataset (AAHQ)

[ICML 2021] A fast algorithm for fitting robust decision trees.

This is a beginner-friendly repo to make a collection of some unique and awesome projects. Everyone in the community can benefit & get inspired by the amazing projects present over here.

Improving Query Representations for DenseRetrieval with Pseudo Relevance Feedback:A Reproducibility Study.

Implementation of the final project of the course DDA6309 Probabilistic Graphical Model