🔥RandLA-Net in Tensorflow (CVPR 2020, Oral & IEEE TPAMI 2021)

Last update: Dec 30, 2022

Overview

RandLA-Net: Efficient Semantic Segmentation of Large-Scale Point Clouds (CVPR 2020)

This is the official implementation of RandLA-Net (CVPR2020, Oral presentation), a simple and efficient neural architecture for semantic segmentation of large-scale 3D point clouds. For technical details, please refer to:

RandLA-Net: Efficient Semantic Segmentation of Large-Scale Point Clouds
Qingyong Hu, Bo Yang*, Linhai Xie, Stefano Rosa, Yulan Guo, Zhihua Wang, Niki Trigoni, Andrew Markham.
[Paper] [Video] [Blog] [Project page]

(1) Setup

This code has been tested with Python 3.5, Tensorflow 1.11, CUDA 9.0 and cuDNN 7.4.1 on Ubuntu 16.04.

Clone the repository

git clone --depth=1 https://github.com/QingyongHu/RandLA-Net && cd RandLA-Net

Setup python environment

conda create -n randlanet python=3.5
source activate randlanet
pip install -r helper_requirements.txt
sh compile_op.sh

Update 03/21/2020, pre-trained models and results are available now. You can download the pre-trained models and results here. Note that, please specify the model path in the main function (e.g., main_S3DIS.py) if you want to use the pre-trained model and have a quick try of our RandLA-Net.

(2) S3DIS

S3DIS dataset can be found here. Download the files named "Stanford3dDataset_v1.2_Aligned_Version.zip". Uncompress the folder and move it to /data/S3DIS.

Preparing the dataset:

python utils/data_prepare_s3dis.py

Start 6-fold cross validation:

sh jobs_6_fold_cv_s3dis.sh

Move all the generated results (*.ply) in /test folder to /data/S3DIS/results, calculate the final mean IoU results:

python utils/6_fold_cv.py

Quantitative results of different approaches on S3DIS dataset (6-fold cross-validation):

Qualitative results of our RandLA-Net:

(3) Semantic3D

7zip is required to uncompress the raw data in this dataset, to install p7zip:

sudo apt-get install p7zip-full

Download and extract the dataset. First, please specify the path of the dataset by changing the BASE_DIR in "download_semantic3d.sh"

sh utils/download_semantic3d.sh

Preparing the dataset:

python utils/data_prepare_semantic3d.py

Start training:

python main_Semantic3D.py --mode train --gpu 0

Evaluation:

python main_Semantic3D.py --mode test --gpu 0

Quantitative results of different approaches on Semantic3D (reduced-8):

Qualitative results of our RandLA-Net:

Note:

Preferably with more than 64G RAM to process this dataset due to the large volume of point cloud

(4) SemanticKITTI

SemanticKITTI dataset can be found here. Download the files related to semantic segmentation and extract everything into the same folder. Uncompress the folder and move it to /data/semantic_kitti/dataset.

Preparing the dataset:

python utils/data_prepare_semantickitti.py

Start training:

python main_SemanticKITTI.py --mode train --gpu 0

Evaluation:

sh jobs_test_semantickitti.sh

Quantitative results of different approaches on SemanticKITTI dataset:

Qualitative results of our RandLA-Net:

(5) Demo

Citation

If you find our work useful in your research, please consider citing:

@article{hu2019randla,
  title={RandLA-Net: Efficient Semantic Segmentation of Large-Scale Point Clouds},
  author={Hu, Qingyong and Yang, Bo and Xie, Linhai and Rosa, Stefano and Guo, Yulan and Wang, Zhihua and Trigoni, Niki and Markham, Andrew},
  journal={Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition},
  year={2020}
}

@article{hu2021learning,
  title={Learning Semantic Segmentation of Large-Scale Point Clouds with Random Sampling},
  author={Hu, Qingyong and Yang, Bo and Xie, Linhai and Rosa, Stefano and Guo, Yulan and Wang, Zhihua and Trigoni, Niki and Markham, Andrew},
  journal={IEEE Transactions on Pattern Analysis and Machine Intelligence},
  year={2021},
  publisher={IEEE}
}

Acknowledgment

Part of our code refers to nanoflann library and the the recent work KPConv.
We use blender to make the video demo.

License

Licensed under the CC BY-NC-SA 4.0 license, see LICENSE.

Updates

21/03/2020: Updating all experimental results
21/03/2020: Adding pretrained models and results
02/03/2020: Code available!
15/11/2019: Initial release！

🔥RandLA-Net in Tensorflow (CVPR 2020, Oral & IEEE TPAMI 2021)

Related tags

Overview

RandLA-Net: Efficient Semantic Segmentation of Large-Scale Point Clouds (CVPR 2020)

(1) Setup

(2) S3DIS

(3) Semantic3D

(4) SemanticKITTI

(5) Demo

Citation

Acknowledgment

License

Updates

Related Repos

Owner

Qingyong

Locally Enhanced Self-Attention: Rethinking Self-Attention as Local and Context Terms

Implementation of the paper "Shapley Explanation Networks"

Clairvoyance: a Unified, End-to-End AutoML Pipeline for Medical Time Series

For the paper entitled ''A Case Study and Qualitative Analysis of Simple Cross-Lingual Opinion Mining''

CM-NAS: Cross-Modality Neural Architecture Search for Visible-Infrared Person Re-Identification (ICCV2021)

Automatic Image Background Subtraction

Conformer: Local Features Coupling Global Representations for Visual Recognition

Language models are open knowledge graphs ( non official implementation )

InsCLR: Improving Instance Retrieval with Self-Supervision

Streaming Anomaly Detection Framework in Python (Outlier Detection for Streaming Data)

Code for "The Intrinsic Dimension of Images and Its Impact on Learning" - ICLR 2021 Spotlight

PyTorch implementation of ICLR 2022 paper PiCO: Contrastive Label Disambiguation for Partial Label Learning

Paddle implementation for "Cross-Lingual Word Embedding Refinement by ℓ1 Norm Optimisation" (NAACL 2021)

For storing the complete exploration of Visual Question Answering for our B.Tech Project

Self-Supervised Deep Blind Video Super-Resolution

MQBench Quantization Aware Training with PyTorch

这是一个yolo3-tf2的源码，可以用于训练自己的模型。

A toy project using OpenCV and PyMunk

FlowTorch is a PyTorch library for learning and sampling from complex probability distributions using a class of methods called Normalizing Flows

Data Preparation, Processing, and Visualization for MoVi Data