Diffusion Probabilistic Models for 3D Point Cloud Generation (CVPR 2021)

Last update: Jan 05, 2023

Overview

This is the official code repository for our CVPR 2021 paper "Diffusion Probabilistic Models for 3D Point Cloud Generation".

Installation

[Option 1] Install via conda environment YAML file (CUDA 10.1).

# Create the environment
conda env create -f env.yml
# Activate the environment
conda activate dpm-pc-gen

[Option 2] Alternatively, set up the environment manually (for example, if your GPU only works with CUDA 11 or later).

Our model only depends on the following commonly used packages, all of which can be installed via conda.

Package	Version
PyTorch	≥ 1.6.0
h5py	not specified (we used 4.61.1)
tqdm	not specified
tensorboard	not specified (we used 2.5.0)
numpy	not specified (we used 1.20.2)
scipy	not specified (we used 1.6.2)
scikit-learn	not specified (we used 0.24.2)
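After a manual setup, it can help to confirm what is actually installed. The following is a minimal sketch, not part of the repository: it assumes Python 3.8+ and that the packages are registered under their usual PyPI distribution names (`torch` for PyTorch). The helper names `installed_versions` and `version_tuple` are hypothetical.

```python
import re
from importlib import metadata

# Assumed distribution names for the packages in the table above.
REQUIRED = ["torch", "h5py", "tqdm", "tensorboard", "numpy", "scipy", "scikit-learn"]
MIN_TORCH = (1, 6, 0)  # from the table: PyTorch >= 1.6.0


def installed_versions(names):
    """Return {name: version string, or None if the package is not installed}."""
    versions = {}
    for name in names:
        try:
            versions[name] = metadata.version(name)
        except metadata.PackageNotFoundError:
            versions[name] = None
    return versions


def version_tuple(version):
    """Parse leading numeric components, ignoring local tags such as '+cu101'."""
    parts = []
    for piece in version.split("+")[0].split("."):
        match = re.match(r"\d+", piece)
        if match is None:
            break
        parts.append(int(match.group()))
    return tuple(parts)


if __name__ == "__main__":
    found = installed_versions(REQUIRED)
    for name, version in found.items():
        print(f"{name:>14}: {version or 'MISSING'}")
    torch_version = found["torch"]
    if torch_version is not None and version_tuple(torch_version) < MIN_TORCH:
        print("PyTorch is older than 1.6.0")
```

Only PyTorch has a stated minimum version, so the sketch checks that one requirement and simply reports the rest.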

About the EMD Metric

We have removed the EMD module due to GPU compatibility issues. The legacy code can be found on the emd-cd branch.

If you have to compute the EMD score or compare our model with others, we strongly advise you to use your own code to compute the metrics. The generation and decoding results will be saved to the results folder after each test run.
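As a starting point for your own metric code, here is a minimal CPU sketch of Chamfer distance (CD) and EMD using NumPy and SciPy. It is not the legacy implementation from the emd-cd branch. The conventions below (mean squared distance for CD, mean Euclidean distance under an exact one-to-one matching for EMD) are assumptions, so results may differ from other codebases by a normalization factor. Both inputs are assumed to be arrays of shape (N, 3).

```python
import numpy as np
from scipy.optimize import linear_sum_assignment
from scipy.spatial.distance import cdist


def chamfer_distance(a, b):
    """Symmetric Chamfer distance between clouds a (N, 3) and b (M, 3).

    Assumed convention: mean squared nearest-neighbor distance, summed
    over both directions.
    """
    d2 = cdist(a, b, metric="sqeuclidean")  # (N, M) pairwise squared distances
    return d2.min(axis=1).mean() + d2.min(axis=0).mean()


def earth_movers_distance(a, b):
    """EMD between two clouds with the same number of points.

    Assumed convention: mean Euclidean distance under the optimal
    one-to-one matching, found exactly with the Hungarian algorithm.
    """
    if a.shape != b.shape:
        raise ValueError("EMD with one-to-one matching needs equally sized clouds")
    cost = cdist(a, b)  # (N, N) pairwise Euclidean distances
    rows, cols = linear_sum_assignment(cost)
    return cost[rows, cols].mean()


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    generated = rng.normal(size=(256, 3))
    reference = rng.normal(size=(256, 3))
    print("CD :", chamfer_distance(generated, reference))
    print("EMD:", earth_movers_distance(generated, reference))
```

The exact matching scales roughly cubically with the number of points, so evaluating many pairs of large clouds this way is slow. GPU implementations usually rely on approximate solvers instead.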

Datasets and Pretrained Models

Datasets and pretrained models are available at: https://drive.google.com/drive/folders/1Su0hCuGFo1AGrNb_VMNnlF7qeQwKjfhZ

Training

# Train an auto-encoder
python train_ae.py 

# Train a generator
python train_gen.py

You can override the default argument values on the command line. The available arguments are listed in each script.

Note that --categories can take all (use all the categories in the dataset), airplane, chair (use a single category), or airplane,chair (use multiple categories, separated by commas).
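The three accepted forms of --categories can be illustrated with a small argparse sketch. This is a hypothetical parser written for illustration only: the function `parse_categories` and its validation rules are assumptions, and the repository's scripts may handle the argument differently.

```python
import argparse


def parse_categories(value):
    """Turn 'all', 'airplane', or 'airplane,chair' into a list of names."""
    names = [name.strip() for name in value.split(",") if name.strip()]
    if not names:
        raise argparse.ArgumentTypeError("expected at least one category")
    if "all" in names and len(names) > 1:
        # Assumed rule: 'all' already covers every category.
        raise argparse.ArgumentTypeError("'all' cannot be combined with other categories")
    return names


parser = argparse.ArgumentParser()
parser.add_argument("--categories", type=parse_categories, default=["all"])

# Parse an explicit argument list instead of sys.argv to keep the example self-contained.
args = parser.parse_args(["--categories", "airplane,chair"])
print(args.categories)  # ['airplane', 'chair']
```

A single category yields a one-element list, and omitting the flag falls back to `['all']` in this sketch.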

Testing

# Test an auto-encoder
python test_ae.py --ckpt ./pretrained/AE_all.pt --categories all

# Test a generator
python test_gen.py --ckpt ./pretrained/GEN_airplane.pt --categories airplane

Citation

@inproceedings{luo2021diffusion,
  author = {Luo, Shitong and Hu, Wei},
  title = {Diffusion Probabilistic Models for 3D Point Cloud Generation},
  booktitle = {Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)},
  month = {June},
  year = {2021}
}
