Diffusion Probabilistic Models for 3D Point Cloud Generation (CVPR 2021)

Last update: Jan 05, 2023

The official code repository for our CVPR 2021 paper "Diffusion Probabilistic Models for 3D Point Cloud Generation".

Installation

[Option 1] Install via conda environment YAML file (CUDA 10.1).

# Create the environment
conda env create -f env.yml
# Activate the environment
conda activate dpm-pc-gen

[Option 2] Or set up the environment manually (for example, if your GPU only works with CUDA 11 or later).

Our model only depends on the following commonly used packages, all of which can be installed via conda.

Package	Version
PyTorch	≥ 1.6.0
h5py	not specified (we used 4.61.1)
tqdm	not specified
tensorboard	not specified (we used 2.5.0)
numpy	not specified (we used 1.20.2)
scipy	not specified (we used 1.6.2)
scikit-learn	not specified (we used 0.24.2)
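
After a manual install, it can help to confirm which versions actually ended up in the environment. The following is a minimal sketch, not part of the repository: the helper name `installed_versions` is hypothetical, and the distribution names are assumed to be the usual PyPI ones (PyTorch is published as `torch`).

```python
from importlib import metadata

# Assumed distribution names; PyTorch is distributed as "torch".
PACKAGES = ["torch", "h5py", "tqdm", "tensorboard", "numpy", "scipy", "scikit-learn"]


def installed_versions(names):
    """Return {name: version string, or None if the package is not installed}."""
    versions = {}
    for name in names:
        try:
            versions[name] = metadata.version(name)
        except metadata.PackageNotFoundError:
            versions[name] = None
    return versions


if __name__ == "__main__":
    # Print one line per package so the result can be compared with the table.
    for name, version in installed_versions(PACKAGES).items():
        print(f"{name:<14}{version or 'MISSING'}")
```

Run it inside the activated environment and compare the output with the table above.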

About the EMD Metric

We have removed the EMD module due to GPU compatibility issues. The legacy code can be found on the emd-cd branch.

If you need to compute the EMD score or compare our model with others, we strongly advise using your own code to compute the metrics. The generation and decoding results are saved to the results folder after each test run.
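
As a starting point for writing your own metric code, here is a minimal CPU sketch of Chamfer distance and exact EMD between two point clouds, using only numpy and scipy from the package list above. It is not the removed module and may not match the conventions used in the paper: the function names are hypothetical, Chamfer uses squared distances averaged per set, and EMD assumes both clouds have the same number of points.

```python
import numpy as np
from scipy.optimize import linear_sum_assignment
from scipy.spatial.distance import cdist


def chamfer_distance(a, b):
    """Symmetric Chamfer distance between point sets a (N, 3) and b (M, 3).

    Assumption: squared Euclidean distances, averaged over each set.
    Other conventions (unsquared, summed) exist.
    """
    d = cdist(a, b, metric="sqeuclidean")
    # Nearest neighbour in b for each point of a, and vice versa.
    return d.min(axis=1).mean() + d.min(axis=0).mean()


def emd(a, b):
    """Exact EMD between two equal-sized point sets via optimal matching."""
    if a.shape != b.shape:
        raise ValueError("exact matching needs equal-sized point sets")
    d = cdist(a, b, metric="euclidean")
    # Hungarian-style assignment: one-to-one matching with minimal total cost.
    rows, cols = linear_sum_assignment(d)
    return d[rows, cols].mean()


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    cloud = rng.normal(size=(256, 3))
    shifted = cloud + np.array([0.1, 0.0, 0.0])
    print("CD :", chamfer_distance(cloud, shifted))
    print("EMD:", emd(cloud, shifted))
```

The exact assignment scales roughly cubically with the number of points, so it is practical for a few thousand points per cloud but slow for large evaluation sets; approximate GPU solvers trade accuracy for speed.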

Datasets and Pretrained Models

Datasets and pretrained models are available at: https://drive.google.com/drive/folders/1Su0hCuGFo1AGrNb_VMNnlF7qeQwKjfhZ

Training

# Train an auto-encoder
python train_ae.py 

# Train a generator
python train_gen.py

You may override the default argument values on the command line. The available arguments are listed in each script.

Note that --categories can take all (use all the categories in the dataset), airplane, chair (use a single category), or airplane,chair (use multiple categories, separated by commas).
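
The accepted forms of --categories can be illustrated with a small parsing sketch. This is an assumption about how such a value might be handled, not the code in the training scripts: `parse_categories` and `build_parser` are hypothetical names, and `all` is passed through unchanged because the full category list is not given here.

```python
import argparse


def parse_categories(value):
    """Split a --categories string into a list of category names.

    Hypothetical helper: 'all' is returned as ['all'] and left for the
    dataset code to expand.
    """
    names = [name.strip() for name in value.split(",") if name.strip()]
    if not names:
        raise ValueError("expected at least one category")
    return names


def build_parser():
    """Build a parser that accepts the three forms described above."""
    parser = argparse.ArgumentParser()
    parser.add_argument("--categories", type=parse_categories, default=["all"])
    return parser


if __name__ == "__main__":
    args = build_parser().parse_args(["--categories", "airplane,chair"])
    print(args.categories)  # ['airplane', 'chair']
```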

Testing

# Test an auto-encoder
python test_ae.py --ckpt ./pretrained/AE_all.pt --categories all

# Test a generator
python test_gen.py --ckpt ./pretrained/GEN_airplane.pt --categories airplane

Citation

@inproceedings{luo2021diffusion,
  author = {Luo, Shitong and Hu, Wei},
  title = {Diffusion Probabilistic Models for 3D Point Cloud Generation},
  booktitle = {Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)},
  month = {June},
  year = {2021}
}
