Patch-Based Deep Autoencoder for Point Cloud Geometry Compression

Last update: Dec 05, 2022

Overview

The growing number of 3D applications makes point cloud compression more important than ever. In this paper, we propose a patch-based compression process using deep learning, focusing on lossy point cloud geometry compression. Unlike existing point cloud compression networks, which apply feature extraction and reconstruction to the entire point cloud, we divide the point cloud into patches and compress each patch independently. During decoding, we assemble the decompressed patches into a complete point cloud. In addition, we train our network with a patch-to-patch criterion, i.e., we use the local reconstruction loss for optimization to approximate global reconstruction optimality. Our method outperforms the state of the art in rate-distortion performance, especially at low bitrates. Moreover, the proposed compression process guarantees that the output contains the same number of points as the input. The network model can be easily applied to other point cloud reconstruction problems, such as upsampling.
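
The patch division described above can be illustrated with a small, dependency-free sketch. It is not the repository's code: it assumes that patch centers are picked by farthest point sampling and that each patch holds the K nearest neighbors of its center, expressed relative to that center. The function names farthest_point_sampling and split_into_patches are hypothetical.

```python
import random

def sq_dist(p, q):
    """Squared Euclidean distance between two 3D points."""
    return sum((a - b) ** 2 for a, b in zip(p, q))

def farthest_point_sampling(points, n_centers, start=0):
    """Pick n_centers indices, each new one far from those already chosen."""
    chosen = [start]
    # Distance from every point to its nearest chosen center so far.
    min_d = [sq_dist(p, points[start]) for p in points]
    while len(chosen) < n_centers:
        nxt = max(range(len(points)), key=min_d.__getitem__)
        chosen.append(nxt)
        min_d = [min(d, sq_dist(p, points[nxt])) for d, p in zip(min_d, points)]
    return chosen

def split_into_patches(points, n_patches, k):
    """Return n_patches patches of k points each (patches may overlap)."""
    patches = []
    for c in farthest_point_sampling(points, n_patches):
        center = points[c]
        nearest = sorted(points, key=lambda p: sq_dist(p, center))[:k]
        # Assumption: coordinates are stored relative to the patch center.
        patches.append([tuple(a - b for a, b in zip(p, center)) for p in nearest])
    return patches

random.seed(0)
cloud = [(random.random(), random.random(), random.random()) for _ in range(256)]
patches = split_into_patches(cloud, n_patches=8, k=32)
print(len(patches), len(patches[0]))  # 8 32
```

Each patch can then be encoded and decoded on its own, and the decoded patches shifted back by their centers and concatenated to form the full point cloud.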

Environment

Python 3.9.6 and PyTorch 1.9.0

Other dependencies:

pytorch3d 0.5.0 for KNN and chamfer loss: https://github.com/facebookresearch/pytorch3d

geo_dist for point to plane evaluation: https://github.com/mauriceqch/geo_dist

*Due to unforeseen circumstances, we rewrote the experimental code using a different environment and dependencies from those used in the paper. The training parameters and experimental results may therefore differ slightly.
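
For readers unfamiliar with the chamfer loss listed among the dependencies, the following pure-Python sketch shows what it measures between a patch and its reconstruction. It uses the squared-distance, mean-reduced form; whether the training code uses exactly this variant is an assumption, and the real code relies on pytorch3d rather than on this helper.

```python
def sq_dist(p, q):
    """Squared Euclidean distance between two 3D points."""
    return sum((a - b) ** 2 for a, b in zip(p, q))

def chamfer_distance(xs, ys):
    """Mean squared nearest-neighbor distance, summed over both directions."""
    x_to_y = sum(min(sq_dist(x, y) for y in ys) for x in xs) / len(xs)
    y_to_x = sum(min(sq_dist(y, x) for x in xs) for y in ys) / len(ys)
    return x_to_y + y_to_x

patch = [(0.0, 0.0, 0.0), (1.0, 0.0, 0.0)]
recon = [(0.0, 0.0, 0.0), (1.0, 0.0, 1.0)]
same = chamfer_distance(patch, patch)
diff = chamfer_distance(patch, recon)
print(same, diff)  # 0.0 1.0
```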

Data Preparation

You need ModelNet40 and ShapeNet to reproduce our results. The following steps show a general way to prepare the point clouds used in our experiments.

ModelNet40

Download the ModelNet40 data: http://modelnet.cs.princeton.edu

Convert the CAD models (.off) to point clouds (.ply) using sample_modelnet.py:

python ./sample_modelnet.py ./data/ModelNet40 ./data/ModelNet40_pc_8192 --n_point 8192
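
As an illustration of what converting a mesh to a point cloud involves, here is a minimal sketch of area-weighted surface sampling. It is an assumption that sample_modelnet.py works along these lines; the sketch handles triangular faces only, and the function names are hypothetical.

```python
import math
import random

def triangle_area(a, b, c):
    """Area of triangle abc: half the magnitude of the cross product."""
    u = [b[i] - a[i] for i in range(3)]
    v = [c[i] - a[i] for i in range(3)]
    cx = u[1] * v[2] - u[2] * v[1]
    cy = u[2] * v[0] - u[0] * v[2]
    cz = u[0] * v[1] - u[1] * v[0]
    return 0.5 * math.sqrt(cx * cx + cy * cy + cz * cz)

def sample_surface(vertices, faces, n_point, rng):
    """Draw n_point points uniformly over the surface of a triangle mesh."""
    tris = [[vertices[i] for i in f] for f in faces]
    areas = [triangle_area(*t) for t in tris]
    points = []
    # Larger triangles are picked proportionally more often.
    for a, b, c in rng.choices(tris, weights=areas, k=n_point):
        r1, r2 = rng.random(), rng.random()
        s = math.sqrt(r1)
        # Barycentric weights that are uniform over the triangle.
        w0, w1, w2 = 1 - s, s * (1 - r2), s * r2
        points.append(tuple(w0 * a[i] + w1 * b[i] + w2 * c[i] for i in range(3)))
    return points

# Unit square in the z = 0 plane, split into two triangles.
verts = [(0, 0, 0), (1, 0, 0), (1, 1, 0), (0, 1, 0)]
faces = [(0, 1, 2), (0, 2, 3)]
pts = sample_surface(verts, faces, n_point=1000, rng=random.Random(0))
print(len(pts))  # 1000
```

Weighting by area keeps the point density even across the surface regardless of how finely the mesh is triangulated.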

ShapeNet

Download the ShapeNet data here

Sample point clouds using sample_shapenet.py:

python ./sample_shapenet.py ./data/shapenetcore_partanno_segmentation_benchmark_v0_normal ./data/ShapeNet_pc_2048 --n_point 2048

Training

We use train_ae.py to train an autoencoder on the ModelNet40 dataset:

python ./train_ae.py './data/ModelNet40_pc_8192/**/train/*.ply' './model/trained_128_16' --N 8192 --ALPHA 2 --K 128 --d 16
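
The file pattern is passed in quotes so that the shell does not expand it. The sketch below shows how such a pattern can be resolved in Python with a recursive glob; that train_ae.py does it this way is an assumption, and the directory and file names are made up for the demonstration.

```python
import glob
import os
import tempfile

# Build a tiny stand-in for ./data/ModelNet40_pc_8192 (file names are made up).
root = tempfile.mkdtemp()
for rel in ["chair/train/chair_0001.ply",
            "chair/test/chair_0900.ply",
            "sofa/train/sofa_0001.ply"]:
    path = os.path.join(root, *rel.split("/"))
    os.makedirs(os.path.dirname(path), exist_ok=True)
    open(path, "w").close()

# With recursive=True, "**" matches any number of directory levels.
pattern = os.path.join(root, "**", "train", "*.ply")
train_files = sorted(glob.glob(pattern, recursive=True))
print([os.path.relpath(f, root) for f in train_files])
```

Only the two files under a train directory are matched; the file under test is skipped.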

Compression and Decompression

We use compress.py and decompress.py to compress and decompress point clouds with our trained autoencoder. Take the compression of ModelNet40 as an example:

python ./compress.py './model/trained_128_16' './data/ModelNet40_pc_8192/**/test/*.ply' './data/ModelNet40_pc_8192_compressed_128_16' --ALPHA 2

python ./decompress.py './model/trained_128_16' './data/ModelNet40_pc_8192_compressed_128_16' './data/ModelNet40_pc_8192_decompressed_128_16'

Evaluation

The evaluation process uses the same software, geo_dist, as Quach's code. We use eval.py to measure reconstruction quality and check the bitrate of the compressed files.

python ./eval.py ../geo_dist/build/pc_error './data/ModelNet40_pc_8192/**/test/*.ply' './data/ModelNet40_pc_8192_compressed_128_16' './data/ModelNet40_pc_8192_decompressed_128_16' './eval/ModelNet40_128_16.csv'
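
Bitrate for point cloud geometry is commonly reported in bits per point (bpp). A minimal sketch of that calculation follows, assuming one compressed file per point cloud; the helper bits_per_point is hypothetical and is not part of eval.py.

```python
import os
import tempfile

def bits_per_point(compressed_path, n_points):
    """Bitrate in bits per input point: 8 * file size in bytes / number of points."""
    return 8 * os.path.getsize(compressed_path) / n_points

# Stand-in for a compressed file: 2048 bytes describing an 8192-point cloud.
with tempfile.NamedTemporaryFile(delete=False) as f:
    f.write(bytes(2048))
bpp = bits_per_point(f.name, 8192)
os.remove(f.name)
print(bpp)  # 2.0
```

If a point cloud were stored as several files, their sizes would be summed before dividing.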
