Implementation of the "PSTNet: Point Spatio-Temporal Convolution on Point Cloud Sequences" paper.

Last update: Dec 09, 2022

Overview

PSTNet: Point Spatio-Temporal Convolution on Point Cloud Sequences

Introduction

Point cloud sequences are irregular and unordered in the spatial dimension while exhibiting regularities and order in the temporal dimension. Therefore, existing grid based convolutions for conventional video processing cannot be directly applied to spatio-temporal modeling of raw point cloud sequences. In the paper, we propose a point spatio-temporal (PST) convolution to achieve informative representations of point cloud sequences. The proposed PST convolution first disentangles space and time in point cloud sequences. Then, a spatial convolution is employed to capture the local structure of points in the 3D space, and a temporal convolution is used to model the dynamics of the spatial regions along the time dimension. Furthermore, we incorporate the proposed PST convolution into a deep network, namely PSTNet, to extract features of 3D point cloud sequences in a spatio-temporally hierarchical manner.

Installation

The code is tested with Red Hat Enterprise Linux Workstation release 7.7 (Maipo), g++ (GCC) 8.3.1, PyTorch v1.2, CUDA 10.2 and cuDNN v7.6.

Install PyTorch v1.2:

pip install torch==1.2.0 torchvision==0.4.0

Compile the CUDA layers for PointNet++, which we used for furthest point sampling (FPS) and radius neighbouring search:

cd modules
python setup.py install

To see if the compilation is successful, try to run python modules/pst_convolutions.py to see if a forward pass works.

Install Mayavi for point cloud visualization (optional). Desktop is required.

Citation

If you find our work useful in your research, please consider citing:

@inproceedings{fan2021pstnet,
    title={PSTNet: Point Spatio-Temporal Convolution on Point Cloud Sequences},
    author={Hehe Fan and Xin Yu and Yuhang Ding and Yi Yang and Mohan Kankanhalli},
    booktitle={International Conference on Learning Representations},
    year={2021}
}

Related Repos

PointNet++ PyTorch implementation: https://github.com/facebookresearch/votenet/tree/master/pointnet2
MeteorNet: https://github.com/xingyul/meteornet
3DV: https://github.com/3huo/3DV-Action

Implementation of the "PSTNet: Point Spatio-Temporal Convolution on Point Cloud Sequences" paper.

Related tags

Overview

PSTNet: Point Spatio-Temporal Convolution on Point Cloud Sequences

Introduction

Installation

Citation

Related Repos

Owner

Hehe Fan

Improving Calibration for Long-Tailed Recognition (CVPR2021)

NaijaSenti is an open-source sentiment and emotion corpora for four major Nigerian languages

Capstone-Project-2 - A game program written in the Python language

Project page for the paper Semi-Supervised Raw-to-Raw Mapping 2021.

Official implementation for Multi-Modal Interaction Graph Convolutional Network for Temporal Language Localization in Videos

This repository contains code, network definitions and pre-trained models for working on remote sensing images using deep learning

NeuralForecast is a Python library for time series forecasting with deep learning models

Image Super-Resolution by Neural Texture Transfer

git《Learning Pairwise Inter-Plane Relations for Piecewise Planar Reconstruction》(ECCV 2020) GitHub:

Python Implementation of the CoronaWarnApp (CWA) Event Registration

Consistency Regularization for Adversarial Robustness

CTRL-C: Camera calibration TRansformer with Line-Classification

GitHub repository for the ICLR Computational Geometry & Topology Challenge 2021

3D-CariGAN: An End-to-End Solution to 3D Caricature Generation from Normal Face Photos

BasicVSR: The Search for Essential Components in Video Super-Resolution and Beyond

Diverse Image Captioning with Context-Object Split Latent Spaces (NeurIPS 2020)

Unofficial implementation of Point-Unet: A Context-Aware Point-Based Neural Network for Volumetric Segmentation

This Artificial Intelligence program can take a black and white/grayscale image and generate a realistic or plausible colorized version of the same picture.

MediaPipe Kullanarak İleri Seviye Bilgisayarla Görü

Code for classifying international patents based on the text of their titles/abstracts