Equivariant CNNs for the sphere and SO(3) implemented in PyTorch

Last update: Dec 28, 2022

⚠️ This code is old and does not support the latest versions of PyTorch, in particular since the change in the FFT interface. ⚠️

Spherical CNNs

Overview

This library contains a PyTorch implementation of the rotation-equivariant CNNs for spherical signals (e.g. omnidirectional images, signals on the globe) presented in [1]. Equivariant networks for the plane are available here.

Dependencies

PyTorch: http://pytorch.org/ (>= 0.4.0)
cupy: https://github.com/cupy/cupy
lie_learn: https://github.com/AMLab-Amsterdam/lie_learn
pynvrtc: https://github.com/NVIDIA/pynvrtc

Commands to install all the dependencies in a new conda environment:

conda create --name cuda9 python=3.6 
conda activate cuda9

# s2cnn deps
#conda install pytorch torchvision cuda90 -c pytorch # get correct command line at http://pytorch.org/
conda install -c anaconda cupy  
pip install pynvrtc joblib

# lie_learn deps
conda install -c anaconda cython  
conda install -c anaconda requests  

# shrec17 example dep
conda install -c anaconda scipy  
conda install -c conda-forge rtree shapely  
conda install -c conda-forge pyembree  
pip install "trimesh[easy]"

Installation

To install, run

$ python setup.py install

Usage

Please have a look at the examples.

Please cite [1] in your work when using this library in your experiments.

Design choices for Spherical CNN Architectures

Spherical CNNs come with different choices of grids and grid hyperparameters which, at first glance, are not obviously related to those of conventional CNNs. The s2_near_identity_grid and so3_near_identity_grid are the preferred choices since they correspond to spatially localized kernels, defined at the north pole and rotated over the sphere via the action of SO(3). In contrast, s2_equatorial_grid and so3_equatorial_grid define line-like (or ring-like) kernels around the equator.
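The difference between the two grid families can be illustrated with a small stand-alone sketch. It does not call the library: the helper names sphere_point and ring, and the chosen angles, are assumptions made purely for illustration. A ring at a small colatitude β stays close to the north pole (a localized kernel), while a ring at β = π/2 lies on the equator.

```python
import math

def sphere_point(beta, alpha):
    """Cartesian coordinates of the point with colatitude beta and azimuth alpha."""
    return (math.sin(beta) * math.cos(alpha),
            math.sin(beta) * math.sin(alpha),
            math.cos(beta))

def ring(beta, n_alpha):
    """n_alpha points equally spaced in azimuth on the circle of colatitude beta."""
    return [sphere_point(beta, 2 * math.pi * i / n_alpha) for i in range(n_alpha)]

# Hypothetical values chosen for illustration only.
near_identity_ring = ring(beta=math.pi / 16, n_alpha=8)  # close to the north pole
equatorial_ring = ring(beta=math.pi / 2, n_alpha=8)      # on the equator

# z is the height above the equatorial plane; z = 1 is the north pole.
min_z_near = min(p[2] for p in near_identity_ring)
max_abs_z_equator = max(abs(p[2]) for p in equatorial_ring)
print(f"near-identity ring: z >= {min_z_near:.3f}")
print(f"equatorial ring: |z| <= {max_abs_z_equator:.1e}")
```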

To clarify the possible parameter choices for s2_near_identity_grid:

max_beta:

Sets the size of the kernel as an angle measured from the north pole. Conventional CNNs on flat space usually use a fixed kernel size but pool the signal spatially. This spatial pooling gives the kernels in later layers an effectively larger field of view. One can emulate pooling by a factor of 2 in spherical CNNs by decreasing the signal bandwidth by a factor of 2 and increasing max_beta by a factor of 2.
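The pooling analogy can be written down as a simple per-layer schedule. The sketch below is not part of the library: pooling_schedule and the starting values are hypothetical. Each step halves the bandwidth and doubles max_beta, so the angular extent of the kernel grows as the signal resolution shrinks.

```python
import math

def pooling_schedule(bandwidth, max_beta, n_layers):
    """Return a list of (bandwidth, max_beta) pairs, one per layer.

    Each layer halves the bandwidth and doubles max_beta, emulating
    pooling by a factor of 2 between consecutive layers.
    """
    schedule = []
    for _ in range(n_layers):
        schedule.append((bandwidth, max_beta))
        bandwidth = max(1, bandwidth // 2)     # coarser signal resolution
        max_beta = min(math.pi, 2 * max_beta)  # wider kernel, capped at the south pole
    return schedule

# Hypothetical starting point: bandwidth 32 and a kernel radius of pi/32.
for b, mb in pooling_schedule(bandwidth=32, max_beta=math.pi / 32, n_layers=4):
    print(f"bandwidth={b:3d}  max_beta=pi/{math.pi / mb:.0f}")
```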

n_beta:

Number of rings of the kernel around the north pole, equally spaced in [β=0, β=max_beta]. The choice n_beta=1 corresponds to a small 3x3 kernel in conv2d since in both cases the resulting kernel consists of one central pixel and one ring around the center.

n_alpha:

Gives the number of learned parameters on each ring around the pole. By default these values are equally spaced in azimuth. A sensible number of values depends on the bandwidth and on max_beta, since a higher resolution or a larger spatial extent allows finer kernels to be sampled without aliasing. In practice this value is typically set to a constant, low value such as 6 or 8. A reduced signal bandwidth is then compensated by an increased max_beta to emulate spatial pooling.
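How max_beta, n_beta and n_alpha combine into a grid can be sketched without the library. The function below is an illustrative stand-in, not the library's s2_near_identity_grid: it assumes rings at colatitudes k * max_beta / n_beta for k = 1..n_beta and n_alpha azimuth samples per ring, with one sample point per (ring, azimuth) pair.

```python
import math

def near_identity_grid_sketch(max_beta=math.pi / 8, n_alpha=8, n_beta=3):
    """Return (beta, alpha) pairs for a kernel localized at the north pole."""
    # Assumption: ring k sits at colatitude k * max_beta / n_beta.
    betas = [k * max_beta / n_beta for k in range(1, n_beta + 1)]
    # Azimuth samples equally spaced in [0, 2*pi), endpoint excluded.
    alphas = [2 * math.pi * i / n_alpha for i in range(n_alpha)]
    return [(beta, alpha) for beta in betas for alpha in alphas]

grid = near_identity_grid_sketch(max_beta=math.pi / 8, n_alpha=6, n_beta=2)
print(len(grid))  # n_beta * n_alpha sample points
```

Under these assumptions the number of sample points grows as n_beta * n_alpha, which is one reason to keep n_alpha at a low constant value.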

The so3_near_identity_grid has two additional parameters max_gamma and n_gamma. SO(3) can be seen as a (principal) fiber bundle SO(3)→S² with the sphere S² as base space and fiber SO(2) attached to each point. The additional parameters control the grid on the fiber in the following way:

max_gamma:

The kernel spans the fiber SO(2) over γ∈[0, max_gamma]. The fiber SO(2) encodes the kernel responses for every sampled orientation at a given position on the sphere. Setting max_gamma < 2π results in the kernel not seeing the responses of all kernel orientations simultaneously and is generally discouraged. Steerable CNNs [3] usually use max_gamma=2π.

n_gamma:

Number of learned parameters on the fiber. Typically set equal to n_alpha, i.e. to a low value like 6 or 8.
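The role of the fiber coordinate γ can be made concrete with a small sketch. It assumes a ZYZ Euler-angle parameterization R(α, β, γ) = Z(α) Y(β) Z(γ), which is a common convention but an assumption here, and all helper names are hypothetical. Rotating the north pole by R gives a point that depends only on (α, β); γ only turns the kernel about that point. The grid function likewise assumes γ samples equally spaced in [0, max_gamma) and is not the library's so3_near_identity_grid.

```python
import math

def rot_z(t):
    """Rotation by angle t about the z axis."""
    c, s = math.cos(t), math.sin(t)
    return [[c, -s, 0.0], [s, c, 0.0], [0.0, 0.0, 1.0]]

def rot_y(t):
    """Rotation by angle t about the y axis."""
    c, s = math.cos(t), math.sin(t)
    return [[c, 0.0, s], [0.0, 1.0, 0.0], [-s, 0.0, c]]

def matmul(a, b):
    """Product of two 3x3 matrices given as nested lists."""
    return [[sum(a[i][k] * b[k][j] for k in range(3)) for j in range(3)]
            for i in range(3)]

def zyz(alpha, beta, gamma):
    """Assumed parameterization: R = Z(alpha) Y(beta) Z(gamma)."""
    return matmul(rot_z(alpha), matmul(rot_y(beta), rot_z(gamma)))

def base_point(rotation):
    """Image of the north pole (0, 0, 1), i.e. the third column of the matrix."""
    return tuple(row[2] for row in rotation)

def so3_grid_sketch(max_beta, max_gamma, n_alpha, n_beta, n_gamma):
    """(alpha, beta, gamma) samples: an S2 grid times samples on the fiber."""
    betas = [k * max_beta / n_beta for k in range(1, n_beta + 1)]
    alphas = [2 * math.pi * i / n_alpha for i in range(n_alpha)]
    # Assumption: gamma equally spaced in [0, max_gamma), endpoint excluded.
    gammas = [max_gamma * j / n_gamma for j in range(n_gamma)]
    return [(a, b, g) for b in betas for a in alphas for g in gammas]

# Two rotations that differ only in gamma map the pole to the same point.
p1 = base_point(zyz(0.3, 0.5, 0.0))
p2 = base_point(zyz(0.3, 0.5, 1.7))
print(all(abs(x - y) < 1e-12 for x, y in zip(p1, p2)))

so3_grid = so3_grid_sketch(math.pi / 8, 2 * math.pi, n_alpha=6, n_beta=2, n_gamma=6)
print(len(so3_grid))  # n_beta * n_alpha * n_gamma sample points
```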

See the deep model of the MNIST example for an example of how to adapt these parameters over layers.

Feedback

For questions and comments, feel free to contact us: geiger.mario (gmail), taco.cohen (gmail), jonas (argmin.xyz).

License

MIT

References

[1] Taco S. Cohen, Mario Geiger, Jonas Köhler, Max Welling, Spherical CNNs. International Conference on Learning Representations (ICLR), 2018.

[2] Taco S. Cohen, Mario Geiger, Jonas Köhler, Max Welling, Convolutional Networks for Spherical Signals. ICML Workshop on Principled Approaches to Deep Learning, 2017.

[3] Taco S. Cohen, Mario Geiger, Maurice Weiler, Intertwiners between Induced Representations (with applications to the theory of equivariant neural networks), ArXiv preprint 1803.10743, 2018.

Owner

Jonas Köhler
