PyTorch implementation of NIPS 2017 paper Dynamic Routing Between Capsules

Last update: Dec 24, 2022

Overview

Dynamic Routing Between Capsules - PyTorch implementation

PyTorch implementation of the NIPS 2017 paper Dynamic Routing Between Capsules by Sara Sabour, Nicholas Frosst and Geoffrey E. Hinton.

The hyperparameters and data augmentation strategy strictly follow the paper.
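As an illustration of the augmentation, the paper describes shifting each MNIST image by up to 2 pixels in each direction with zero padding. The sketch below shows one way to do that with NumPy. The function name `random_shift` and its parameters are hypothetical and are not taken from this repository, whose own code may implement the shift differently (for example with torchvision transforms).

```python
import numpy as np


def random_shift(image, max_shift=2, rng=None):
    """Shift a 2D image by up to `max_shift` pixels per axis, filling with zeros.

    Hypothetical helper for illustration only; not part of this repository.
    """
    rng = np.random.default_rng() if rng is None else rng
    # Draw an independent integer offset for each axis in [-max_shift, max_shift].
    dy, dx = rng.integers(-max_shift, max_shift + 1, size=2)
    h, w = image.shape
    # Zero-pad every side, then crop a window displaced by the drawn offsets.
    padded = np.pad(image, max_shift, mode="constant", constant_values=0)
    top = max_shift - dy
    left = max_shift - dx
    return padded[top:top + h, left:left + w]
```

The output keeps the input shape, so a 28x28 digit stays 28x28, and pixels shifted past the border are dropped.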

Requirements

Only PyTorch with torchvision is required (tested on PyTorch 0.2.0 and 0.3.0). Jupyter and matplotlib are required to run the notebook with visualizations.

Usage

Train the model by running:

python net.py

Optional arguments and default values:

  --batch-size N          input batch size for training (default: 128)
  --test-batch-size N     input batch size for testing (default: 1000)
  --epochs N              number of epochs to train (default: 250)
  --lr LR                 learning rate (default: 0.001)
  --no-cuda               disables CUDA training
  --seed S                random seed (default: 1)
  --log-interval N        how many batches to wait before logging training
                          status (default: 10)
  --routing_iterations    number of iterations for routing algorithm (default: 3)
  --with_reconstruction   should reconstruction layers be used

MNIST dataset will be downloaded automatically.
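For readers who want to see how the options listed above map onto Python's argparse, here is a minimal sketch. The flag names and defaults are copied from the list. The function name `build_parser` and the exact help strings are assumptions, and the actual parser in net.py may be written differently.

```python
import argparse


def build_parser():
    """Build a parser mirroring the documented options (illustrative sketch)."""
    parser = argparse.ArgumentParser(description="CapsNet training options (sketch)")
    parser.add_argument("--batch-size", type=int, default=128, metavar="N",
                        help="input batch size for training")
    parser.add_argument("--test-batch-size", type=int, default=1000, metavar="N",
                        help="input batch size for testing")
    parser.add_argument("--epochs", type=int, default=250, metavar="N",
                        help="number of epochs to train")
    parser.add_argument("--lr", type=float, default=0.001, metavar="LR",
                        help="learning rate")
    parser.add_argument("--no-cuda", action="store_true", default=False,
                        help="disables CUDA training")
    parser.add_argument("--seed", type=int, default=1, metavar="S",
                        help="random seed")
    parser.add_argument("--log-interval", type=int, default=10, metavar="N",
                        help="how many batches to wait before logging training status")
    parser.add_argument("--routing_iterations", type=int, default=3,
                        help="number of iterations for routing algorithm")
    parser.add_argument("--with_reconstruction", action="store_true", default=False,
                        help="should reconstruction layers be used")
    return parser


# Parse an explicit argument list instead of sys.argv, for demonstration.
args = build_parser().parse_args(["--with_reconstruction", "--epochs", "10"])
```

Note that argparse turns hyphens into underscores for attribute names, so `--batch-size` is read as `args.batch_size`. The two boolean flags take no value: passing `--with_reconstruction` switches reconstruction on, and omitting it leaves it off.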

Results

The network trained with reconstruction and 3 routing iterations on the MNIST dataset achieves 99.65% accuracy on the test set. The test loss was still decreasing slightly when training stopped, so accuracy could probably be improved with more training and a more careful learning rate schedule.

Visualizations

We can create visualizations of digit reconstructions from DigitCaps (e.g. Figure 3 in the paper).

We can also visualize what each dimension of a digit capsule represents (Section 5.1, Figure 4 in the paper).

Below, each row shows the reconstruction when one of the 16 dimensions in the DigitCaps representation is tweaked by intervals of 0.05 in the range [−0.25, 0.25].

We can see what individual dimensions represent for digit 7, e.g. dim 6: stroke thickness, dim 11: digit width, dim 15: vertical shift.
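The perturbation scheme described above (one dimension at a time, steps of 0.05 over [−0.25, 0.25]) can be sketched in plain Python as follows. The function name `perturbation_grid` is hypothetical, and the step that feeds each tweaked vector through the reconstruction decoder is not shown, since it depends on the trained model.

```python
def perturbation_grid(capsule, low=-0.25, high=0.25, step=0.05):
    """Return the offsets and, for each dimension, one tweaked copy per offset.

    Hypothetical helper for illustration; `capsule` is a list of floats
    (16 values for a DigitCaps output vector).
    """
    n_steps = round((high - low) / step) + 1
    # Round to two decimals to avoid floating-point drift in the offsets.
    offsets = [round(low + k * step, 2) for k in range(n_steps)]
    rows = []
    for dim in range(len(capsule)):
        row = []
        for delta in offsets:
            tweaked = list(capsule)   # copy so the original vector is untouched
            tweaked[dim] += delta     # perturb a single dimension
            row.append(tweaked)
        rows.append(row)
    return offsets, rows


offsets, rows = perturbation_grid([0.0] * 16)
```

With the default range this yields 11 offsets per dimension, so a 16-dimensional capsule produces a 16 x 11 grid of vectors, one row per dimension as in the figure.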

Visualization examples are provided in a Jupyter notebook.

Owner

Adam Bielski
