ConvLSTM_pytorch

This file contains the implementation of Convolutional LSTM in PyTorch made by me and DavideA.

We started from this implementation and heavily refactored it add added features to match our needs.

Please note that in this repository we implement the following dynamics:

which is a bit different from the one in the original paper.

How to Use

The ConvLSTM module derives from nn.Module so it can be used as any other PyTorch module.

The ConvLSTM class supports an arbitrary number of layers. In this case, it can be specified the hidden dimension (that is, the number of channels) and the kernel size of each layer. In the case more layers are present but a single value is provided, this is replicated for all the layers. For example, in the following snippet each of the three layers has a different hidden dimension but the same kernel size.

Example usage:

model = ConvLSTM(input_dim=channels,
                 hidden_dim=[64, 64, 128],
                 kernel_size=(3, 3),
                 num_layers=3,
                 batch_first=True
                 bias=True,
                 return_all_layers=False)

TODO (in progress...)

Comment code
Add docs
Add example usage on a toy problem
Implement stateful mechanism
...

Disclaimer

This is still a work in progress and is far from being perfect: if you find any bug please don't hesitate to open an issue.

Implementation of Convolutional LSTM in PyTorch.

Related tags

Overview

ConvLSTM_pytorch

How to Use

TODO (in progress...)

Disclaimer

Owner

Andrea Palazzi

CIFS: Improving Adversarial Robustness of CNNs via Channel-wise Importance-based Feature Selection

This is the repository for The Machine Learning Workshops, published by AI DOJO

A PyTorch implementation of the WaveGlow: A Flow-based Generative Network for Speech Synthesis

PyTorch implementation for Convolutional Networks with Adaptive Inference Graphs

TalkNet 2: Non-Autoregressive Depth-Wise Separable Convolutional Model for Speech Synthesis with Explicit Pitch and Duration Prediction.

CFNet: Cascade and Fused Cost Volume for Robust Stereo Matching（CVPR2021）

This is a collection of simple PyTorch implementations of neural networks and related algorithms. These implementations are documented with explanations,

An attempt at the implementation of GLOM, Geoffrey Hinton's paper for emergent part-whole hierarchies from data

Semi-SDP Semi-supervised parser for semantic dependency parsing.

KIND: an Italian Multi-Domain Dataset for Named Entity Recognition

A project to make Amazon Echo respond to sign language using your webcam

This is a computer vision based implementation of the popular childhood game 'Hand Cricket/Odd or Even' in python

A Python library for Deep Probabilistic Modeling

Rainbow is all you need! A step-by-step tutorial from DQN to Rainbow

Empowering journalists and whistleblowers

Luminous is a framework for testing the performance of Embodied AI (EAI) models in indoor tasks.

Localizing Visual Sounds the Hard Way

PyTorch code for the paper "Complementarity is the King: Multi-modal and Multi-grained Hierarchical Semantic Enhancement Network for Cross-modal Retrieval".

Convert onnx models to pytorch.

MAGMA - a GPT-style multimodal model that can understand any combination of images and language