ConvLSTM_pytorch

This file contains the implementation of Convolutional LSTM in PyTorch made by me and DavideA.

We started from this implementation and heavily refactored it add added features to match our needs.

Please note that in this repository we implement the following dynamics:

which is a bit different from the one in the original paper.

How to Use

The ConvLSTM module derives from nn.Module so it can be used as any other PyTorch module.

The ConvLSTM class supports an arbitrary number of layers. In this case, it can be specified the hidden dimension (that is, the number of channels) and the kernel size of each layer. In the case more layers are present but a single value is provided, this is replicated for all the layers. For example, in the following snippet each of the three layers has a different hidden dimension but the same kernel size.

Example usage:

model = ConvLSTM(input_dim=channels,
                 hidden_dim=[64, 64, 128],
                 kernel_size=(3, 3),
                 num_layers=3,
                 batch_first=True
                 bias=True,
                 return_all_layers=False)

TODO (in progress...)

Comment code
Add docs
Add example usage on a toy problem
Implement stateful mechanism
...

Disclaimer

This is still a work in progress and is far from being perfect: if you find any bug please don't hesitate to open an issue.

Implementation of Convolutional LSTM in PyTorch.

Related tags

Overview

ConvLSTM_pytorch

How to Use

TODO (in progress...)

Disclaimer

Owner

Andrea Palazzi

Code for the CVPR 2021 paper "Triple-cooperative Video Shadow Detection"

A big endian Gentoo port developed on a Pine64.org RockPro64

This is the official implementation code repository of Underwater Light Field Retention : Neural Rendering for Underwater Imaging (Accepted by CVPR Workshop2022 NTIRE)

This repository contains the entire code for our work "Two-Timescale End-to-End Learning for Channel Acquisition and Hybrid Precoding"

Second-Order Neural ODE Optimizer, NeurIPS 2021 spotlight

Data Engineering ZoomCamp

Contenido del curso Bases de datos del DCC PUC versión 2021-2

CM building dataset Timisoara

Facebook AI Image Similarity Challenge: Descriptor Track

Audio Source Separation is the process of separating a mixture into isolated sounds from individual sources

Weakly Supervised Segmentation by Tensorflow.

EM-POSE 3D Human Pose Estimation from Sparse Electromagnetic Trackers.

unofficial pytorch implement of "Squareplus: A Softplus-Like Algebraic Rectifier"

This is a collection of our NAS and Vision Transformer work.

Models Supported: AlbUNet [18, 34, 50, 101, 152] (1D and 2D versions for Single and Multiclass Segmentation, Feature Extraction with supports for Deep Supervision and Guided Attention)

Tensorflow 2 implementation of the paper: Learning and Evaluating Representations for Deep One-class Classification published at ICLR 2021

Vrcwatch - Supply the local time to VRChat as Avatar Parameters through OSC

OMNIVORE is a single vision model for many different visual modalities

ROS Basics and TurtleSim

Randstad Artificial Intelligence Challenge (powered by VGEN). Soluzione proposta da Stefano Fiorucci (anakin87) - primo classificato