Implementation of ETSformer, state of the art time-series Transformer, in Pytorch

Last update: Dec 30, 2022

Overview

ETSformer - Pytorch

Implementation of ETSformer, state of the art time-series Transformer, in Pytorch

Install

$ pip install etsformer-pytorch

Usage

import torch
from etsformer_pytorch import ETSFormer

model = ETSFormer(
    time_features = 4,
    model_dim = 512,                # in paper they use 512
    embed_kernel_size = 3,          # kernel size for 1d conv for input embedding
    layers = 2,                     # number of encoder and corresponding decoder layers
    heads = 8,                      # number of exponential smoothing attention heads
    K = 4,                          # num frequencies with highest amplitude to keep (attend to)
    dropout = 0.2                   # dropout (in paper they did 0.2)
)

timeseries = torch.randn(1, 1024, 4)

pred = model(timeseries, num_steps_forecast = 32) # (1, 32, 4) - (batch, num steps forecast, num time features)

For using ETSFormer for classification, using cross attention pooling on all latents and level output

import torch
from etsformer_pytorch import ETSFormer, ClassificationWrapper

etsformer = ETSFormer(
    time_features = 1,
    model_dim = 512,
    embed_kernel_size = 3,
    layers = 2,
    heads = 8,
    K = 4,
    dropout = 0.2
)

adapter = ClassificationWrapper(
    etsformer = etsformer,
    dim_head = 32,
    heads = 16,
    dropout = 0.2,
    level_kernel_size = 5,
    num_classes = 10
)

timeseries = torch.randn(1, 1024)

logits = adapter(timeseries) # (1, 10)

Citation

@misc{woo2022etsformer,
    title   = {ETSformer: Exponential Smoothing Transformers for Time-series Forecasting}, 
    author  = {Gerald Woo and Chenghao Liu and Doyen Sahoo and Akshat Kumar and Steven Hoi},
    year    = {2022},
    eprint  = {2202.01381},
    archivePrefix = {arXiv},
    primaryClass = {cs.LG}
}

Comments

What are your thoughts on using latents for additional classification task
Hi! I was wondering if you have thought about aggregating seasonal and growth latents for additional tasks (for example classification)? What are the possible ways to bring latents into single feature vector in your opinion? The easiest one would be just get the mean along layers and time dimensions but that seams to be too naive. Another idea I had it to use Cross Attention mechanic with single time query key to aggregate latents:

all_latents = torch.cat([latent_growths, latent_seasonals], dim=-1) all_latents = rearrange(all_latents, 'b n l d -> (b l) n d') # q = nn.Parameter(torch.randn(all_latents_dim)) q = repeat(q, 'd -> b 1 d', b = all_latents.shape[0]) agg_latent = cross_attention(query=q, context=all_latents) agg_latent = rearrange(all_latents, '(b l) n d -> b (l n) d') agg_latent = agg_latent.mean(dim=1) # may be we should have done it before cross attention?

Would be great to hear your thoughts
opened by inspirit 15
Pre LayerNorm might be required for k,v?

https://github.com/lucidrains/ETSformer-pytorch/blob/2561053007e919409b3255eb1d0852c68799d24f/etsformer_pytorch/etsformer_pytorch.py#L440

In my early tests I see some instability in training results, I was wondering if it might be good idea to LayerNorm latents before constructing key and values?

opened by inspirit 5
growth_term calculation error

https://github.com/lucidrains/ETSformer-pytorch/blob/e1d8514b44d113ead523aa6307986833e68eecc5/etsformer_pytorch/etsformer_pytorch.py#L233-L235

It looks like you are not using growth and growth_smoothing_weightsto calculate growth_term

opened by inspirit 4
Backward gradient error
Hello,

i was trying to run the provided class and see following error: Function ScatterBackward0 returned an invalid gradient at index 1 - got [64, 4, 128] but expected shape compatible with [64, 33, 128]

model = ETSFormer( time_features = 9, model_dim = 128, embed_kernel_size = 3, layers = 2, heads = 4, K = 4, dropout = 0.2 )

input = torch.rand(64, 64, 9) x = model(input, num_steps_forecast = 16)
opened by inspirit 3
Does ETS-Former allow adding features

@lucidrains Thanks for making the code of the model available!

In your paper, you state that the model infers seasonal patterns itself, so that there is no need to add time features like week, month, etc.

Still, to increase the applicability of your approach, does the current implementation allow to add any (time-invariant and time-varying) features, e.g., categorical or numeric?

opened by StatMixedML 2
wrong order of arguments

https://github.com/lucidrains/ETSformer-pytorch/blob/2e0d465576c15fc8d84c4673f93fdd71d45b799c/etsformer_pytorch/etsformer_pytorch.py#L327

you pass latents on wrong order to Level module: according to forward method first should be growth and then seasonal

opened by inspirit 1
Clarification regarding data pre-processing

Hello,

I was trying to run the ETSformer for ETT dataset. The paper mentions that the dataset is split as 60/20/20 for train, validation and test. Could you give some insight as to how the dataset split is happening in the code.

Thank you.

opened by vageeshmaiya 2

Releases(0.0.16)

0.0.16(Mar 22, 2022)

Source code(tar.gz)
Source code(zip)
0.0.15(Mar 22, 2022)

Source code(tar.gz)
Source code(zip)
0.0.14a(Mar 22, 2022)

Source code(tar.gz)
Source code(zip)
0.0.12(Mar 20, 2022)

Source code(tar.gz)
Source code(zip)
0.0.11(Mar 20, 2022)

Source code(tar.gz)
Source code(zip)
0.0.10(Mar 20, 2022)

Source code(tar.gz)
Source code(zip)
0.0.9(Mar 20, 2022)

Source code(tar.gz)
Source code(zip)
0.0.8(Mar 20, 2022)

Source code(tar.gz)
Source code(zip)
0.0.7(Mar 19, 2022)

Source code(tar.gz)
Source code(zip)
0.0.6(Mar 18, 2022)

Source code(tar.gz)
Source code(zip)
0.0.5(Mar 17, 2022)

Source code(tar.gz)
Source code(zip)
0.0.4(Mar 17, 2022)

Source code(tar.gz)
Source code(zip)
0.0.3a(Mar 16, 2022)

Source code(tar.gz)
Source code(zip)
0.0.1(Mar 15, 2022)

Source code(tar.gz)
Source code(zip)

Owner

Phil Wang

Working with Attention. It's all we need

GitHub Repository

This is a TensorFlow implementation for C2-Rec

This is a TensorFlow implementation for C2-Rec We refer to the repo SASRec. Requirements requirement.txt Datasets This repo includes Amazon Beauty dat

7 Nov 14, 2022

Cleaned test data list of DukeMTMC-reID, ICCV2021

Cleaned DukeMTMC-reID Cleaned data list of DukeMTMC-reID released with our paper accepted by ICCV 2021: Learning Instance-level Spatial-Temporal Patte

14 Feb 19, 2022

Cerberus Transformer: Joint Semantic, Affordance and Attribute Parsing

Cerberus Transformer: Joint Semantic, Affordance and Attribute Parsing Paper Introduction Multi-task indoor scene understanding is widely considered a

62 Dec 05, 2022

TensorFlow implementation for Bayesian Modeling and Uncertainty Quantification for Learning to Optimize: What, Why, and How

Bayesian Modeling and Uncertainty Quantification for Learning to Optimize: What, Why, and How TensorFlow implementation for Bayesian Modeling and Unce

8 Sep 02, 2022

Using Self-Supervised Pretext Tasks for Active Learning - Official Pytorch Implementation

Using Self-Supervised Pretext Tasks for Active Learning - Official Pytorch Implementation Experiment Setting: CIFAR10 (downloaded and saved in ./DATA

38 Dec 27, 2022

A program to recognize fruits on pictures or videos using yolov5

Yolov5 Fruits Detector Requirements Either Linux or Windows. We recommend Linux for better performance. Python 3.6+ and PyTorch 1.7+. Installation To

30 Jan 06, 2023

Pretrained models for Jax/Haiku; MobileNet, ResNet, VGG, Xception.

Pre-trained image classification models for Jax/Haiku Jax/Haiku Applications are deep learning models that are made available alongside pre-trained we

14 Dec 20, 2022

Repository features UNet inspired architecture used for segmenting lungs on chest X-Ray images

Lung Segmentation (2D) Repository features UNet inspired architecture used for segmenting lungs on chest X-Ray images. Demo See the application of the

163 Sep 21, 2022

[CVPR'21] Learning to Recommend Frame for Interactive Video Object Segmentation in the Wild

IVOS-W Paper Learning to Recommend Frame for Interactive Video Object Segmentation in the Wild Zhaoyun Yin, Jia Zheng, Weixin Luo, Shenhan Qian, Hanli

38 Dec 12, 2022

Evaluation and Benchmarking of Speech Super-resolution Methods

Speech Super-resolution Evaluation and Benchmarking What this repo do: A toolbox for the evaluation of speech super-resolution algorithms. Unify the e

84 Dec 20, 2022

A spherical CNN for weather forecasting

DeepSphere-Weather - Deep Learning on the sphere for weather/climate applications. The code in this repository provides a scalable and flexible framew

47 Dec 25, 2022

Code needed to reproduce the examples found in "The Temporal Robustness of Stochastic Signals"

The Temporal Robustness of Stochastic Signals Code needed to reproduce the examples found in "The Temporal Robustness of Stochastic Signals" Case stud

0 Oct 28, 2021

HistoKT: Cross Knowledge Transfer in Computational Pathology

HistoKT: Cross Knowledge Transfer in Computational Pathology Exciting News! HistoKT has been accepted to ICASSP 2022. HistoKT: Cross Knowledge Transfe

5 Jan 05, 2023

Automatically creates genre collections for your Plex media

Plex Auto Genres Plex Auto Genres is a simple script that will add genre collection tags to your media making it much easier to search for genre speci

63 Dec 31, 2022

Probabilistic Programming and Statistical Inference in PyTorch

PtStat Probabilistic Programming and Statistical Inference in PyTorch. Introduction This project is being developed during my time at Cogent Labs. The

109 Nov 26, 2022

Collect some papers about transformer with vision. Awesome Transformer with Computer Vision (CV)

Awesome Visual-Transformer Collect some Transformer with Computer-Vision (CV) papers. If you find some overlooked papers, please open issues or pull r

2.8k Jan 08, 2023

Supplementary code for the AISTATS 2021 paper "Matern Gaussian Processes on Graphs".

Matern Gaussian Processes on Graphs This repo provides an extension for gpflow with Matérn kernels, inducing variables and trainable models implemente

41 Dec 17, 2022

Personals scripts using ageitgey/face_recognition

HOW TO USE pip3 install requirements.txt Add some pictures of known people in the folder 'people' : a) Create a folder called by the name of the perso

1 Jan 06, 2022

CLIP: Connecting Text and Image (Learning Transferable Visual Models From Natural Language Supervision)

CLIP (Contrastive Language–Image Pre-training) Experiments (Evaluation) Model Dataset Acc (%) ViT-B/32 (Paper) CIFAR100 65.1 ViT-B/32 (Our) CIFAR100 6

52 Jan 07, 2023

An investigation project for SISR.

SISR-Survey An investigation project for SISR. This repository is an official project of the paper "From Beginner to Master: A Survey for Deep Learnin

79 Oct 20, 2022

Implementation of ETSformer, state of the art time-series Transformer, in Pytorch

Related tags

Overview

ETSformer - Pytorch

Install

Usage

Citation

Comments

What are your thoughts on using latents for additional classification task

Pre LayerNorm might be required for k,v?

growth_term calculation error

Backward gradient error

Does ETS-Former allow adding features

wrong order of arguments

Clarification regarding data pre-processing

Releases(0.0.16)

0.0.16(Mar 22, 2022)

0.0.15(Mar 22, 2022)

0.0.14a(Mar 22, 2022)

0.0.12(Mar 20, 2022)

0.0.11(Mar 20, 2022)

0.0.10(Mar 20, 2022)

0.0.9(Mar 20, 2022)

0.0.8(Mar 20, 2022)

0.0.7(Mar 19, 2022)

0.0.6(Mar 18, 2022)

0.0.5(Mar 17, 2022)

0.0.4(Mar 17, 2022)

0.0.3a(Mar 16, 2022)

0.0.1(Mar 15, 2022)

Owner

Phil Wang

This is a TensorFlow implementation for C2-Rec

Cleaned test data list of DukeMTMC-reID, ICCV2021

Cerberus Transformer: Joint Semantic, Affordance and Attribute Parsing

TensorFlow implementation for Bayesian Modeling and Uncertainty Quantification for Learning to Optimize: What, Why, and How

Using Self-Supervised Pretext Tasks for Active Learning - Official Pytorch Implementation

A program to recognize fruits on pictures or videos using yolov5

Pretrained models for Jax/Haiku; MobileNet, ResNet, VGG, Xception.

Repository features UNet inspired architecture used for segmenting lungs on chest X-Ray images

[CVPR'21] Learning to Recommend Frame for Interactive Video Object Segmentation in the Wild

Evaluation and Benchmarking of Speech Super-resolution Methods

A spherical CNN for weather forecasting

Code needed to reproduce the examples found in "The Temporal Robustness of Stochastic Signals"

HistoKT: Cross Knowledge Transfer in Computational Pathology

Automatically creates genre collections for your Plex media

Probabilistic Programming and Statistical Inference in PyTorch

Collect some papers about transformer with vision. Awesome Transformer with Computer Vision (CV)

Supplementary code for the AISTATS 2021 paper "Matern Gaussian Processes on Graphs".

Personals scripts using ageitgey/face_recognition

CLIP: Connecting Text and Image (Learning Transferable Visual Models From Natural Language Supervision)

An investigation project for SISR.