Deep learning PyTorch library for time series forecasting, classification, and anomaly detection

Last update: Jan 04, 2023

Overview

Deep learning for time series forecasting

Flow forecast is an open-source deep learning for time series forecasting framework. It provides all the latest state of the art models (transformers, attention models, GRUs) and cutting edge concepts with easy to understand interpretability metrics, cloud provider integration, and model serving capabilities. Flow Forecast was the first time series framework to feature support for transformer based models and remains the only true end-to-end deep learnig for time series forecasting framework. Currently Task-TS from CoronaWhy primarily maintains this repository. Pull requests are welcome. Historically, this repository provided open source benchmark and codes for flash flood and river flow forecasting.

For additional tutorials (on Colab) and examples please see our tutorials repository.

branch	status
master
Build PY
Documentation
CodeCov
CodeFactor

Getting Started

Using the library

Run pip install flood-forecast
Detailed info on training models can be found on the Wiki.
Check out our Confluence Documentation

Models currently supported

Vanilla LSTM (LSTM): A basic LSTM that is suitable for multivariate time series forecasting and transfer learning.
Full transformer (SimpleTransformer in model_dict): The full original transformer with all 8 encoder and decoder blocks. Requires passing the target in at inference.
Simple Multi-Head Attention (MultiHeadSimple): A simple multi-head attention block and linear embedding layers. Suitable for transfer learning.
Transformer with a linear decoder (CustomTransformerDecoder in model_dict): A transformer with n-encoder blocks (this is tunable) and a linear decoder.
DA-RNN: (DARNN) A well rounded model with which utilizes a LSTM + attention.
Enhancing the Locality and Breaking the Memory Bottleneck of Transformer on Time Series Forecasting (called DecoderTransformer in model_dict):
Transformer XL:
Informer: Beyond Efficient Transformer for Long Sequence Time-Series Forecasting (Informer)
DeepAR

Forthcoming Models

We have a number of models we are planning on releasing soon. Please check our project board for more info

Integrations

Google Cloud Platform

Weights and Biases

Contributing

For instructions on contributing please see our contributions page and our project board.

Historical River Flow Data

Task 1 Stream Flow Forecasting

This task focuses on forecasting a stream's future flow/height (in either cfs or feet respectively) given factors such as current flow, temperature, and precipitation. In the future we plan on adding more variables that help with the stream flow prediction such as snow pack data and the surrounding soil moisture index.

Task 2 Flood severity forecasting

Task two focuses on predicting the severity of the flood based on the flood forecast, population information, and topography. Flood severity is defined based on several factors including the number of injuires, property damage, and crop damage.

If you use either the data or code from this repository please use the citation below. Additionally please cite the original authors of the models.

@misc{godfried2020flowdb,
      title={FlowDB a large scale precipitation, river, and flash flood dataset}, 
      author={Isaac Godfried and Kriti Mahajan and Maggie Wang and Kevin Li and Pranjalya Tiwari},
      year={2020},
      eprint={2012.11154},
      archivePrefix={arXiv},
      primaryClass={cs.AI}
}

Comments

Informer compatibility with interpretability methods

Currently Informer does not work with the shap interpretability methods. Refactoring SHAP to work with these methods will likely require so significant refactoring. As with Informer we have the target being passed. We should also likely design a helper function to better help with this. history, _, forecast_start_idx = csv_test_loader.get_from_start_date(datetime_start) background_tensor = _prepare_background_tensor(csv_test_loader)
enhancement

opened by isaacmg 7
Inference mode for time series models
Create a predict function which does inference for for time series models without requiring the target present. This module should initialize the model using the given configuration file (with a weight path). It should be able to consume a CSV file or query a SQL table #102 (thought this functionality is not required in the initial PR). It should ideally make use of the existing evaluator.py module but

Acceptance Criteria

[ ] Passing tests

deployment
opened by isaacmg 7
about dataset

So how can i download the dataset of FlowDB Dataset? Gsutil is not working? Can you give some details for your dataset, and tell us how to use you model for gour FlowDB? Thanks!

opened by Vipermdl 6
Does datetime_start parameter in inference_params is forecasting start date?

In your Infer.ipynb datetime_start parameter is forecasting start date? (Your predict_cfs bucknet had been expired.)

'inference_params': {'dataset_params': {'file_path': 'gs://predict_cfs/day_addition/01064118KPWM_flow.csv', 'forecast_history': 8, 'forecast_length': 1, 'interpolate_param': {'method': 'back_forward', 'params': {}}, 'relevant_cols': ['cfs1', 'precip', 'temp', 'month'], 'scaling': RobustScaler(), 'sort_column': 'hour_updated', 'target_col': ['cfs1']}, 'datetime_start': '2018-05-31', 'decoder_params': {'decoder_function': 'simple_decode', 'unsqueeze_dim': 1}, 'hours_to_forecast': 336, 'num_prediction_samples': 30, 'test_csv_path': 'gs://predict_cfs/day_addition/01064118KPWM_flow.csv'}

opened by JJNET 5
Poor informer performance

The performance of the Informer model still seems to be poor at least with respect to forecast the Virgin River Flow. There may still be bugs therefore we should investigate it on other datasets and additional unittests. Possibly we should also try to replicate the performance on the ETH datasets the model was trained on (related to #314 ) The model does not seem to learn anything from the temporal data input.

opened by isaacmg 5
Adding GPU support to the Informer

This PR aims to the resolve prior issues #343 as well as fix a new problem related to the label_len in the data-loader. This PR in addition includes documentation updates to the Informer and additional information on how to use relevant data-loaders and SHAP features.

opened by isaacmg 5
DecoderTransformer: Distinguishing Know inputs from Observed inputs

Hello Isaac. First of all thank you for this brilliant project. I was able to run the Decoder Transformer on the EU Wind Energy dataset.

One question though. The model's paper, when defining the problem, says that some exogenous time series are known until the forecast horizon. For example, I would like to add the wind forecast as a feature with a middle dimension equals to "forecast_length" and with the same time idx as the target. Is there a way to model this in your config_file or at a lower level within the Loader objects?

Thank you

Lorenzo Ostano

opened by Vergangenheit 5
TypeError: Object of type Tensor is not JSON serializable when running train_transformer_style with takes_target as 1

Traceback (most recent call last): File "flood_forecast/trainer.py", line 108, in main() File "flood_forecast/trainer.py", line 103, in main train_function(training_config["model_type"], training_config) File "flood_forecast/trainer.py", line 42, in train_function train_transformer_style(model=trained_model, File "/home/harsh/Documents/Coronawhy/flow-forecast/flood_forecast/pytorch_training.py", line 146, in train_transformer_style model.save_model(model_filepath, max_epochs) File "/home/harsh/Documents/Coronawhy/flow-forecast/flood_forecast/time_model.py", line 152, in save_model json.dump(self.params, p) File "/home/harsh/anaconda3/envs/flow-forecast/lib/python3.8/json/init.py", line 179, in dump for chunk in iterable: File "/home/harsh/anaconda3/envs/flow-forecast/lib/python3.8/json/encoder.py", line 431, in _iterencode yield from _iterencode_dict(o, _current_indent_level) File "/home/harsh/anaconda3/envs/flow-forecast/lib/python3.8/json/encoder.py", line 405, in _iterencode_dict yield from chunks File "/home/harsh/anaconda3/envs/flow-forecast/lib/python3.8/json/encoder.py", line 405, in _iterencode_dict yield from chunks File "/home/harsh/anaconda3/envs/flow-forecast/lib/python3.8/json/encoder.py", line 438, in _iterencode o = _default(o) File "/home/harsh/anaconda3/envs/flow-forecast/lib/python3.8/json/encoder.py", line 179, in default raise TypeError(f'Object of type {o.class.name} ' TypeError: Object of type Tensor is not JSON serializable

opened by 97harsh 5
Add meta-data fusion method and documentation
Based on #100 we want to fuse meta-data with temporal data to enable better time series forecasts.

[x] Create a design document of meta-data fusion methods and explain relevant approaches

[x] Review design document with @kritim13 and other teammates.

[x] Implement agreed upon approach

[x] Create a JSON config file and appropriate unit tests.

[x] Test end to end in the Kaggle Notebook.

meta-data
opened by isaacmg 5
Get ASOS data on GCS for years 2014-2019
Get all the data on GCS for those dates.

[x] Create looping function to perform action

[x] Create list of ASOS stations already saved with path on GCS. Upload this file to GCS.

[x] Run and get all data on GCS for all gages
opened by isaacmg 5
DecoderTransformer not implemented as paper at all

did I miss something? The decodertransformer which claims to implement the paper(Enhancing the Locality and Breaking the Memory Bottleneck of Transformer on Time Series Forecasting) in the document is not even close to what the paper proposed. There are no key component like conv1d layers for locality and logsparse. If we didn't implement that paper, then really shouldn't list it in the document.

opened by mvccn 4
Example auto-encoder time series

We could use a detailed end-to-end example of using an AutoEncoder to create representations of temporal data. This should likely be done on Kaggle then added to the flow tutorials repo as a lin.
documentation

opened by isaacmg 0
Pyre type error fixed.

"filename": "flood_forecast/preprocessing/process_usgs.py" "warning_type": "Invalid type [31]" "warning_message": " Expression (pandas.DataFrame, int, int, int) is not a valid type." "warning_line": 82 "fix": remove int,int,int

opened by luca-digrazia 0
Bump shap from 0.40.0 to 0.41.0
Bumps shap from 0.40.0 to 0.41.0.

Release notes

Sourced from shap's releases.

v0.41.0

Lots of bugs fixes and API improvements.

Fixed rare bug with XGBoost model loading by @TheZL @lrjball

Fixed the beeswarm plot so it does not modify the passed explanation object, @ravwojdyla

Automatic wheel building using GH actions by @quantumtec

GC collection for memory in KernelExplainer by @Qingtian-Zou

Fixed max_evals params for PartitionExplainer

JIT optimize the PartitionExplainer

Fix colorbar formatting issues @SleepyPepperHead

New benchmark notebooks

Use display_data for plotting when possible @yuuuxt

Improved GPUTreeShap compilation and params @RAMitchell

Fix TF API change in DeepExplainer @filusn

Add torch tensor support for plots @alexander-pv

Switch to Github actions for testing instead of Travis

New California demo dataset @swalsh1123

Fix waterfall plot bug @RichardScottOZ

Handle missing matplotlib installation @klieret

Add linearize link support for Additive explainer (Nandish Gupta)

Fix exceptions to be more specific @alexisdrakopoulos @collinb9

Add color map option for plotting @tlabarta

Release fixed numpy version requirement @rmehyde

And many other contributions kindly made by @WeichenXu123 @imatiach-msft @zeshengli @nkthiebaut @songololo @GiovannaNicora @joshzwiebel @Ashishbodla @navdeep-G @smathewmanuel @ycouble @anubhavmaity @adityasaini70 @ngupta20 @jckkvs @abs428 @JulesCollenne @Tiagosf00 @javirandor and @Thuener

Commits

510c4b6 Merge pull request #2242 from ravwojdyla/allow-to-control-the-heatmap-size

dd967b6 Merge branch 'master' of https://github.com/slundberg/shap

a791685 fix std to account for averaging

6995c03 Merge branch 'master' into allow-to-control-the-heatmap-size

b6e90c8 Merge pull request #2580 from alexisdrakopoulos/feat/refactor_exceptions

4921c50 Merge pull request #2162 from TheZL/xgbmodel_buffer_lstrip_error_correction

a8dbefd Clean up the intro doc notebook

84ddd09 Merge branch 'feat/refactor_exceptions' of github.com:alexisdrakopoulos/shap ...

348dc7d accidental import

2cfa489 Merge branch 'master' into xgbmodel_buffer_lstrip_error_correction

Additional commits viewable in compare view

Dependabot will resolve any conflicts with this PR as long as you don't alter it yourself. You can also trigger a rebase manually by commenting @dependabot rebase.

Dependabot commands and options

You can trigger Dependabot actions by commenting on this PR:

@dependabot rebase will rebase this PR

@dependabot recreate will recreate this PR, overwriting any edits that have been made to it

@dependabot merge will merge this PR after your CI passes on it

@dependabot squash and merge will squash and merge this PR after your CI passes on it

@dependabot cancel merge will cancel a previously requested merge and block automerging

@dependabot reopen will reopen this PR if it is closed

@dependabot close will close this PR and stop Dependabot recreating it. You can achieve the same result by closing it manually

@dependabot ignore this major version will close this PR and stop Dependabot creating any more for this major version (unless you reopen the PR or upgrade to it yourself)

@dependabot ignore this minor version will close this PR and stop Dependabot creating any more for this minor version (unless you reopen the PR or upgrade to it yourself)

@dependabot ignore this dependency will close this PR and stop Dependabot creating any more for this dependency (unless you reopen the PR or upgrade to it yourself)

dependencies
opened by dependabot[bot] 0

Releases(FF_FIXES_BRANCH_VER)

FF_FIXES_BRANCH_VER(Jun 9, 2022)

Source code(tar.gz)
Source code(zip)
flow_bug(Jun 8, 2022)

Source code(tar.gz)
Source code(zip)
infer_ff(May 24, 2022)

Source code(tar.gz)
Source code(zip)
forecast_0.999(May 10, 2022)

Source code(tar.gz)
Source code(zip)
forecast(Apr 28, 2022)

Source code(tar.gz)
Source code(zip)
forecast_0.99(Apr 27, 2022)

Source code(tar.gz)
Source code(zip)
flow.991(Dec 14, 2021)
This release adds support for the following:

[X] Focal Loss function for imbalanced classification and anomaly detection.

[X] Allows training using multiple criterion (experimental support)

[X] Fixes bug related to training the Informer model

We are aiming for NYE release of Flow Forecast 1.0.
Source code(tar.gz)
Source code(zip)
flow_al_1.0(Sep 20, 2021)
This (pre)-release accomplishes the following items

Fixes bug in the Informer decoding process

Adds support for time series classification

Creates docstrings and additional ReadTheDocs documentation

We are very close to release 1.0 which should be out before October.
Source code(tar.gz)
Source code(zip)
0.98(May 14, 2021)

Source code(tar.gz)
Source code(zip)
0.981_flow(May 14, 2021)
Change Log

Support for using SHAP with the Informer in CPU mode

Fixes several bugs related to handling of Informer info

Increased GPU utilization for specific models

Fix for a PyPi related versioning error.

Fixes a bug related to Wandb and SHAP.

Coming in version 0.99

GPU support for Informer with SHAP

Gaussian Loss bug fixes

Addition of series_id for core data loaders.

Source code(tar.gz)
Source code(zip)
forecast_0.97(May 14, 2021)

Source code(tar.gz)
Source code(zip)
0.956(Apr 30, 2021)
Fix an issue related to dependencies and allow for the using of a single config file. Also gets rid of requiring batch_size in the config. No long config dataset_params.

Informer support for SHAP will be in next release

Source code(tar.gz)
Source code(zip)
forecast_0.93(Apr 12, 2021)

This release adds support for Informer models.
Source code(tar.gz)
Source code(zip)
0.95(Feb 9, 2021)

In this release we add several important bug fixes and new features.
Source code(tar.gz)
Source code(zip)
flow_.93(Jan 12, 2021)

Source code(tar.gz)
Source code(zip)
flow_0.92(Dec 27, 2020)

Source code(tar.gz)
Source code(zip)
forecast_0.92(Nov 13, 2020)

This release includes general updates to dependencies and some inference mode support.
Source code(tar.gz)
Source code(zip)
flow_.09(Sep 23, 2020)

Source code(tar.gz)
Source code(zip)
flow_forecast.88(Sep 6, 2020)

Source code(tar.gz)
Source code(zip)
forecast_0.67(Sep 6, 2020)

Source code(tar.gz)
Source code(zip)
flow.08(Sep 4, 2020)

Source code(tar.gz)
Source code(zip)
0.70(Aug 24, 2020)

Source code(tar.gz)
Source code(zip)
flow_.066(Aug 24, 2020)

Source code(tar.gz)
Source code(zip)
flow_0.41(Aug 17, 2020)

Source code(tar.gz)
Source code(zip)
0.6(Aug 12, 2020)

In this release we add multistep forecasting and the DA-RNN model/w dropout.
Source code(tar.gz)
Source code(zip)
forecast_0.56(Aug 13, 2020)

Source code(tar.gz)
Source code(zip)
0.53(Aug 11, 2020)

Source code(tar.gz)
Source code(zip)
forecast_0.51(Aug 5, 2020)

Source code(tar.gz)
Source code(zip)
flow_0.52(Aug 5, 2020)

Source code(tar.gz)
Source code(zip)
forecast_0.5(Jul 21, 2020)

Fixed issues related to the readme.
Source code(tar.gz)
Source code(zip)

Owner

AIStream

AIStream develops open source deep learning solutions for real world problems

GitHub Repository https://flow-forecast.atlassian.net/wiki/spaces/FF/overview

A mini library for Policy Gradients with Parameter-based Exploration, with reference implementation of the ClipUp optimizer from NNAISENSE.

PGPElib A mini library for Policy Gradients with Parameter-based Exploration [1] and friends. This library serves as a clean re-implementation of the

56 Jan 01, 2023

General neural ODE and DAE modules for power system dynamic modeling.

Py_PSNODE General neural ODE and DAE modules for power system dynamic modeling. The PyTorch-based ODE solver is developed based on torchdiffeq. Sample

14 Dec 31, 2022

A toolkit for developing and comparing reinforcement learning algorithms.

Status: Maintenance (expect bug fixes and minor updates) OpenAI Gym OpenAI Gym is a toolkit for developing and comparing reinforcement learning algori

29.6k Jan 08, 2023

Code and datasets for TPAMI 2021

SkeletonNet This repository constains the codes and ShapeNetV1-Surface-Skeleton,ShapNetV1-SkeletalVolume and 2d image datasets ShapeNetRendering. Plea

34 Aug 15, 2022

Implementation of ConvMixer for "Patches Are All You Need? 🤷"

Patches Are All You Need? 🤷 This repository contains an implementation of ConvMixer for the ICLR 2022 submission "Patches Are All You Need?" by Asher

934 Jan 08, 2023

Apache Spark - A unified analytics engine for large-scale data processing

Apache Spark Spark is a unified analytics engine for large-scale data processing. It provides high-level APIs in Scala, Java, Python, and R, and an op

34.7k Jan 04, 2023

Binary classification for arrythmia detection with ECG datasets.

HEART DISEASE AI DATATHON 2021 [Eng] / [Kor] #English This is an AI diagnosis modeling contest that uses the heart disease echocardiography and electr

3 Jul 14, 2022

Github Traffic Insights as Prometheus metrics.

github-traffic Github Traffic collects your repository's traffic data and exposes it as Prometheus metrics. Grafana dashboard that displays the metric

34 Oct 27, 2022

Atomistic Line Graph Neural Network

Table of Contents Introduction Installation Examples Pre-trained models Quick start using colab JARVIS-ALIGNN webapp Peformances on a few datasets Use

91 Dec 30, 2022

SAS: Self-Augmentation Strategy for Language Model Pre-training

SAS: Self-Augmentation Strategy for Language Model Pre-training This repository

5 Nov 02, 2022

Styleformer - Official Pytorch Implementation

Styleformer -- Official PyTorch implementation Styleformer: Transformer based Generative Adversarial Networks with Style Vector(https://arxiv.org/abs/

159 Dec 12, 2022

Official repository for "Restormer: Efficient Transformer for High-Resolution Image Restoration". SOTA for motion deblurring, image deraining, denoising (Gaussian/real data), and defocus deblurring.

Restormer: Efficient Transformer for High-Resolution Image Restoration Syed Waqas Zamir, Aditya Arora, Salman Khan, Munawar Hayat, Fahad Shahbaz Khan,

906 Dec 30, 2022

Deep learning PyTorch library for time series forecasting, classification, and anomaly detection

Related tags

Overview

Deep learning for time series forecasting

Getting Started

Contributing

Historical River Flow Data

Task 1 Stream Flow Forecasting

Task 2 Flood severity forecasting

Comments

v0.41.0

Releases(FF_FIXES_BRANCH_VER)

FF_FIXES_BRANCH_VER(Jun 9, 2022)

flow_bug(Jun 8, 2022)

infer_ff(May 24, 2022)

forecast_0.999(May 10, 2022)

forecast(Apr 28, 2022)

forecast_0.99(Apr 27, 2022)

flow.991(Dec 14, 2021)

flow_al_1.0(Sep 20, 2021)

0.98(May 14, 2021)

0.981_flow(May 14, 2021)

forecast_0.97(May 14, 2021)

0.956(Apr 30, 2021)

forecast_0.93(Apr 12, 2021)

0.95(Feb 9, 2021)

flow_.93(Jan 12, 2021)

flow_0.92(Dec 27, 2020)

forecast_0.92(Nov 13, 2020)

flow_.09(Sep 23, 2020)

flow_forecast.88(Sep 6, 2020)

forecast_0.67(Sep 6, 2020)

flow.08(Sep 4, 2020)

0.70(Aug 24, 2020)

flow_.066(Aug 24, 2020)

flow_0.41(Aug 17, 2020)

0.6(Aug 12, 2020)

forecast_0.56(Aug 13, 2020)

0.53(Aug 11, 2020)

forecast_0.51(Aug 5, 2020)

flow_0.52(Aug 5, 2020)

forecast_0.5(Jul 21, 2020)

Owner

AIStream

A mini library for Policy Gradients with Parameter-based Exploration, with reference implementation of the ClipUp optimizer from NNAISENSE.

General neural ODE and DAE modules for power system dynamic modeling.

A toolkit for developing and comparing reinforcement learning algorithms.

Code and datasets for TPAMI 2021

Implementation of ConvMixer for "Patches Are All You Need? 🤷"

Apache Spark - A unified analytics engine for large-scale data processing

Binary classification for arrythmia detection with ECG datasets.

Github Traffic Insights as Prometheus metrics.

Atomistic Line Graph Neural Network

SAS: Self-Augmentation Strategy for Language Model Pre-training

Styleformer - Official Pytorch Implementation

Official repository for "Restormer: Efficient Transformer for High-Resolution Image Restoration". SOTA for motion deblurring, image deraining, denoising (Gaussian/real data), and defocus deblurring.

Official implementation of the paper Label-Efficient Semantic Segmentation with Diffusion Models

Cascaded Deep Video Deblurring Using Temporal Sharpness Prior and Non-local Spatial-Temporal Similarity

Denoising Normalizing Flow

Crosslingual Segmental Language Model

The codes and related files to reproduce the results for Image Similarity Challenge Track 1.

SMPLpix: Neural Avatars from 3D Human Models

Generate high quality pictures. GAN. Generative Adversarial Networks

Code for the paper "Adapting Monolingual Models: Data can be Scarce when Language Similarity is High"