List of Implementations:

Currently, the reimplementation of the DeepAR paper(DeepAR: Probabilistic Forecasting with Autoregressive Recurrent Networks https://arxiv.org/abs/1704.04110) is available in PyTorch. More papers will be coming soon.

Authors:

Yunkai Zhang([email protected]) - University of California, Santa Barbara
Qiao Jiang - Brown University
Xueying Ma - Columbia University
Acknowledgement: Professor Xifeng Yan's group at UC Santa Barbara. Part of the work was done at WeWork.

To run:

Install all dependencies listed in requirements.txt. Note that the model has only been tested in the versions shown in the text file.
Download the dataset and preprocess the data:
```
python preprocess_elect.py
```
Start training:
```
python train.py
```
- If you want to perform ancestral sampling,
```
python train.py --sampling
```
- If you do not want to do normalization during evaluation,
```
python train.py --relative-metrics
```
Evaluate a set of saved model weights:
```
python evaluate.py
```
Perform hyperparameter search:
```
 python search_params.py
```

Results

The model is evaluated on the electricity dataset, which contains the electricity consumption of 370 households from 2011 to 2014. Under hourly frequency, we use the first week of September, 2014 as the test set and all time steps prior to that as the train set. Following the experiment design in DeepAR, the window size is chosen to be 192, where the last 24 is the forecasting horizon. History (number of time steps since the beginning of each household), month of the year, day of the week, and hour of the day are used as time covariates. Notice that some households started at different times, so we only use windows that contain non-missing values.

Under Gaussian likelihood, we use the Adam optimizer with early stopping to train the model for 20 epoches. The same set of hyperparameters is used as outlined in the paper. Weights with the best ND value is selected, where ND = 0.06349, RMSE = 0.452, rou90 = 0.034 and rou50 = 0.063.

Sample results on electricity. The top 10 plots are sampled from the test set with the highest 10% ND values, whereas the bottom 10 plots are sampled from the rest of the test set.

Implementation of deep learning models for time series in PyTorch.

Related tags

Overview

List of Implementations:

Authors:

To run:

Results

Owner

Yunkai Zhang

Regularization and Feature Selection in Least Squares Temporal Difference Learning

A Python Package to Tackle the Curse of Imbalanced Datasets in Machine Learning

TorchDrug is a PyTorch-based machine learning toolbox designed for drug discovery

Implementation of deep learning models for time series in PyTorch.

Spark development environment for k8s

CyLP is a Python interface to COIN-OR’s Linear and mixed-integer program solvers (CLP, CBC, and CGL)

Interactive Web App with Streamlit and Scikit-learn that applies different Classification algorithms to popular datasets

MachineLearningStocks is designed to be an intuitive and highly extensible template project applying machine learning to making stock predictions.

TensorFlow Decision Forests (TF-DF) is a collection of state-of-the-art algorithms for the training, serving and interpretation of Decision Forest models.

Bayesian optimization in JAX

A Python implementation of GRAIL, a generic framework to learn compact time series representations.

a distributed deep learning platform

Scikit-Learn useful pre-defined Pipelines Hub

Machine learning that just works, for effortless production applications

scikit-fem is a lightweight Python 3.7+ library for performing finite element assembly.

Stock Price Prediction Bank Jago Using Facebook Prophet Machine Learning & Python

Data Version Control or DVC is an open-source tool for data science and machine learning projects

Python library which makes it possible to dynamically mask/anonymize data using JSON string or python dict rules in a PySpark environment.

This repo includes some graph-based CTR prediction models and other representative baselines.

A data preprocessing package for time series data. Design for machine learning and deep learning.