Sequence Modeling Benchmarks and Temporal Convolutional Networks (TCN)

This repository contains the experiments done in the work An Empirical Evaluation of Generic Convolutional and Recurrent Networks for Sequence Modeling by Shaojie Bai, J. Zico Kolter and Vladlen Koltun.

We specifically target a comprehensive set of tasks that have been repeatedly used to compare the effectiveness of different recurrent networks, and evaluate a simple, generic but powerful (purely) convolutional network on the recurrent nets' home turf.

Experiments are done in PyTorch. If you find this repository helpful, please cite our work:

@article{BaiTCN2018,
	author    = {Shaojie Bai and J. Zico Kolter and Vladlen Koltun},
	title     = {An Empirical Evaluation of Generic Convolutional and Recurrent Networks for Sequence Modeling},
	journal   = {arXiv:1803.01271},
	year      = {2018},
}

Domains and Datasets

Update: The code should be directly runnable with PyTorch v1.0.0 or above (PyTorch v>1.3.0 strongly recommended). The older versions of PyTorch are no longer supported.

This repository contains the benchmarks to the following tasks, with details explained in each sub-directory:

The Adding Problem with various T (we evaluated on T=200, 400, 600)
Copying Memory Task with various T (we evaluated on T=500, 1000, 2000)
Sequential MNIST digit classification
Permuted Sequential MNIST (based on Seq. MNIST, but more challenging)
JSB Chorales polyphonic music
Nottingham polyphonic music
PennTreebank [SMALL] word-level language modeling (LM)
Wikitext-103 [LARGE] word-level LM
LAMBADA [LARGE] word-level LM and textual understanding
PennTreebank [MEDIUM] char-level LM
text8 [LARGE] char-level LM

While some of the large datasets are not included in this repo, we use the observations package to download them, which can be easily installed using pip.

Usage

Each task is contained in its own directory, with the following structure:

[TASK_NAME] /
    data/
    [TASK_NAME]_test.py
    models.py
    utils.py

To run TCN model on the task, one only need to run [TASK_NAME]_test.py (e.g. add_test.py). To tune the hyperparameters, one can specify via argument options, which can been seen via the -h flag.

Sequence modeling benchmarks and temporal convolutional networks

Related tags

Overview

Sequence Modeling Benchmarks and Temporal Convolutional Networks (TCN)

Domains and Datasets

Usage

Owner

CMU Locus Lab

EMNLP'2021: Can Language Models be Biomedical Knowledge Bases?

Code for Findings of ACL 2022 Paper "Sentiment Word Aware Multimodal Refinement for Multimodal Sentiment Analysis with ASR Errors"

Official code of our work, Unified Pre-training for Program Understanding and Generation [NAACL 2021].

HiFi DeepVariant + WhatsHap workflowHiFi DeepVariant + WhatsHap workflow

The ability of computer software to identify words and phrases in spoken language and convert them to human-readable text

Fast, general, and tested differentiable structured prediction in PyTorch

Sentence boundary disambiguation tool for Japanese texts (日本語文境界判定器)

Snowball compiler and stemming algorithms

A natural language modeling framework based on PyTorch

Knowledge Management for Humans using Machine Learning & Tags

ALIbaba's Collection of Encoder-decoders from MinD (Machine IntelligeNce of Damo) Lab

👄 The most accurate natural language detection library for Python, suitable for long and short text alike

GCRC: A Gaokao Chinese Reading Comprehension dataset for interpretable Evaluation

AEC_DeepModel - Deep learning based acoustic echo cancellation baseline code

chaii - hindi & tamil question answering

DeepAmandine is an artificial intelligence that allows you to talk to it for hours, you won't know the difference.

Official source for spanish Language Models and resources made @ BSC-TEMU within the "Plan de las Tecnologías del Lenguaje" (Plan-TL).

CYGNUS, the Cynical AI, combines snarky responses with uncanny aggression.

Grover is a model for Neural Fake News -- both generation and detectio

Analyse japanese ebooks using MeCab to determine the difficulty level for japanese learners