Official codebase for Decision Transformer: Reinforcement Learning via Sequence Modeling.

Last update: Jan 07, 2023

Related tags

Overview

Decision Transformer

Lili Chen*, Kevin Lu*, Aravind Rajeswaran, Kimin Lee, Aditya Grover, Michael Laskin, Pieter Abbeel, Aravind Srinivas†, and Igor Mordatch†

*equal contribution, †equal advising

A link to our paper can be found on arXiv.

Overview

Official codebase for Decision Transformer: Reinforcement Learning via Sequence Modeling. Contains scripts to reproduce experiments.

Instructions

We provide code in two sub-directories: atari containing code for Atari experiments and gym containing code for OpenAI Gym experiments. See corresponding READMEs in each folder for instructions; scripts should be run from the respective directories. It may be necessary to add the respective directories to your PYTHONPATH.

Citation

Please cite our paper as:

@article{chen2021decisiontransformer,
  title={Decision Transformer: Reinforcement Learning via Sequence Modeling},
  author={Lili Chen and Kevin Lu and Aravind Rajeswaran and Kimin Lee and Aditya Grover and Michael Laskin and Pieter Abbeel and Aravind Srinivas and Igor Mordatch},
  journal={arXiv preprint arXiv:2106.01345},
  year={2021}
}

Note: this is not an official Google or Facebook product.

Official codebase for Decision Transformer: Reinforcement Learning via Sequence Modeling.

Related tags

Overview

Decision Transformer

Overview

Instructions

Citation

Owner

Kevin Lu

DeepCAD: A Deep Generative Network for Computer-Aided Design Models

Deep Learning and Logical Reasoning from Data and Knowledge

A Python script that creates subtitles of a given length from text paragraphs that can be easily imported into any Video Editing software such as FinalCut Pro for further adjustments.

Understanding the Generalization Benefit of Model Invariance from a Data Perspective

Danfeng Hong, Lianru Gao, Jing Yao, Bing Zhang, Antonio Plaza, Jocelyn Chanussot. Graph Convolutional Networks for Hyperspectral Image Classification, IEEE TGRS, 2021.

Implementation of the CVPR 2021 paper "Online Multiple Object Tracking with Cross-Task Synergy"

Differentiable Annealed Importance Sampling (DAIS)

AWS documentation corpus for zero-shot open-book question answering.

[ACM MM 2021] Multiview Detection with Shadow Transformer (and View-Coherent Data Augmentation)

Code of our paper "Contrastive Object-level Pre-training with Spatial Noise Curriculum Learning"

A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing.

Simultaneous Demand Prediction and Planning

LowRankModels.jl is a julia package for modeling and fitting generalized low rank models.

Just Go with the Flow: Self-Supervised Scene Flow Estimation

CVPR2021: Temporal Context Aggregation Network for Temporal Action Proposal Refinement

[NeurIPS 2020] Semi-Supervision (Unlabeled Data) & Self-Supervision Improve Class-Imbalanced / Long-Tailed Learning

In the case of your data having only 1 channel while want to use timm models

Pytorch implementation of SimSiam Architecture

Code and models for ICCV2021 paper "Robust Object Detection via Instance-Level Temporal Cycle Confusion".

Unicorn can be used for performance analyses of highly configurable systems with causal reasoning