PyTorch implementation for the paper "StARformer: Transformer with State-Action-Reward Representations".

Last update: Dec 09, 2022

Overview

StARformer

This repository contains the PyTorch implementation for our paper, "StARformer: Transformer with State-Action-Reward Representations". StARformer learns local State-Action-Reward representations (StAR-representations) to improve long-sequence modeling for reinforcement learning and imitation learning.

Results

Installation

Dependencies can be installed with Conda:

conda env create -f my_env.yml

Then install the Atari ROMs.

Datasets

Please follow these instructions to obtain the datasets.

Example usage

See run.sh or below:

python run_star_atari.py --seed 123 --data_dir_prefix [data_directory] --epochs 10 --num_steps 500000 --num_buffers 50 --batch_size 64 --seq_len 30 --model_type 'star' --game 'Breakout'

Replace [data_directory] with the directory where you placed the Atari dataset.
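To run the same configuration over several seeds or games, the command above can be assembled programmatically. The following is a minimal sketch, not part of the repository: the helper name `build_command`, the `./data` path, and the extra seeds are hypothetical, and only the flags and values shown in the example command are used.

```python
import shlex


def build_command(data_dir, game="Breakout", model_type="star", seed=123):
    """Assemble the argument list for run_star_atari.py.

    Flag names and default values are copied from the README example;
    this helper itself is a hypothetical convenience wrapper.
    """
    args = {
        "--seed": seed,
        "--data_dir_prefix": data_dir,
        "--epochs": 10,
        "--num_steps": 500000,
        "--num_buffers": 50,
        "--batch_size": 64,
        "--seq_len": 30,
        "--model_type": model_type,
        "--game": game,
    }
    cmd = ["python", "run_star_atari.py"]
    for flag, value in args.items():
        cmd += [flag, str(value)]
    return cmd


if __name__ == "__main__":
    # "./data" and the seeds other than 123 are placeholders for illustration.
    for seed in (123, 231, 312):
        print(shlex.join(build_command("./data", seed=seed)))
```

Each printed line can be pasted into a shell, or the returned list can be passed to `subprocess.run` to launch the run directly.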

Variants (`model_type`):

'star' (imitation)
'star_rwd' (offline RL)
'star_fusion' (see Figure 4a in our paper)
'star_stack' (see Figure 4b in our paper)
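The four variants listed above can be checked before a run is launched. This is a minimal sketch under the assumption that the variant names are exactly those listed; the `MODEL_TYPES` table and the `parse_model_type` helper are hypothetical, and the repository's own argument handling may differ.

```python
import argparse

# Variant names and descriptions as listed in this README.
MODEL_TYPES = {
    "star": "imitation",
    "star_rwd": "offline RL",
    "star_fusion": "see Figure 4a in the paper",
    "star_stack": "see Figure 4b in the paper",
}


def parse_model_type(argv):
    """Parse --model_type and reject values that are not listed variants."""
    parser = argparse.ArgumentParser()
    parser.add_argument("--model_type", choices=sorted(MODEL_TYPES), default="star")
    return parser.parse_args(argv).model_type


if __name__ == "__main__":
    chosen = parse_model_type(["--model_type", "star_rwd"])
    print(f"{chosen}: {MODEL_TYPES[chosen]}")
```

With `choices` set, argparse exits with a usage error on an unknown variant name, so a typo is caught before any data is loaded.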

Acknowledgement

This code is based on Decision-Transformer.

Owner

Jinghuan Shang
