PyTorch implementation of Advantage async actor-critic Algorithms (A3C) in PyTorch

Last update: Dec 08, 2022

Overview

Advantage async actor-critic Algorithms (A3C) in PyTorch

@inproceedings{mnih2016asynchronous,
  title={Asynchronous methods for deep reinforcement learning},
  author={Mnih, Volodymyr and Badia, Adria Puigdomenech and Mirza, Mehdi and Graves, Alex and Lillicrap, Timothy P and Harley, Tim and Silver, David and Kavukcuoglu, Koray},
  booktitle={International Conference on Machine Learning},
  year={2016}}

This repository contains an implementation of Adavantage async Actor-Critic (A3C) in PyTorch based on the original paper by the authors and the PyTorch implementation by Ilya Kostrikov.

A3C is the state-of-art Deep Reinforcement Learning method.

Dependencies

Python 2.7
PyTorch
gym (OpenAI)
universe (OpenAI)
opencv (for env state processing)
visdom (for visualization)

Training

./train_lstm.sh

Test wigh trained weight after 169000 updates for PongDeterminisitc-v3.

./test_lstm.sh 169000

A test result video is available.

PyTorch implementation of Advantage async actor-critic Algorithms (A3C) in PyTorch

Related tags

Overview

Advantage async actor-critic Algorithms (A3C) in PyTorch

Dependencies

Training

Test wigh trained weight after 169000 updates for PongDeterminisitc-v3.

Check the loss curves of all threads in http://localhost:8097

References

Owner

LEI TAI

[WACV21] Code for our paper: Samuel, Atzmon and Chechik, "From Generalized zero-shot learning to long-tail with class descriptors"

Model parallel transformers in Jax and Haiku

Scripts and a shader to get you started on setting up an exported Koikatsu character in Blender.

Causal estimators for use with WhyNot

The software associated with a paper accepted at EMNLP 2021 titled "Open Knowledge Graphs Canonicalization using Variational Autoencoders".

The source code and data of the paper "Instance-wise Graph-based Framework for Multivariate Time Series Forecasting".

🗣️ Microsoft Edge TTS for Home Assistant, no need for app_key

Repository relating to the CVPR21 paper TimeLens: Event-based Video Frame Interpolation

ExCon: Explanation-driven Supervised Contrastive Learning

Additional environments compatible with OpenAI gym

A PyTorch implementation of "Pathfinder Discovery Networks for Neural Message Passing"

Self-Regulated Learning for Egocentric Video Activity Anticipation

Boostcamp CV Serving For Python

Improved Fitness Optimization Landscapes for Sequence Design

A Nim frontend for pytorch, aiming to be mostly auto-generated and internally using ATen.

Jetson Nano-based smart camera system that measures crowd face mask usage in real-time.

Convnet transfer - Code for paper How transferable are features in deep neural networks?

PyTorch implementation of the Crafting Better Contrastive Views for Siamese Representation Learning

implementation of paper - You Only Learn One Representation: Unified Network for Multiple Tasks

The 3rd place solution for competition