Do you want a RL agent nicely moving on Atari?

Rainbow is all you need!

This is a step-by-step tutorial from DQN to Rainbow. Every chapter contains both of theoretical backgrounds and object-oriented implementation. Just pick any topic in which you are interested, and learn! You can execute them right away with Colab even on your smartphone.

Please feel free to open an issue or a pull-request if you have any idea to make it better. :)

If you want a tutorial for policy gradient methods, please see PG is All You Need.

DQN [NBViewer] [Colab]
DoubleDQN [NBViewer] [Colab]
PrioritizedExperienceReplay [NBViewer] [Colab]
DuelingNet [NBViewer] [Colab]
NoisyNet [NBViewer] [Colab]
CategoricalDQN [NBViewer] [Colab]
N-stepLearning [NBViewer] [Colab]
Rainbow [NBViewer] [Colab]

Prerequisites

This repository is tested on Anaconda virtual environment with python 3.7+

$ conda create -n rainbow-is-all-you-need python=3.7
$ conda activate rainbow-is-all-you-need

Installation

First, clone the repository.

git clone https://github.com/Curt-Park/rainbow-is-all-you-need.git
cd rainbow-is-all-you-need

Secondly, install packages required to execute the code. Just type:

make setup

Contributors

Thanks goes to these wonderful people (emoji key):

_{Jinwoo Park (Curt)}

_{Kyunghwan Kim}

_{Wei Chen}

_{WANG Lei}

_leeyaf

_ahmadF

This project follows the all-contributors specification. Contributions of any kind welcome!

Rainbow is all you need! A step-by-step tutorial from DQN to Rainbow

Related tags

Overview

Rainbow is all you need!

Contents

Prerequisites

Installation

Related Papers

Contributors

Owner

Jinwoo Park (Curt)

Efficiently Disentangle Causal Representations

Code for the paper "A Study of Face Obfuscation in ImageNet"

Machine learning algorithms for many-body quantum systems

LSTM model trained on a small dataset of 3000 names written in PyTorch

NaturalProofs: Mathematical Theorem Proving in Natural Language

Run Effective Large Batch Contrastive Learning on Limited Memory GPU

Deep-Learning-Book-Chapter-Summaries - Attempting to make the Deep Learning Book easier to understand.

All-in-one Docker container that allows a user to explore Nautobot in a lab environment.

Random Forests for Regression with Missing Entries

[CVPR 2021] Unsupervised 3D Shape Completion through GAN Inversion

3D-printable hand-strapped keyboard

The official PyTorch code for NeurIPS 2021 ML4AD Paper, "Does Thermal data make the detection systems more reliable?"

Robust Video Matting in PyTorch, TensorFlow, TensorFlow.js, ONNX, CoreML!

Sample Code for "Pessimism Meets Invariance: Provably Efficient Offline Mean-Field Multi-Agent RL"

Transferable Unrestricted Attacks, which won 1st place in CVPR’21 Security AI Challenger: Unrestricted Adversarial Attacks on ImageNet.

Cave Generation using metaballs in Blender. Originally created by sdfgeoff, Edited by Myself (Archie Jaskowicz).

Face recognize and crop them

Official code repository for the publication "Latent Equilibrium: A unified learning theory for arbitrarily fast computation with arbitrarily slow neurons"

Code for the Lovász-Softmax loss (CVPR 2018)

A unified framework for machine learning with time series