Reinforcement Learning Tricks, Index

This repository contains the code for the paper "Distilling Reinforcement Learning Tricks for Video Games".

Short story shorter: RL algorithms are neat and all, but to get it to work in video games (RL competitions and whatnot), there are some nifty little tricks involved that need bit of expertise in the domain. This includes reward shaping, curriculum learning, splitting task into subtasks by hand and guiding agent's actions. We took some of these tricks and tried them on three environments with DQN. With right setup you get more out of DQN.

Code authors: Anssi Kanervisto, Christian Scheller and Yanick Schraner.

The experiments in the three environments are split into three git branches:

vizdoom for ViZDoom Deathmatch experiments
minerl for MineRL ObtainDiamond experiments
gfootball for Football environment experiments

To run the experiments, checkout the repository you want to run experiments for with git checkout [branch name], and follow the instructions in the README file there.

After running all the experiments, collect the results as described the respective branches. You should have three directories

vizdoom-runs
minerl-runs
football-runs

After this, running python plot_paper.py should create a figures/learning_curves.pdf file which summarizes the results.

Evaluating different engineering tricks that make RL work

Related tags

Overview

Reinforcement Learning Tricks, Index

Owner

Anssi

[ICCV 2021] Our work presents a novel neural rendering approach that can efficiently reconstruct geometric and neural radiance fields for view synthesis.

Code base for "On-the-Fly Test-time Adaptation for Medical Image Segmentation"

CSD: Consistency-based Semi-supervised learning for object Detection

Training PSPNet in Tensorflow. Reproduce the performance from the paper.

Code for the Lovász-Softmax loss (CVPR 2018)

Official Datasets and Implementation from our Paper "Video Class Agnostic Segmentation in Autonomous Driving".

Recursive Bayesian Networks

Empowering journalists and whistleblowers

As a part of the HAKE project, includes the reproduced SOTA models and the corresponding HAKE-enhanced versions (CVPR2020).

Simple Text-Generator with OpenAI gpt-2 Pytorch Implementation

Orange Chicken: Data-driven Model Generalizability in Crosslinguistic Low-resource Morphological Segmentation

Deeper insights into graph convolutional networks for semi-supervised learning

Implementation of ICCV21 paper: PnP-DETR: Towards Efficient Visual Analysis with Transformers

S2-BNN: Bridging the Gap Between Self-Supervised Real and 1-bit Neural Networks via Guided Distribution Calibration (CVPR 2021)

Codebase for the self-supervised goal reaching benchmark introduced in the LEXA paper

Evolving neural network parameters in JAX.

This repository contains answers of the Shopify Summer 2022 Data Science Intern Challenge.

(CVPR 2022) Pytorch implementation of "Self-supervised transformers for unsupervised object discovery using normalized cut"

OBG-FCN - implementation of 'Object Boundary Guided Semantic Segmentation'

PyTorch code for our paper "Attention in Attention Network for Image Super-Resolution"