PyTorch implementation of Munchausen Reinforcement Learning based on DQN and SAC. Handles both discrete and continuous action spaces.

Last update: Mar 10, 2022

Overview

Exploring Munchausen Reinforcement Learning

This is the project repository of my team in the "Advanced Deep Learning for Robotics" course at TUM. Our project's topic is "Exploring Munchausen Reinforcement Learning" based on this paper.

For a detailed discussion, see the report and the final presentation.

Setup

Create a virtual environment.
Run pip3 install -r requirements.txt

Code Structure

This repository is structured as follows:

The directories M-DQN and M-SAC contain the implementations of the RL agents DQN and SAC extended with the Munchausen term, respectively.
The directory rl-baselines3-zoo contains a copy of the original rl-baselines3-zoo repository, to which we added our M-DQN implementation so that the M-DQN agent can easily be trained and tested on benchmark environments and compared with other classical agents. To do so, follow the steps described in the original repository and pass M-DQN as the agent argument.
The directory particles-env contains a modified version of this repository. The modified version contains code for a particles environment, in which an agent tries to reach a goal while avoiding obstacles. It also includes an implementation of the M-SAC agent, so that it can be trained and compared with the classical SAC agent.
The directory action-gap contains callbacks for the experiment manager of rl-baselines3-zoo that log the action gap to TensorBoard.
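As a rough illustration of what the Munchausen term and the action gap refer to, here is a minimal sketch in plain Python. It is not the code in M-DQN or action-gap. The function names, the default hyperparameters (alpha=0.9, tau=0.03, clipping at -1, taken from the Munchausen RL paper), and the assumption that the Q-values come from a target network are all illustrative choices. The action gap is taken here to mean the difference between the best and second-best Q-value in a state.

```python
import math


def log_softmax_policy(q_values, tau):
    """Log-probabilities of the softmax policy pi = softmax(q / tau).

    Subtracting the maximum first keeps the exponentials numerically stable.
    """
    q_max = max(q_values)
    scaled = [(q - q_max) / tau for q in q_values]
    log_z = math.log(sum(math.exp(s) for s in scaled))
    return [s - log_z for s in scaled]


def munchausen_dqn_target(reward, action, q_s, q_next, done,
                          gamma=0.99, alpha=0.9, tau=0.03, clip_min=-1.0):
    """Regression target for one transition (hypothetical helper).

    q_s and q_next are lists of Q-values for the current and next state,
    assumed to come from the target network.
    """
    # Munchausen bonus: scaled log-policy of the action actually taken,
    # clipped to [clip_min, 0] before being weighted by alpha.
    log_pi_s = log_softmax_policy(q_s, tau)
    bonus = alpha * max(clip_min, min(0.0, tau * log_pi_s[action]))

    # Soft (entropy-regularized) value of the next state.
    log_pi_next = log_softmax_policy(q_next, tau)
    soft_value = sum(math.exp(lp) * (q - tau * lp)
                     for lp, q in zip(log_pi_next, q_next))

    bootstrap = 0.0 if done else gamma * soft_value
    return reward + bonus + bootstrap


def action_gap(q_values):
    """Difference between the largest and second-largest Q-value."""
    best, second = sorted(q_values, reverse=True)[:2]
    return best - second
```

Setting alpha to 0 removes the bonus and leaves a soft-DQN target, which makes the sketch convenient for comparing the two variants on hand-picked Q-values.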


Owner

Mohamed Amine Ketata
