A collection of Reinforcement Learning algorithms from Sutton and Barto's book and other research papers implemented in Python.

Last update: Dec 28, 2022

Overview

Reinforcement-Learning-Notebooks

I wrote these notebooks in March 2017 while taking the COMP 767: Reinforcement Learning [5] class taught by Prof. Doina Precup at McGill, Montréal. I highly recommend going through the class notes and the references to all the papers the instructors have posted on the course website.

These notebooks are meant to be used alongside the book, and the referenced papers take you beyond it. I would suggest watching David Silver's videos and reading the book at the same time. Once you are done with a few chapters, start implementing them. The algorithms follow a pattern and are mostly variants of each other. I have tried my best to explain each notebook's results and possible future directions.
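As a small illustration of the remark that the algorithms are mostly variants of each other, here is a minimal sketch of the one-step tabular updates for SARSA and Q-learning as described in Sutton and Barto. It is not taken from the notebooks; the function names (`td_target`, `td_update`), the dictionary-based Q-table, and the toy states and actions are all assumptions made for this example. The two methods share the same update rule and differ only in how the bootstrapped target is computed.

```python
from collections import defaultdict


def td_target(Q, reward, next_state, next_action, actions, gamma, method):
    """Return the bootstrapped target for a one-step tabular TD update.

    Hypothetical helper for illustration only.
    """
    if method == "sarsa":
        # On-policy: bootstrap from the action actually taken next.
        return reward + gamma * Q[(next_state, next_action)]
    if method == "q_learning":
        # Off-policy: bootstrap from the greedy action in the next state.
        return reward + gamma * max(Q[(next_state, a)] for a in actions)
    raise ValueError(f"unknown method: {method}")


def td_update(Q, state, action, target, alpha):
    """Move Q(state, action) a fraction alpha toward the target."""
    Q[(state, action)] += alpha * (target - Q[(state, action)])


# Toy Q-table (assumed values): unseen state-action pairs default to 0.0.
actions = ["left", "right"]
Q = defaultdict(float)
Q[("s1", "left")] = 1.0
Q[("s1", "right")] = 2.0

# Same transition (s0, left, reward=1, s1), two different targets.
sarsa_target = td_target(Q, 1.0, "s1", "left", actions, 0.5, "sarsa")
q_target = td_target(Q, 1.0, "s1", "left", actions, 0.5, "q_learning")

# Apply the shared update rule with the Q-learning target.
td_update(Q, "s0", "left", q_target, alpha=0.5)
```

Keeping the target computation separate from the update step makes the shared pattern visible: other one-step methods can be tried by adding another branch for the target while leaving `td_update` unchanged.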

Disclaimer: The code is a little messy; I wrote it before I was a Pythonista. If you would like to clean the notebooks up and turn them into a nicer interface, feel free to contact me. I would be very pleased to collaborate. If you use them, please cite the source and mention the credits listed below. Also, email me with suggestions for improvement and let me know if you find any bugs.

Feel free to reach me at [email protected] or see my website here

Special Credits:

[1] Denny Britz

[2] Monica Patel

[3] Sutton and Barto

[4] David Silver

[5] Doina Precup's course

Owner

Pulkit Khandelwal
