A collection of Reinforcement Learning algorithms from Sutton and Barto's book and other research papers implemented in Python.

Last update: Dec 28, 2022

Overview

Reinforcement-Learning-Notebooks

A collection of Reinforcement Learning algorithms from Sutton and Barto's book and other research papers implemented in Python.

I wrote these notebooks in March 2017 while I took the COMP 767: Reinforcement Learning [5] class by Prof. Doina Precup at McGill, Montréal. I highly recommend you to go through the class notes and references of all the papers the intructors have posted on the website.

These notebooks should be used while you read the book and go beyond the same with the referenced papers. I would suggest watching David Silver's videos and reading the book simultaneously. And when you are done with a few chapters, start implementing them. The algorithms follow a pattern and mostly are variants of each other. I have tried my best to explain each notebook's results and possible future directions.

Disclaimer: The code is a little messy. I'd written this when I was not a Pythonista. If you would like to clean them up and want to make it into a nice interface, feel free to contact me. I will be very pleased to collaborate. If you use them then please cite the source and also mention the credits as listed below. Also, email me with ways to improve, let me know if you find any bugs.

Feel free to reach me at [email protected] or see my website here

Special Credits:

[1] Denny Britz

[2] Monica Patel

[3] Sutton and Barto

[4] David Silver

[5] Doina Precup's course

A collection of Reinforcement Learning algorithms from Sutton and Barto's book and other research papers implemented in Python.

Related tags

Overview

Reinforcement-Learning-Notebooks

A collection of Reinforcement Learning algorithms from Sutton and Barto's book and other research papers implemented in Python.

Owner

Pulkit Khandelwal

Anatomy of Matplotlib -- tutorial developed for the SciPy conference

FlowTorch is a PyTorch library for learning and sampling from complex probability distributions using a class of methods called Normalizing Flows

scAR (single-cell Ambient Remover) is a package for data denoising in single-cell omics.

Async API for controlling Hue Lights

Art Project "Schrödinger's Game of Life"

This is a virtual picture dragging application. Users may virtually slide photos across the screen. The distance between the index and middle fingers determines the movement. Smaller distances indicate click and motion, whereas bigger distances indicate only hand movement.

This a classic fintech problem that introduces real life difficulties such as data imbalance. Check out the notebook to find out more!

Exploring Machine Learning Models for detecting anomalous behavior in credit-card transactions. It's crucial that credit-card companies are able to recognize fraudulent activity so that customers are not charged for items they didn't purchase.

PyTorch implementation of Deformable Convolution

Discord bot for notifying on github events

OCTIS: Comparing Topic Models is Simple! A python package to optimize and evaluate topic models (accepted at EACL2021 demo track)

Prefix-Tuning: Optimizing Continuous Prompts for Generation

PyTorch Lightning + Hydra. A feature-rich template for rapid, scalable and reproducible ML experimentation with best practices. ⚡🔥⚡

Image Recognition using Pytorch

Implementation of a Transformer, but completely in Triton

Scrutinizing XAI with linear ground-truth data

Scikit-event-correlation - Event Correlation and Forecasting over High Dimensional Streaming Sensor Data algorithms

OpenMMLab Text Detection, Recognition and Understanding Toolbox

Attendance Monitoring with Face Recognition using Python

Locally Most Powerful Bayesian Test for Out-of-Distribution Detection using Deep Generative Models