Reinforcement Learning with Q-Learning Algorithm on gym's frozen lake environment implemented in python

Overview

Reinforcement Learning with Q Learning Algorithm

Q learning algorithm is trained on the gym's frozen lake environment.

Libraries Used

  • gym
  • Numpy
  • tqdm
  • Pytorch Deep Learning Framework

  • Install Requirement Files

    clone the repository or download the 'requirement.txt' files, then open terminal in the working directory and type
    'pip install -r requirements.txt'
    to install all the requirements for this project.

    Demo Video

    Q-learning.mp4
    Pip-package for trajectory benchmarking from "Be your own Benchmark: No-Reference Trajectory Metric on Registered Point Clouds", ECMR'21

    Map Metrics for Trajectory Quality Map metrics toolkit provides a set of metrics to quantitatively evaluate trajectory quality via estimating consiste

    Mobile Robotics Lab. at Skoltech 31 Oct 28, 2022
    The code for our paper "AutoSF: Searching Scoring Functions for Knowledge Graph Embedding"

    AutoSF The code for our paper "AutoSF: Searching Scoring Functions for Knowledge Graph Embedding" and this paper has been accepted by ICDE2020. News:

    AutoML Research 64 Dec 17, 2022
    piSTAR Lab is a modular platform built to make AI experimentation accessible and fun. (pistar.ai)

    piSTAR Lab WARNING: This is an early release. Overview piSTAR Lab is a modular deep reinforcement learning platform built to make AI experimentation a

    piSTAR Lab 0 Aug 01, 2022
    Using Clinical Drug Representations for Improving Mortality and Length of Stay Predictions

    Using Clinical Drug Representations for Improving Mortality and Length of Stay Predictions Usage Clone the code to local. https://github.com/tanlab/MI

    Computational Biology and Machine Learning lab @ TOBB ETU 3 Oct 18, 2022
    Scikit-learn compatible estimation of general graphical models

    skggm : Gaussian graphical models using the scikit-learn API In the last decade, learning networks that encode conditional independence relationships

    213 Jan 02, 2023
    Official implementation of "Membership Inference Attacks Against Self-supervised Speech Models"

    Introduction Official implementation of "Membership Inference Attacks Against Self-supervised Speech Models". In this work, we demonstrate that existi

    Wei-Cheng Tseng 7 Nov 01, 2022
    Photo2cartoon - 人像卡通化探索项目 (photo-to-cartoon translation project)

    人像卡通化 (Photo to Cartoon) 中文版 | English Version 该项目为小视科技卡通肖像探索项目。您可使用微信扫描下方二维码或搜索“AI卡通秀”小程序体验卡通化效果。

    Minivision_AI 3.5k Dec 30, 2022
    Scale-aware Automatic Augmentation for Object Detection (CVPR 2021)

    SA-AutoAug Scale-aware Automatic Augmentation for Object Detection Yukang Chen, Yanwei Li, Tao Kong, Lu Qi, Ruihang Chu, Lei Li, Jiaya Jia [Paper] [Bi

    DV Lab 182 Dec 29, 2022
    Code for CVPR2021 paper "Robust Reflection Removal with Reflection-free Flash-only Cues"

    Robust Reflection Removal with Reflection-free Flash-only Cues (RFC) Paper | To be released: Project Page | Video | Data Tensorflow implementation for

    Chenyang LEI 162 Jan 05, 2023
    Run Effective Large Batch Contrastive Learning on Limited Memory GPU

    Gradient Cache Gradient Cache is a simple technique for unlimitedly scaling contrastive learning batch far beyond GPU memory constraint. This means tr

    Luyu Gao 198 Dec 29, 2022
    DeeBERT: Dynamic Early Exiting for Accelerating BERT Inference

    DeeBERT This is the code base for the paper DeeBERT: Dynamic Early Exiting for Accelerating BERT Inference. Code in this repository is also available

    Castorini 132 Nov 14, 2022
    A Lightweight Face Recognition and Facial Attribute Analysis (Age, Gender, Emotion and Race) Library for Python

    deepface Deepface is a lightweight face recognition and facial attribute analysis (age, gender, emotion and race) framework for python. It is a hybrid

    Sefik Ilkin Serengil 5.2k Jan 02, 2023
    Music library streaming app written in Flask & VueJS

    djtaytay This is a little toy app made to explore Vue, brush up on my Python, and make a remote music collection accessable through a web interface. I

    Ryan Tasson 6 May 27, 2022
    Visual Memorability for Robotic Interestingness via Unsupervised Online Learning (ECCV 2020 Oral and TRO)

    Visual Interestingness Refer to the project description for more details. This code based on the following paper. Chen Wang, Yuheng Qiu, Wenshan Wang,

    Chen Wang 36 Sep 08, 2022
    PyTorch implementation of PP-LCNet: A Lightweight CPU Convolutional Neural Network

    PyTorch implementation of PP-LCNet Reproduction of PP-LCNet architecture as described in PP-LCNet: A Lightweight CPU Convolutional Neural Network by C

    Quan Nguyen (Fly) 47 Nov 02, 2022
    PyTorch Implementation for "ForkGAN with SIngle Rainy NIght Images: Leveraging the RumiGAN to See into the Rainy Night"

    ForkGAN with Single Rainy Night Images: Leveraging the RumiGAN to See into the Rainy Night By Seri Lee, Department of Engineering, Seoul National Univ

    Seri Lee 52 Oct 12, 2022
    Reinforcement Learning for the Blackjack

    Reinforcement Learning for Blackjack Author: ZHA Mengyue Math Department of HKUST Problem Statement We study playing Blackjack by reinforcement learni

    Dolores 3 Jan 24, 2022
    SC-GlowTTS: an Efficient Zero-Shot Multi-Speaker Text-To-Speech Model

    SC-GlowTTS: an Efficient Zero-Shot Multi-Speaker Text-To-Speech Model Edresson Casanova, Christopher Shulby, Eren Gölge, Nicolas Michael Müller, Frede

    Edresson Casanova 92 Dec 09, 2022
    A multilingual version of MS MARCO passage ranking dataset

    mMARCO A multilingual version of MS MARCO passage ranking dataset This repository presents a neural machine translation-based method for translating t

    75 Dec 27, 2022
    Exact Pareto Optimal solutions for preference based Multi-Objective Optimization

    Exact Pareto Optimal solutions for preference based Multi-Objective Optimization

    Debabrata Mahapatra 40 Dec 24, 2022