Reinforcement Learning with Q-Learning Algorithm on gym's frozen lake environment implemented in python

Overview

Reinforcement Learning with Q Learning Algorithm

Q learning algorithm is trained on the gym's frozen lake environment.

Libraries Used

  • gym
  • Numpy
  • tqdm
  • Pytorch Deep Learning Framework

  • Install Requirement Files

    clone the repository or download the 'requirement.txt' files, then open terminal in the working directory and type
    'pip install -r requirements.txt'
    to install all the requirements for this project.

    Demo Video

    Q-learning.mp4
    NNR conformation conditional and global probabilities estimation and analysis in peptides or proteins fragments

    NNR and global probabilities estimation and analysis in peptides or protein fragments This module calculates global and NNR conformation dependent pro

    0 Jul 15, 2021
    Text completion with Hugging Face and TensorFlow.js running on Node.js

    Katana ML Text Completion 🤗 Description Runs with with Hugging Face DistilBERT and TensorFlow.js on Node.js distilbert-model - converter from Hugging

    Katana ML 2 Nov 04, 2022
    Flexible Option Learning - NeurIPS 2021

    Flexible Option Learning This repository contains code for the paper Flexible Option Learning presented as a Spotlight at NeurIPS 2021. The implementa

    Martin Klissarov 7 Nov 09, 2022
    Export CenterPoint PonintPillars ONNX Model For TensorRT

    CenterPoint-PonintPillars Pytroch model convert to ONNX and TensorRT Welcome to CenterPoint! This project is fork from tianweiy/CenterPoint. I impleme

    CarkusL 149 Dec 13, 2022
    Official Pytorch implementation of Online Continual Learning on Class Incremental Blurry Task Configuration with Anytime Inference (ICLR 2022)

    The Official Implementation of CLIB (Continual Learning for i-Blurry) Online Continual Learning on Class Incremental Blurry Task Configuration with An

    NAVER AI 34 Oct 26, 2022
    Applicator Kit for Modo allow you to apply Apple ARKit Face Tracking data from your iPhone or iPad to your characters in Modo.

    Applicator Kit for Modo Applicator Kit for Modo allow you to apply Apple ARKit Face Tracking data from your iPhone or iPad with a TrueDepth camera to

    Andrew Buttigieg 3 Aug 24, 2021
    [AAAI2022] Source code for our paper《Suppressing Static Visual Cues via Normalizing Flows for Self-Supervised Video Representation Learning》

    SSVC The source code for paper [Suppressing Static Visual Cues via Normalizing Flows for Self-Supervised Video Representation Learning] samples of the

    7 Oct 26, 2022
    Training a deep learning model on the noisy CIFAR dataset

    Training-a-deep-learning-model-on-the-noisy-CIFAR-dataset This repository contai

    1 Jun 14, 2022
    Let's Git - Versionsverwaltung & Open Source Hausaufgabe

    Let's Git - Versionsverwaltung & Open Source Hausaufgabe Herzlich Willkommen zu dieser Hausaufgabe fĂĽr unseren MOOC: Let's Git! Wir hoffen, dass Du vi

    1 Dec 13, 2021
    A U-Net combined with a variational auto-encoder that is able to learn conditional distributions over semantic segmentations.

    Probabilistic U-Net + **Update** + An improved Model (the Hierarchical Probabilistic U-Net) + LIDC crops is now available. See below. Re-implementatio

    Simon Kohl 498 Dec 26, 2022
    A hybrid SOTA solution of LiDAR panoptic segmentation with C++ implementations of point cloud clustering algorithms. ICCV21, Workshop on Traditional Computer Vision in the Age of Deep Learning

    ICCVW21-TradiCV-Survey-of-LiDAR-Cluster Motivation In contrast to popular end-to-end deep learning LiDAR panoptic segmentation solutions, we propose a

    YimingZhao 103 Nov 22, 2022
    PyTorch implementation of the end-to-end coreference resolution model with different higher-order inference methods.

    End-to-End Coreference Resolution with Different Higher-Order Inference Methods This repository contains the implementation of the paper: Revealing th

    Liyan 52 Jan 04, 2023
    Where2Act: From Pixels to Actions for Articulated 3D Objects

    Where2Act: From Pixels to Actions for Articulated 3D Objects The Proposed Where2Act Task. Given as input an articulated 3D object, we learn to propose

    Kaichun Mo 69 Nov 28, 2022
    Backdoor Attack through Frequency Domain

    Backdoor Attack through Frequency Domain DEPENDENCIES python==3.8.3 numpy==1.19.4 tensorflow==2.4.0 opencv==4.5.1 idx2numpy==1.2.3 pytorch==1.7.0 Data

    5 Jun 18, 2022
    Code to accompany the paper "Finding Bipartite Components in Hypergraphs", which is published in NeurIPS'21.

    Finding Bipartite Components in Hypergraphs This repository contains code to accompany the paper "Finding Bipartite Components in Hypergraphs", publis

    Peter Macgregor 5 May 06, 2022
    Novel and high-performance medical image classification pipelines are heavily utilizing ensemble learning strategies

    An Analysis on Ensemble Learning optimized Medical Image Classification with Deep Convolutional Neural Networks Novel and high-performance medical ima

    14 Dec 18, 2022
    [CVPR 2021] "The Lottery Tickets Hypothesis for Supervised and Self-supervised Pre-training in Computer Vision Models" Tianlong Chen, Jonathan Frankle, Shiyu Chang, Sijia Liu, Yang Zhang, Michael Carbin, Zhangyang Wang

    The Lottery Tickets Hypothesis for Supervised and Self-supervised Pre-training in Computer Vision Models Codes for this paper The Lottery Tickets Hypo

    VITA 59 Dec 28, 2022
    GraphRNN: Generating Realistic Graphs with Deep Auto-regressive Models

    GraphRNN: Generating Realistic Graphs with Deep Auto-regressive Model This repository is the official PyTorch implementation of GraphRNN, a graph gene

    Jiaxuan 568 Dec 29, 2022
    CVPR2022 (Oral) - Rethinking Semantic Segmentation: A Prototype View

    Rethinking Semantic Segmentation: A Prototype View Rethinking Semantic Segmentation: A Prototype View, Tianfei Zhou, Wenguan Wang, Ender Konukoglu and

    Tianfei Zhou 239 Dec 26, 2022
    Automated Melanoma Recognition in Dermoscopy Images via Very Deep Residual Networks

    Introduction This repository contains the modified caffe library and network architectures for our paper "Automated Melanoma Recognition in Dermoscopy

    Lequan Yu 47 Nov 24, 2022