Reinforcement Learning with Q-Learning Algorithm on gym's frozen lake environment implemented in python

Overview

Reinforcement Learning with Q Learning Algorithm

Q learning algorithm is trained on the gym's frozen lake environment.

Libraries Used

  • gym
  • Numpy
  • tqdm
  • Pytorch Deep Learning Framework

  • Install Requirement Files

    clone the repository or download the 'requirement.txt' files, then open terminal in the working directory and type
    'pip install -r requirements.txt'
    to install all the requirements for this project.

    Demo Video

    Q-learning.mp4
    Predict bus arrival time using VertexAI and Nvidia's Jetson Nano

    bus_prediction predict bus arrival time using VertexAI and Nvidia's Jetson Nano imagenet the command for imagenet.py look like this python3 /path/to/i

    10 Dec 22, 2022
    Streamlit tool to explore coco datasets

    What is this This tool given a COCO annotations file and COCO predictions file will let you explore your dataset, visualize results and calculate impo

    Jakub Cieslik 75 Dec 16, 2022
    A pytorch implementation of Reading Wikipedia to Answer Open-Domain Questions.

    DrQA A pytorch implementation of the ACL 2017 paper Reading Wikipedia to Answer Open-Domain Questions (DrQA). Reading comprehension is a task to produ

    Runqi Yang 394 Nov 08, 2022
    PyTorch implementation code for the paper MixCo: Mix-up Contrastive Learning for Visual Representation

    How to Reproduce our Results This repository contains PyTorch implementation code for the paper MixCo: Mix-up Contrastive Learning for Visual Represen

    opcrisis 46 Dec 15, 2022
    [CVPR'21] Locally Aware Piecewise Transformation Fields for 3D Human Mesh Registration

    Locally Aware Piecewise Transformation Fields for 3D Human Mesh Registration This repository contains the implementation of our paper Locally Aware Pi

    sfwang 70 Dec 19, 2022
    1st place solution to the Satellite Image Change Detection Challenge hosted by SenseTime

    1st place solution to the Satellite Image Change Detection Challenge hosted by SenseTime

    Lihe Yang 209 Jan 01, 2023
    Official and maintained implementation of the paper "OSS-Net: Memory Efficient High Resolution Semantic Segmentation of 3D Medical Data" [BMVC 2021].

    OSS-Net: Memory Efficient High Resolution Semantic Segmentation of 3D Medical Data Christoph Reich, Tim Prangemeier, Özdemir Cetin & Heinz Koeppl | Pr

    Christoph Reich 23 Sep 21, 2022
    PyTorch implementation of paper "StarEnhancer: Learning Real-Time and Style-Aware Image Enhancement" (ICCV 2021 Oral)

    StarEnhancer StarEnhancer: Learning Real-Time and Style-Aware Image Enhancement (ICCV 2021 Oral) Abstract: Image enhancement is a subjective process w

    IDKiro 133 Dec 28, 2022
    A template repository for submitting a job to the Slurm Cluster installed at the DISI - University of Bologna

    Cluster di HPC con GPU per esperimenti di calcolo (draft version 1.0) Per poter utilizzare il cluster il primo passo è abilitare l'account istituziona

    20 Dec 16, 2022
    Updated for TTS(CE) = Also Known as TTN V3. The code requires the first server to be 'ttn' protocol.

    Updated Updated for TTS(CE) = Also Known as TTN V3. The code requires the first server to be 'ttn' protocol. Introduction This balenaCloud (previously

    Remko 1 Oct 17, 2021
    TrackTech: Real-time tracking of subjects and objects on multiple cameras

    TrackTech: Real-time tracking of subjects and objects on multiple cameras This project is part of the 2021 spring bachelor final project of the Bachel

    5 Jun 17, 2022
    Java and SHACL code commented in the paper "Towards compliance checking in reified I/O logic via SHACL" submitted to ICAIL 2021

    shRIOL The subfolder shRIOL contains Java files to execute the SHACL files on the OWL ontology. To compile the Java files: "javac -cp ./src/;./lib/* -

    1 Dec 06, 2022
    Deep Networks with Recurrent Layer Aggregation

    RLA-Net: Recurrent Layer Aggregation Recurrence along Depth: Deep Networks with Recurrent Layer Aggregation This is an implementation of RLA-Net (acce

    Joy Fang 21 Aug 16, 2022
    A `Neural = Symbolic` framework for sound and complete weighted real-value logic

    Logical Neural Networks LNNs are a novel Neuro = symbolic framework designed to seamlessly provide key properties of both neural nets (learning) and s

    International Business Machines 138 Dec 19, 2022
    Select, weight and analyze complex sample data

    Sample Analytics In large-scale surveys, often complex random mechanisms are used to select samples. Estimates derived from such samples must reflect

    samplics 37 Dec 15, 2022
    An Implementation of SiameseRPN with Feature Pyramid Networks

    SiameseRPN with FPN This project is mainly based on HelloRicky123/Siamese-RPN. What I've done is just add a Feature Pyramid Network method to the orig

    3 Apr 16, 2022
    Repository of Jupyter notebook tutorials for teaching the Deep Learning Course at the University of Amsterdam (MSc AI), Fall 2020

    Repository of Jupyter notebook tutorials for teaching the Deep Learning Course at the University of Amsterdam (MSc AI), Fall 2020

    Phillip Lippe 1.1k Jan 07, 2023
    Convolutional Neural Network for Text Classification in Tensorflow

    This code belongs to the "Implementing a CNN for Text Classification in Tensorflow" blog post. It is slightly simplified implementation of Kim's Convo

    Denny Britz 5.5k Jan 02, 2023
    A simple Python configuration file operator.

    A simple Python configuration file operator This project provides a common way to read configurations using config42. Installation It is possible to i

    Scott Lau 2 Nov 08, 2021
    Simple converter for deploying Stable-Baselines3 model to TFLite and/or Coral

    Running SB3 developed agents on TFLite or Coral Introduction I've been using Stable-Baselines3 to train agents against some custom Gyms, some of which

    Gary Briggs 16 Oct 11, 2022