Deeprl - Standard DQN and dueling network for simple games

Last update: Apr 12, 2020

Overview

DeepRL

This code implements the standard deep Q-learning and dueling network with experience replay (memory buffer) for playing simple games.

DQN algorithm implemented in this code is from the Google DeepMind's paper Playing Atari with Deep Reinforcement Learning[link].

Dueling network is from the paper Dueling Network Architectures for Deep Reinforcement Learning [link]

Requirement

DeepRL is implemented with Torch and the packages of its ecosystem. This code is well worked on my Mac Pro with CPU (I haven't tested it on Linux and GPU). Install Torch7 firstly, then you should install the following packages by luarocks

luarocks install nn
luarocks install image
luarocks install qt
luarocks install optim

Running

You can run this code by tapping the command in the project dir.

qlua main.lua

The result looks like

DQN: I got the accuracy of 93.2% (932 success of 1000 epochs).

Dueling: I got the accuracy of 99.2% (992 success of 1000 epochs).

Code

The envir.lua indicates the environment in reinforcement learning stage, which receives the action and produces the states and a reward for agent.

The agent.lua is the implementation of agent which receives the states and reward to produce the action directed by the policy network.

The learner.lua is the learning algorithm of DQN with experience replay as the following.

MISC

I completed this code when I was an intern at Horizon Robotics. I will greatly thank the article of Andrej Karpathy and other implementations:SeanNaren's code and EderSantana's gist.

LICENSE

MIT

Deeprl - Standard DQN and dueling network for simple games

Related tags

Overview

DeepRL

Requirement

Running

Code

MISC

LICENSE

Owner

Yao Zhou

ChineseBERT: Chinese Pretraining Enhanced by Glyph and Pinyin Information

Yolact-keras实例分割模型在keras当中的实现

Export CenterPoint PonintPillars ONNX Model For TensorRT

Automatically replace ONNX's RandomNormal node with Constant node.

Navigating StyleGAN2 w latent space using CLIP

Pytorch implementation of Each Part Matters: Local Patterns Facilitate Cross-view Geo-localization https://arxiv.org/abs/2008.11646

Benchmarks for Model-Based Optimization

An offline deep reinforcement learning library

U-Net Implementation: Convolutional Networks for Biomedical Image Segmentation" using the Carvana Image Masking Dataset in PyTorch

This repository includes code of my study about Asynchronous in Frequency domain of GAN images.

NeurIPS 2021 paper 'Representation Learning on Spatial Networks' code

Code of the paper "Deep Human Dynamics Prior" in ACM MM 2021.

A python script to convert images to animated sus among us crewmate twerk jifs as seen on r/196

Soft actor-critic is a deep reinforcement learning framework for training maximum entropy policies in continuous domains.

Reduce end to end training time from days to hours (or hours to minutes), and energy requirements/costs by an order of magnitude using coresets and data selection.

基于Pytorch实现优秀的自然图像分割框架！(包括FCN、U-Net和Deeplab)

Highway networks implemented in PyTorch.

Code for the RA-L (ICRA) 2021 paper "SeqNet: Learning Descriptors for Sequence-Based Hierarchical Place Recognition"

PySOT - SenseTime Research platform for single object tracking, implementing algorithms like SiamRPN and SiamMask.

Corruption Invariant Learning for Re-identification