用强化学习DQN算法，训练AI模型来玩合成大西瓜游戏，提供Keras版本和PARL（paddle）版本

Last update: Dec 17, 2022

Overview

用强化学习玩合成大西瓜

代码地址：https://github.com/Sharpiless/play-daxigua-using-Reinforcement-Learning

用强化学习DQN算法，训练AI模型来玩合成大西瓜游戏，提供Keras版本、PARL（paddle）版本和pytorch版本。

B站：https://space.bilibili.com/470550823

CSDN：https://blog.csdn.net/weixin_44936889

AI Studio：https://aistudio.baidu.com/aistudio/personalcenter/thirdview/67156

Github：https://github.com/Sharpiless

1. 打开游戏：

这里使用pygame重写了大西瓜游戏，并封装为适合RL环境的代码。

解压图片素材：

unzip res.zip

运行：

python Main.py

即可开始游戏：

2. 训练RL模型：

RL算法采用DQN算法，其中Keras版本使用了简单的卷积神经网络来计算Q值，PRAL版本使用ResNet。

运行：

python train_keras.py

或者

python train_paddle.py

或者

python train_torch.py

开始训练：

关注我的公众号：

感兴趣的同学关注我的公众号——可达鸭的深度学习教程：

用强化学习DQN算法，训练AI模型来玩合成大西瓜游戏，提供Keras版本和PARL（paddle）版本

Related tags

Overview

用强化学习玩合成大西瓜

1. 打开游戏：

2. 训练RL模型：

关注我的公众号：

Owner

Sarus implementation of classical ML models. The models are implemented using the Keras API of tensorflow 2. Vizualization are implemented and can be seen in tensorboard.

Feature extraction made simple with torchextractor

Implementations of paper Controlling Directions Orthogonal to a Classifier

UNION: An Unreferenced Metric for Evaluating Open-ended Story Generation

Hyperparameter tuning for humans

《LXMERT: Learning Cross-Modality Encoder Representations from Transformers》(EMNLP 2020)

Semi-Supervised Learning, Object Detection, ICCV2021

Isaac Gym Reinforcement Learning Environments

CLIP (Contrastive Language–Image Pre-training) trained on Indonesian data

A pure PyTorch batched computation implementation of "CIF: Continuous Integrate-and-Fire for End-to-End Speech Recognition"

An image base contains 490 images for learning (400 cars and 90 boats), and another 21 images for testingAn image base contains 490 images for learning (400 cars and 90 boats), and another 21 images for testing

Implementation of our recent paper, WOOD: Wasserstein-based Out-of-Distribution Detection.

A universal memory dumper using Frida

Minimal implementation of Denoised Smoothing: A Provable Defense for Pretrained Classifiers in TensorFlow.

Spatial Attentive Single-Image Deraining with a High Quality Real Rain Dataset (CVPR'19)

4th place solution for the SIGIR 2021 challenge.

Turn based roguelike in python

Pytorch code for "Text-Independent Speaker Verification Using 3D Convolutional Neural Networks".

Adversarial Adaptation with Distillation for BERT Unsupervised Domain Adaptation

Data, model training, and evaluation code for "PubTables-1M: Towards a universal dataset and metrics for training and evaluating table extraction models".