Update for using DQN to play sekiro 2021.2.2（English Version）

This is the code of using DQN to play Sekiro .

I am very glad to tell that I have writen the codes of using DQN to play Sekiro . As is known to all , Supervised learning can only learn skills from the data we provide for it . However , this time by using Reinforcement Learning , we can see a more clever agent playing Sekiro .

Reinforcement Learning can update its network by itself , using the reward feedback , which means we no longer need to collect our own data sets this time . All the data sets come from the real-time interaction between DQN network and the game. By using this DQN network , you can fight any boss you want in the game . There still something you need to know ：

In order to shorten the time of restart the game , we need game modifier .
The link for game modifier : https://patch.ali213.net/showpatch/118405.html
See the tutorial video for specific code usage , link : https://www.bilibili.com/video/BV1by4y1n7pe/

Have fun !

Old version sekiro_tensorflow

Code link for using Supervised learning to play Sekiro : https://github.com/analoganddigital/sekiro_tensorflow

Hello everyone , this is analoganddigital . I use this code to complete an interesting porgram of using machine learning to play Sekiro . You can see the final presentation in https://www.bilibili.com/video/BV1wC4y1s7oa/ . I am a junior student in university , which means I can't spend too much time on this program . What a shame ! On the other hand , many audiences hope me share this code . Thus , I eventually put it on the GitHub . This is an interesting program , and I hope everyone can enjoy it. In addition , I really welcome you to improve this program , to make this AI more smart ! There still something you need to konw:

The window size I set is 96*86 , you can change it by yourselves .
I finally collected 300M training data , if you want better result , maybe you need to collect more data .
I use Alexnet to finish the training . This program is depend on Supervised learning.
I have no idea about using Reinforcement learning yet , so I will really appreciate it if someone can help me to overcome this difficulty.(already finished)
See the tutorial video for specific code usage , link : https://www.bilibili.com/video/BV1bz4y1R7kB

Reference ： https://github.com/Sentdex/pygta5/blob/master/LICENSE

更新——强化学习DQN打只狼 2021.2.2（中文说明）

我非常高兴地告诉大家，我最近又开发出了用DQN强化学习打只狼的代码。众所周知，监督学习只能学习到我们所提供的数据集的相关技能，但是利用强化学习，我们将看到一个完全不一样的只狼。

强化学习会根据reward奖励进行判断并且自己学习一种打斗方法。更重要的是，我们这次不再需要自己收集数据集了，所有更新数据均来自于DQN网络与游戏的实时交互。利用这个DQN代码（链接见下方），你可以挑战只狼中任何一个boss，只要boss的血条位置不变即可（因为我采用的是图像抓取的方式获取只狼的血量与boss的血量进行reward判断）。然后还有一些注意事项：

为了缩短只狼复活周期，在这个项目训练中，我们需要采用只狼的24项修改器，让只狼能够原地复活继续训练。
修改器下载地址：https://patch.ali213.net/showpatch/118405.html
具体代码使用方法请见我在b站上发布的DQN打只狼的教程视频，链接：https://www.bilibili.com/video/BV1by4y1n7pe/

祝各位玩得愉快！

旧版本用机器学习打只狼

旧版本的利用监督学习打只狼的代码链接： https://github.com/analoganddigital/sekiro_tensorflow 。

各位观众大家好，我GitHub用户名是analoganddigital。我用这个程序完成了机器学习打只狼这个项目。最终效果视频可以看b站https://www.bilibili.com/video/BV1wC4y1s7oa/ 。我是一个大三学生，真的非常抱歉没能长时间更新这个项目，所以我把它放到了GitHub上面，之前很多观众也是私信我想要代码。总之我还是希望大家能喜欢这个小项目吧。当然，我非常希望大家能帮忙完善这个程序，万分感激，大家共同讨论我们会获益更多，这其实就是开源的意义。现在由于代码比较基础，所以训练效果不太好。我相信大家会有更多的点子，如果能更新一点算法，我们将会看到一个更机智的AI。我很感谢大家对之前视频的支持（受宠若惊），也十分期待大家有趣的优化，就算没有优化直接用也可以。还有一些细节我这声明一下：

我截取的图像大小是96*86的，各位可以根据自身情况选择。
我最终只收集了300M的数据，如果你想训练效果更好的话，可能要收集更多。
我用的神经网络是Alexnet，基于监督学习完成的。
由于我能力有限，我还没想好如何用强化学习优化算法，所以如果有大佬能分享一下自己的才华，那将十分感谢。（目前已经实现）
具体代码使用方法请见我在b站上发布的机器学习打只狼的教程视频，链接： https://www.bilibili.com/video/BV1bz4y1R7kB

部分参考代码： https://github.com/Sentdex/pygta5/blob/master/LICENSE

This is the code of using DQN to play Sekiro .

Related tags

Overview

Update for using DQN to play sekiro 2021.2.2（English Version）

Old version sekiro_tensorflow

更新——强化学习DQN打只狼 2021.2.2（中文说明）

旧版本用机器学习打只狼

Owner

An unofficial PyTorch implementation of a federated learning algorithm, FedAvg.

This project aims at providing a concise, easy-to-use, modifiable reference implementation for semantic segmentation models using PyTorch.

The implementation of 'Image synthesis via semantic composition'.

This repository contains FEDOT - an open-source framework for automated modeling and machine learning (AutoML)

Fast convergence of detr with spatially modulated co-attention

moving object detection for satellite videos.

A rule learning algorithm for the deduction of syndrome definitions from time series data.

Complex-Valued Neural Networks (CVNN)Complex-Valued Neural Networks (CVNN)

A deep learning based semantic search platform that computes similarity scores between provided query and documents

D²Conv3D: Dynamic Dilated Convolutions for Object Segmentation in Videos

A Kaggle competition: discriminate gender based on handwriting

Effect of Different Encodings and Distance Functions on Quantum Instance-based Classifiers

SMPLpix: Neural Avatars from 3D Human Models

Code for Universal Semi-Supervised Semantic Segmentation models paper accepted in ICCV 2019

Language-Agnostic Website Embedding and Classification

PSPNet in Chainer

PenguinSpeciesPredictionML - Basic model to predict Penguin species based on beak size and sex.

Towards Ultra-Resolution Neural Style Transfer via Thumbnail Instance Normalization

ECCV18 Workshops - Enhanced SRGAN. Champion PIRM Challenge on Perceptual Super-Resolution. The training codes are in BasicSR.

TensorFlow ROCm port