An experiment on the performance of homemade Q-learning AIs in Agar.io depending on their state representation and available actions

Last update: Jun 09, 2022

Overview

Agar.io_Q-Learning_AI

An experiment on the performance of homemade Q-learning AIs in Agar.io depending on their state representation and available actions.

An image of the circle categorisation function in action. Food blobs are outlined in blue, edible cells in green and dangerous cells in red, according to where our program detects them. Detection is less reliable for circles cut off by the screen edges. The agent's action at this moment is labelled with the green arrow.
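To make the three categories concrete, here is a minimal sketch of how a detected circle could be sorted into them by radius. The README does not say how the real function decides, so the thresholds `FOOD_MAX_RADIUS` and `EAT_MARGIN` and the function name `categorise` are all hypothetical, and the actual code may work differently.

```python
# Hypothetical thresholds; the project's real values are not given in this README.
FOOD_MAX_RADIUS = 12.0  # circles at or below this radius are treated as food
EAT_MARGIN = 1.1        # assumed size advantage needed to eat another cell

# Outline colours as described in the image caption.
OUTLINE_COLOURS = {"food": "blue", "edible": "green", "dangerous": "red"}


def categorise(radius, own_radius):
    """Sort a detected circle into food, edible or dangerous by its radius."""
    if radius <= FOOD_MAX_RADIUS:
        return "food"
    if own_radius >= radius * EAT_MARGIN:
        return "edible"
    # Cells of similar or larger size are conservatively treated as dangerous.
    return "dangerous"
```

For example, `OUTLINE_COLOURS[categorise(50, own_radius=40)]` gives the colour a larger cell would be outlined in under these assumptions.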

States are calculated using the shortest Euclidean distance to each of the three circle types: food, edible cells and dangerous cells. Each distance is measured and then discretized according to the interval it falls within. The rulers in this image are to scale.
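The state calculation described above can be sketched roughly as follows. This is only an illustration: the interval boundaries in `BOUNDARIES`, the function names, and the choice to measure to circle centres are assumptions, not values or names taken from the project.

```python
import bisect
import math

# Hypothetical interval boundaries in pixels; the real ones are shown by the rulers.
BOUNDARIES = [50, 150, 300]


def nearest_distance(agent_xy, circles):
    """Shortest Euclidean distance from the agent to any circle centre (inf if none)."""
    ax, ay = agent_xy
    return min((math.hypot(x - ax, y - ay) for x, y in circles), default=math.inf)


def discretize(distance, boundaries=BOUNDARIES):
    """Index of the interval the distance falls within (0 is the closest interval)."""
    return bisect.bisect_right(boundaries, distance)


def encode_state(agent_xy, food, edible, dangerous):
    """One discretized distance per circle type, in the order food, edible, dangerous."""
    return tuple(
        discretize(nearest_distance(agent_xy, group))
        for group in (food, edible, dangerous)
    )
```

With three boundaries there are four intervals per circle type, so this sketch yields 4 × 4 × 4 = 64 possible states. A circle type that is not visible at all falls into the farthest interval.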

Currently the agent can't press any keyboard buttons; it can only move around using the mouse. Keyboard input could be added without too much hassle, but it would require reworking some aspects of the code and a ton of training, which already takes ages. The Q-learning part could also do with a proper implementation of stochastic Q-learning instead of our generic iterative Q-learning, if I knew how to do it. I look forward to learning that at a later point.
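For readers unfamiliar with the technique, here is a minimal sketch of a textbook tabular Q-learning update with epsilon-greedy action selection. It is not the project's actual code: the eight mouse directions in `ACTIONS`, the class name `QTable` and the default hyperparameters are all assumptions chosen for illustration.

```python
import random
from collections import defaultdict

# Hypothetical action set: eight mouse directions, indexed 0 to 7.
ACTIONS = list(range(8))


class QTable:
    """Textbook tabular Q-learning; a sketch, not the project's implementation."""

    def __init__(self, alpha=0.1, gamma=0.9, epsilon=0.1):
        self.q = defaultdict(float)  # maps (state, action) to an estimated value
        self.alpha = alpha           # learning rate
        self.gamma = gamma           # discount factor
        self.epsilon = epsilon       # probability of picking a random action

    def best_value(self, state):
        """Highest estimated value over all actions in the given state."""
        return max(self.q[(state, a)] for a in ACTIONS)

    def choose(self, state, rng=random):
        """Epsilon-greedy: explore with probability epsilon, otherwise exploit."""
        if rng.random() < self.epsilon:
            return rng.choice(ACTIONS)
        return max(ACTIONS, key=lambda a: self.q[(state, a)])

    def update(self, state, action, reward, next_state):
        """Move Q(state, action) towards reward + gamma * max Q(next_state, .)."""
        target = reward + self.gamma * self.best_value(next_state)
        self.q[(state, action)] += self.alpha * (target - self.q[(state, action)])
```

A training loop would call `choose` to pick a mouse direction, observe the reward and the next state from the game, and then call `update` once per step.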

Feel free to ask any questions about the code or the project. I hope you enjoy!

The humans in the experiment were restricted to the same move set as the bots and agents: mouse movement only.
