2048-expectimax

Simulating an AI playing 2048 using the Expectimax algorithm

The base game engine uses code from here.

The AI player is modeled as the max player and the computer as the chance player, which picks a random open cell and places a 2-tile there. The score returned by the game engine serves as the evaluation function value at the leaf nodes of the game tree.

You can play the game manually using the arrow keys. Pressing 'Enter' starts the AI player, and pressing 'Enter' again stops it. Read the game engine code in 'game.py' to see how it returns the game state and how it evaluates the score of an arbitrary game state after an arbitrary player move.

A depth-3 game tree means the tree should have the following levels:

root: player
level 1: computer
level 2: player
level 3: terminal with payoff (note that we say "terminal" to mean the leaf nodes in the shallow game tree, not the termination of the game itself)

This tree represents every game state reachable from the current state through one player-computer-player sequence: the player makes a move, the computer places a tile, the player makes another move, and the resulting score is evaluated.
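The depth-3 search described above can be sketched roughly as follows. This is a minimal illustration and not the code in this repository: the board representation (a list of lists with 0 for an empty cell), the helpers `merge_row_left` and `move`, and the `expectimax` signature are all assumptions made for the example, and the real engine in 'game.py' may expose a different interface. The sketch also assumes the score is the running sum of merged tile values.

```python
from typing import List, Optional, Tuple

Board = List[List[int]]  # hypothetical representation: 0 marks an empty cell
DIRECTIONS = ("left", "up", "right", "down")


def merge_row_left(row: List[int]) -> Tuple[List[int], int]:
    """Slide one row to the left, merging equal neighbours once each."""
    tiles = [v for v in row if v]
    out, points, i = [], 0, 0
    while i < len(tiles):
        if i + 1 < len(tiles) and tiles[i] == tiles[i + 1]:
            out.append(tiles[i] * 2)
            points += tiles[i] * 2
            i += 2
        else:
            out.append(tiles[i])
            i += 1
    return out + [0] * (len(row) - len(out)), points


def move(board: Board, direction: str) -> Tuple[Board, int]:
    """Apply a player move; return the new board and the points gained."""
    vertical = direction in ("up", "down")
    reverse = direction in ("right", "down")
    rows = [list(r) for r in zip(*board)] if vertical else [r[:] for r in board]
    out, total = [], 0
    for row in rows:
        merged, points = merge_row_left(row[::-1] if reverse else row)
        out.append(merged[::-1] if reverse else merged)
        total += points
    if vertical:
        out = [list(r) for r in zip(*out)]
    return out, total


def expectimax(board: Board, score: float, depth: int,
               player_turn: bool) -> Tuple[float, Optional[str]]:
    """Return (value, best_move); best_move is None at chance and leaf nodes."""
    if depth == 0:
        return score, None  # leaf of the shallow tree: payoff is the score
    if player_turn:
        best_value, best_move = float("-inf"), None
        for direction in DIRECTIONS:
            child, points = move(board, direction)
            if child == board:
                continue  # the move changes nothing, so it is not legal
            value, _ = expectimax(child, score + points, depth - 1, False)
            if value > best_value:
                best_value, best_move = value, direction
        if best_move is None:
            return score, None  # no legal move: treat the node as a leaf
        return best_value, best_move
    # Chance node: a 2-tile lands on each open cell with equal probability.
    cells = [(r, c) for r, row in enumerate(board)
             for c, v in enumerate(row) if v == 0]
    if not cells:
        return score, None
    total = 0.0
    for r, c in cells:
        child = [row[:] for row in board]
        child[r][c] = 2
        value, _ = expectimax(child, score, depth - 1, True)
        total += value
    return total / len(cells), None
```

Calling `expectimax(board, score, 3, True)` walks the levels listed above: the root maximizes over player moves, level 1 averages over tile placements, level 2 maximizes again, and level 3 returns the score as the payoff.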

Usage

To run the program:

    python main.py

Once the program is running, the following keyboard options are available in-game:

'r': restart the game
'u': undo a move
'3'-'7': change board size
'g': toggle grayscale

Owner

Subha Ramesh
