A project studying the influence of communication in multi-objective normal-form games

Last update: Dec 17, 2021

Related tags

Overview

Communication in Multi-Objective Normal-Form Games

This repo consists of five different types of agents that we have used in our study of communication in multi-objective normal-form games. The settings that involve communication do this following a leader-follower model as seen in Stackelberg games. In such settings, agents switch in a round-robin fashion between being the leader and communicating something and being the follower and observing the communication.

No communication setting

In this setting two agents play a normal-form game for a certain amount of episodes. This experiment serves as a baseline for all other experiments.

Cooperative action communication setting

In this setting, agents communicate the next action that they will play. The follower uses this message to pre-update their policy. This setting is similar to Iterated Best Response and attempts to find the optimal joint policy.

Competitive action communication setting

This setting places the agents in a more competitive environment. This means that agents learn a specific best-response policy to every possible message. As such, agent's are not optimising for an optimal joint policy, but rather are acting in a self-interested manner.

Cooperative policy communication setting

This setting follows the same dynamics as the cooperative action communication setting, but communicates the entire policy instead of the next action that will be played.

Optional communication setting

The last setting gives agents the chance to learn for themselves whether communication helps them. All agents learn a top-level policy that chooses whether they will communicate when they are the leader or not. They also have two low-level agents, one "no communication agent" and one agent that does communicate. Which agent that is used as the communicating agent, is completely optional. When agents choose to communicate, they utilise their lower level communicating agent. When agents opt out of communication, they utilise their lower level no communication agent.

Getting Started

Experiments can be run from the MONFG.py file. There are 5 MONFGs available, having different equilibria properties under the SER optimisation criterion, using the specified non linear utility functions. You can also specify the type of experiment to run and other parameters.

License

This project is licensed under the GNU General Public License v3.0 - see the LICENSE file for details

A project studying the influence of communication in multi-objective normal-form games

Related tags

Overview

Communication in Multi-Objective Normal-Form Games

No communication setting

Cooperative action communication setting

Competitive action communication setting

Cooperative policy communication setting

Optional communication setting

Getting Started

License

Owner

Willem Röpke

COD-Rank-Localize-and-Segment (CVPR2021)

A simple approach to emable dense segmentation with ViT.

Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more

Storage-optimizer - Identify potintial optimizations on the cloud storage accounts

An integration of several popular automatic augmentation methods, including OHL (Online Hyper-Parameter Learning for Auto-Augmentation Strategy) and AWS (Improving Auto Augment via Augmentation Wise Weight Sharing) by Sensetime Research.

Implementation of Rotary Embeddings, from the Roformer paper, in Pytorch

A repo that contains all the mesh keys needed for mesh backend, along with a code example of how to use them in python

A general and strong 3D object detection codebase that supports more methods, datasets and tools (debugging, recording and analysis).

Pytorch implementation of Decoupled Spatial-Temporal Transformer for Video Inpainting

A synthetic texture-invariant dataset for object detection of UAVs

PyTorch image models, scripts, pretrained weights -- ResNet, ResNeXT, EfficientNet, EfficientNetV2, NFNet, Vision Transformer, MixNet, MobileNet-V3/V2, RegNet, DPN, CSPNet, and more

Unified API to facilitate usage of pre-trained "perceptor" models, a la CLIP

8-week curriculum for AI Builders

A way to store images in YAML.

Abstractive opinion summarization system (SelSum) and the largest dataset of Amazon product summaries (AmaSum). EMNLP 2021 conference paper.

Code to reproduce the results for Compositional Attention

Beyond Image to Depth: Improving Depth Prediction using Echoes (CVPR 2021)

Resco: A simple python package that report the effect of deep residual learning

The object detection pipeline is based on Ultralytics YOLOv5

Dataset and Source code of paper 'Enhancing Keyphrase Extraction from Academic Articles with their Reference Information'.