[NeurIPS 2021] Official implementation of paper "Learning to Simulate Self-driven Particles System with Coordinated Policy Optimization".

Last update: Dec 19, 2022

Overview

Code for Coordinated Policy Optimization

Webpage | Code | Paper | Talk (English) | Talk (Chinese)

Hi there! This is the source code of the paper “Learning to Simulate Self-driven Particles System with Coordinated Policy Optimization”.

Please follow the tutorial below to kick off the reproduction of our results.

Installation

# Create virtual environment
conda create -n copo python=3.7
conda activate copo

# Install dependencies
pip install metadrive-simulator==0.2.3
pip install torch  # Make sure torch installs successfully, especially when using a GPU!

# Install environment and algorithm.
cd code
pip install -e .
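
To confirm that torch installed correctly, a quick check along the following lines may help. This is only a minimal sketch and is not part of this repository; the helper name check_torch is hypothetical.

```python
import importlib.util


def check_torch():
    """Report whether torch is importable and whether it can see a GPU.

    Hypothetical helper for illustration; not shipped with this repository.
    """
    status = {"installed": False, "version": None, "cuda": False}
    # find_spec returns None when no top-level package named "torch" exists.
    if importlib.util.find_spec("torch") is None:
        return status
    try:
        import torch
    except Exception:
        # The package is present but fails to load (e.g. a broken install).
        return status
    status["installed"] = True
    status["version"] = torch.__version__
    status["cuda"] = bool(torch.cuda.is_available())
    return status


if __name__ == "__main__":
    print(check_torch())
```

If "cuda" is reported as False on a GPU machine, the installed torch build probably does not match the local CUDA setup.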

Training

As a quick start, you can begin training CoPO in the Intersection environment immediately after installation by running:

cd code/copo/
python inter/train_copo_dist.py --exp-name inter_copo_dist

The general way to run training is as follows:

cd code/copo/
python ENV/train_ALGO.py --exp-name EXPNAME

Here ENV is the shorthand for the environment:

round  # Roundabout
inter  # Intersection
bottle  # Bottleneck
parking  # Parking Lot
tollgate  # Tollgate

and ALGO is the shorthand for algorithms:

ippo  # Individual Policy Optimization
ccppo  # Mean Field Policy Optimization
cl  # Curriculum Learning
copo_dist  # Coordinated Policy Optimization (Ours)
copo_dist_cc  # Coordinated Policy Optimization with Centralized Critics

Finally, EXPNAME is an arbitrary name that identifies the experiment (with multiple concurrent trials), such as roundabout_copo.
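
The ENV/ALGO/EXPNAME pattern above can be sketched as a small helper that validates the shorthands and assembles the command. This is an illustrative sketch only: build_train_command is a hypothetical name, and it assumes that a train_ALGO.py script exists for every listed ENV/ALGO pair, which should be checked against the repository.

```python
# Shorthands copied from the lists above.
ENVS = {
    "round": "Roundabout",
    "inter": "Intersection",
    "bottle": "Bottleneck",
    "parking": "Parking Lot",
    "tollgate": "Tollgate",
}
ALGOS = {
    "ippo": "Individual Policy Optimization",
    "ccppo": "Mean Field Policy Optimization",
    "cl": "Curriculum Learning",
    "copo_dist": "Coordinated Policy Optimization (Ours)",
    "copo_dist_cc": "Coordinated Policy Optimization with Centralized Critics",
}


def build_train_command(env, algo, exp_name):
    """Return the training command as an argument list.

    Hypothetical helper; assumes the script path follows ENV/train_ALGO.py.
    """
    if env not in ENVS:
        raise ValueError(f"Unknown ENV {env!r}; expected one of {sorted(ENVS)}")
    if algo not in ALGOS:
        raise ValueError(f"Unknown ALGO {algo!r}; expected one of {sorted(ALGOS)}")
    return ["python", f"{env}/train_{algo}.py", "--exp-name", exp_name]


if __name__ == "__main__":
    # Reproduces the quick-start command shown earlier.
    print(" ".join(build_train_command("inter", "copo_dist", "inter_copo_dist")))
```

The returned list can be passed to subprocess.run(cmd, check=True) from inside code/copo/ to launch the run.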

Visualization

We provide the trained models for all algorithms in all environments. A single command visualizes the behaviors of the populations:

cd copo
python vis.py 

# By default, this shows the CoPO population in the Intersection environment.
# If you want to see others, try:
python vis.py --env round --algo ippo

# Or you can use the native renderer for 3D rendering:
# (Press H to show helper message)
python vis.py --env tollgate --algo cl --use_native_render
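
The three invocations above differ only in their optional flags, which can be sketched as follows. The helper vis_command is hypothetical and uses only the flags shown above (--env, --algo, --use_native_render); it does not check which values vis.py actually accepts.

```python
def vis_command(env=None, algo=None, native_render=False):
    """Build a vis.py command as an argument list.

    Hypothetical helper. Omitted options fall back to the defaults of
    vis.py itself (the CoPO population in the Intersection environment).
    """
    cmd = ["python", "vis.py"]
    if env is not None:
        cmd += ["--env", env]
    if algo is not None:
        cmd += ["--algo", algo]
    if native_render:
        # Switches to the native 3D renderer.
        cmd.append("--use_native_render")
    return cmd


if __name__ == "__main__":
    print(" ".join(vis_command()))
    print(" ".join(vis_command("round", "ippo")))
    print(" ".join(vis_command("tollgate", "cl", native_render=True)))
```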

We hope you enjoy the interesting behaviors learned in this work! Please feel free to contact us if you have any questions, thanks!

Citation

@misc{peng2021learning,
      title={Learning to Simulate Self-Driven Particles System with Coordinated Policy Optimization}, 
      author={Zhenghao Peng and Quanyi Li and Ka Ming Hui and Chunxiao Liu and Bolei Zhou},
      year={2021},
      eprint={2110.13827},
      archivePrefix={arXiv},
      primaryClass={cs.LG}
}
