Conjugated Discrete Distributions for Distributional Reinforcement Learning (C2D)

Last update: Jan 11, 2022

Related tags

Deep Learning c2d

Overview

Conjugated Discrete Distributions for Distributional Reinforcement Learning (C2D)

Code & Data Appendix for Conjugated Discrete Distributions for Distributional Reinforcement Learning.

Björn Lindenberg, Jonas Nordqvist, Karl-Olof Lindahl

Citation

If you use C2D in your research we ask you to please cite the following:

@misc{lindenberg2021conjugated,
      title={Conjugated Discrete Distributions for Distributional Reinforcement Learning}, 
      author={Björn Lindenberg and Jonas Nordqvist and Karl-Olof Lindahl},
      year={2021},
      eprint={2112.07424},
      archivePrefix={arXiv},
      primaryClass={cs.LG}
}

Data

Agent scores are available in the data folder.
Raw experiment data for each seed is available in the folder data/supplementary.
Each seed was run on a VM Ubuntu 20.04 server with 64GB RAM, a single Nvidia Quadro P4000 GPU and TensorFlow 2.5.

Code

The C++20 source code that handles ALE and transition buffering resides in src.
The agent code, written in TensorFlow/Python (with algorithms), can be viewed in c2d.
Requires cuDNN, TensorFlow 2.X, python3, The Arcade Learning Environment, C++20 and LZ4. For a comprehensive view of dependencies, have a look at our VM setup files in install_scripts.

Atari Games

To avoid legal issues, our Atari 2600 rom file directory ale_roms is left empty. However the corresponding binaries are widely available for import from elsewhere, e.g., Breakout or breakout.bin can be extracted from the atari-py Python package.

Library

The directory ale_roms needs to be populated by the relevant binaries of different Atari games. ALE's checksum file md5.txt for checking binary compatibility is present in the root directory.
The initial library setup or any changes to settings.cmake will require compilation by
```
bash build_lib.sh
```
One can train for one iteration (1M frames) in Breakout with:
```
python3 run.py --game breakout --tag test --iterations 1
```

Conjugated Discrete Distributions for Distributional Reinforcement Learning (C2D)

Related tags

Overview

Conjugated Discrete Distributions for Distributional Reinforcement Learning (C2D)

Citation

Data

Code

Atari Games

Library

Figures

Performance Profile (Deep reinforcement learning at the edge of the statistical precipice, Agarwal et al. 2021)

Sampling Efficiency: Mean and Median

Training Graphs

Strong/Weak Examples

Support Evolution

Owner

TransReID: Transformer-based Object Re-Identification

A New Open-Source Off-road Environment for Benchmark Generalization of Autonomous Driving

[CVPR'2020] DeepDeform: Learning Non-rigid RGB-D Reconstruction with Semi-supervised Data

Python code to generate art with Generative Adversarial Network

Neural-PIL: Neural Pre-Integrated Lighting for Reflectance Decomposition - NeurIPS2021

一个多模态内容理解算法框架，其中包含数据处理、预训练模型、常见模型以及模型加速等模块。

A tutorial showing how to train, convert, and run TensorFlow Lite object detection models on Android devices, the Raspberry Pi, and more!

HyDiff: Hybrid Differential Software Analysis

Hierarchical Aggregation for 3D Instance Segmentation (ICCV 2021)

Supervised forecasting of sequential data in Python.

Official pytorch implementation of the AAAI 2021 paper Semantic Grouping Network for Video Captioning

PyTorch implementation of federated learning framework based on the acceleration of global momentum

Context-Aware Image Matting for Simultaneous Foreground and Alpha Estimation

Balancing Principle for Unsupervised Domain Adaptation

Efficient face emotion recognition in photos and videos

基于Flask开发后端、VUE开发前端框架，在WEB端部署YOLOv5目标检测模型

Efficient Conformer: Progressive Downsampling and Grouped Attention for Automatic Speech Recognition

This project deploys a yolo fastest model in the form of tflite on raspberry 3b+. The model is from another repository of mine called -Trash-Classification-Car

Exploring Simple Siamese Representation Learning

Pytorch implementation of DeePSiM