ppo_pytorch_cpp - an implementation of the proximal policy optimization algorithm for the C++ API of Pytorch

Last update: Dec 09, 2022

Related tags

Overview

PPO Pytorch C++

This is an implementation of the proximal policy optimization algorithm for the C++ API of Pytorch. It uses a simple TestEnvironment to test the algorithm. Below is a small visualization of the environment, the algorithm is tested in.

Fig. 1: The agent in testing mode.

Build

You first need to install PyTorch. For a clean installation from Anaconda, checkout this short tutorial, or this tutorial, to only install the binaries.

mkdir build
cd build
cmake -DCMAKE_PREFIX_PATH=/absolut/path/to/libtorch ..
make

Run

Run the executable with

cd build
./train_ppo

It should produce something like shown below.

Fig. 2: From left to right, the agent for successive epochs in training mode as it takes actions in the environment to reach the goal.

The algorithm can also be used in test mode, once trained. Therefore, run

cd build
./test_ppo

Visualization

The results are saved to data/data.csv and can be visualized by running python plot.py.

Owner

Martin Huber

Hi :), I'm Martin.

GitHub Repository

ppo_pytorch_cpp - an implementation of the proximal policy optimization algorithm for the C++ API of Pytorch

Related tags

Overview

PPO Pytorch C++

Build

Run

Visualization

Owner

Martin Huber

Pseudo-rng-app - whos needs science to make a random number when you have pseudoscience?

Run containerized, rootless applications with podman

Official codebase for Legged Robots that Keep on Learning: Fine-Tuning Locomotion Policies in the Real World

A Differentiable Recipe for Learning Visual Non-Prehensile Planar Manipulation

On the model-based stochastic value gradient for continuous reinforcement learning

Deep Sea Treasure Environment for Multi-Objective Optimization Research

A package for "Procedural Content Generation via Reinforcement Learning" OpenAI Gym interface.

Revealing and Protecting Labels in Distributed Training

Yolov5 deepsort inference，使用YOLOv5+Deepsort实现车辆行人追踪和计数，代码封装成一个Detector类，更容易嵌入到自己的项目中

EGNN - Implementation of E(n)-Equivariant Graph Neural Networks, in Pytorch

Dark Finix: All in one hacking framework with almost 100 tools

This repository contains the segmentation user interface from the OpenSurfaces project, extracted as a lightweight tool

Riemannian Convex Potential Maps

4th place solution for the SIGIR 2021 challenge.

Liquid Warping GAN with Attention: A Unified Framework for Human Image Synthesis

Neighborhood Contrastive Learning for Novel Class Discovery

Streamlit Tutorial (ex: stock price dashboard, cartoon-stylegan, vqgan-clip, stylemixing, styleclip, sefa)

It's final year project of Diploma Engineering. This project is based on Computer Vision.

TGRNet: A Table Graph Reconstruction Network for Table Structure Recognition

Implementation of the paper "Language-agnostic representation learning of source code from structure and context".