Pytorch implementation of Distributed Proximal Policy Optimization

Last update: Jan 05, 2023

Related tags

Overview

Pytorch-DPPO

Pytorch implementation of Distributed Proximal Policy Optimization: https://arxiv.org/abs/1707.02286 Using PPO with clip loss (from https://arxiv.org/pdf/1707.06347.pdf).

I finally fixed what was wrong with the gradient descent step, using previous log-prob from rollout batches. At least ppo.py is fixed, the rest is going to be corrected as well very soon.

In the following example I was not patient enough to wait for million iterations, I just wanted to check if the model is properly learning:

Progress of single PPO:

InvertedPendulum

InvertedDoublePendulum

HalfCheetah

hopper (PyBullet)

halfcheetah (PyBullet)

Progress of DPPO (4 agents) [TODO]

Acknowledgments

The structure of this code is based on https://github.com/ikostrikov/pytorch-a3c.

Hyperparameters and loss computation has been taken from https://github.com/openai/baselines

Owner

Alexis David Jacq

GitHub Repository

Pretrained ConvNets for pytorch: NASNet, ResNeXt, ResNet, InceptionV4, InceptionResnetV2, Xception, DPN, etc.

Pretrained models for Pytorch (Work in progress) The goal of this repo is: to help to reproduce research papers results (transfer learning setups for

8.7k Dec 31, 2022

Reformer, the efficient Transformer, in Pytorch

Reformer, the Efficient Transformer, in Pytorch This is a Pytorch implementation of Reformer https://openreview.net/pdf?id=rkgNKkHtvB It includes LSH

1.8k Jan 06, 2023

A Pytorch Implementation for Compact Bilinear Pooling.

CompactBilinearPooling-Pytorch A Pytorch Implementation for Compact Bilinear Pooling. Adapted from tensorflow_compact_bilinear_pooling Prerequisites I

169 Dec 23, 2022

High-fidelity performance metrics for generative models in PyTorch

5 Oct 24, 2021

PyTorch Extension Library of Optimized Scatter Operations

PyTorch Scatter Documentation This package consists of a small extension library of highly optimized sparse update (scatter and segment) operations fo

1.2k Jan 07, 2023

lookahead optimizer (Lookahead Optimizer: k steps forward, 1 step back) for pytorch

lookahead optimizer for pytorch PyTorch implement of Lookahead Optimizer: k steps forward, 1 step back Usage: base_opt = torch.optim.Adam(model.parame

318 Dec 09, 2022

On the Variance of the Adaptive Learning Rate and Beyond

RAdam On the Variance of the Adaptive Learning Rate and Beyond We are in an early-release beta. Expect some adventures and rough edges. Table of Conte

2.5k Dec 27, 2022

A very simple and small path tracer written in pytorch meant to be run on the GPU

MentisOculi Pytorch Path Tracer A very simple and small path tracer written in pytorch meant to be run on the GPU Why use pytorch and not some other c

222 Dec 01, 2022

PyTorch to TensorFlow Lite converter

140 Dec 13, 2022

A lightweight wrapper for PyTorch that provides a simple declarative API for context switching between devices, distributed modes, mixed-precision, and PyTorch extensions.

56 Sep 13, 2022

Code for paper "Energy-Constrained Compression for Deep Neural Networks via Weighted Sparse Projection and Layer Input Masking"

model_based_energy_constrained_compression Code for paper "Energy-Constrained Compression for Deep Neural Networks via Weighted Sparse Projection and

16 Jun 15, 2022

Pytorch implementation of Distributed Proximal Policy Optimization

Related tags

Overview

Pytorch-DPPO

Progress of single PPO:

Acknowledgments

Owner

Alexis David Jacq

Pretrained ConvNets for pytorch: NASNet, ResNeXt, ResNet, InceptionV4, InceptionResnetV2, Xception, DPN, etc.

Reformer, the efficient Transformer, in Pytorch

A Pytorch Implementation for Compact Bilinear Pooling.

High-fidelity performance metrics for generative models in PyTorch

PyTorch Extension Library of Optimized Scatter Operations

lookahead optimizer (Lookahead Optimizer: k steps forward, 1 step back) for pytorch

On the Variance of the Adaptive Learning Rate and Beyond

A very simple and small path tracer written in pytorch meant to be run on the GPU

PyTorch to TensorFlow Lite converter

A lightweight wrapper for PyTorch that provides a simple declarative API for context switching between devices, distributed modes, mixed-precision, and PyTorch extensions.

Code for paper "Energy-Constrained Compression for Deep Neural Networks via Weighted Sparse Projection and Layer Input Masking"

3D-RETR: End-to-End Single and Multi-View3D Reconstruction with Transformers

Differentiable ODE solvers with full GPU support and O(1)-memory backpropagation.

This is an differentiable pytorch implementation of SIFT patch descriptor.

You like pytorch? You like micrograd? You love tinygrad! ❤️

A pure Python implementation of Compact Bilinear Pooling and Count Sketch for PyTorch.

A PyTorch implementation of Learning to learn by gradient descent by gradient descent

A simple way to train and use PyTorch models with multi-GPU, TPU, mixed-precision

A collection of extensions and data-loaders for few-shot learning & meta-learning in PyTorch

Differentiable SDE solvers with GPU support and efficient sensitivity analysis.