PPO Lagrangian in JAX

Last update: Sep 14, 2022

Overview

This repository implements PPO Lagrangian in JAX. The implementation is tested on the safety-gym benchmark.
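For readers unfamiliar with the method, the following is a minimal sketch of the general PPO Lagrangian idea: the clipped PPO surrogate is penalized by a cost surrogate weighted by a Lagrange multiplier, and the multiplier is increased whenever the measured episode cost exceeds a limit. It is not taken from this repository's code. The function names, argument names, the `(1 + lam)` normalization, and the default hyperparameters are all illustrative assumptions.

```python
import jax
import jax.numpy as jnp


def ppo_lagrangian_loss(log_ratio, reward_adv, cost_adv, lam, clip_eps=0.2):
    """Hypothetical helper: clipped PPO surrogate with a cost penalty (minimized)."""
    ratio = jnp.exp(log_ratio)  # pi_new(a|s) / pi_old(a|s)
    clipped = jnp.clip(ratio, 1.0 - clip_eps, 1.0 + clip_eps)
    # Standard PPO clipped objective on the reward advantage.
    reward_obj = jnp.mean(jnp.minimum(ratio * reward_adv, clipped * reward_adv))
    # Unclipped surrogate for the cost advantage (an assumption of this sketch).
    cost_obj = jnp.mean(ratio * cost_adv)
    # Maximize reward minus lam * cost; dividing by (1 + lam) is one common
    # way to keep the objective's scale bounded as lam grows.
    return -(reward_obj - lam * cost_obj) / (1.0 + lam)


def update_multiplier(lam, mean_episode_cost, cost_limit, lr=0.05):
    """Hypothetical helper: projected gradient ascent, keeping lam non-negative."""
    return jnp.maximum(lam + lr * (mean_episode_cost - cost_limit), 0.0)


# Toy batch: the new policy equals the old one, so every ratio is 1.
log_ratio = jnp.zeros(4)
reward_adv = jnp.array([1.0, -1.0, 2.0, 0.0])
cost_adv = jnp.array([0.5, 0.5, -0.5, 0.5])

loss = ppo_lagrangian_loss(log_ratio, reward_adv, cost_adv, lam=1.0)
grads = jax.grad(ppo_lagrangian_loss)(log_ratio, reward_adv, cost_adv, 1.0)
new_lam = update_multiplier(0.0, mean_episode_cost=30.0, cost_limit=25.0)
```

In a full training loop, `log_ratio` would come from the policy network and the gradient would be taken with respect to its parameters. The toy arrays here only show the shapes involved.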

Usage

Install the dependencies:

pip install -r requirements.txt

Install safety-gym (after installing mujoco-py):

git clone https://github.com/openai/safety-gym.git
cd safety-gym
pip install -e .

Train the PPO agent:

python train.py --env=Safexp-CarGoal1-v0

Results are stored in the logs folder. To plot them, run:

python plot.py

Citation

If you find the code helpful, please cite:

@misc{ppolag,
  author = {Suri, Karush},
  title = {{PPO Lagrangian in JAX.}},
  url = {https://github.com/karush17/jax-ppo},
  year = {2021}
}

Owner

Karush Suri

Deep Learning Researcher at Huawei Noah's Ark Lab, Toronto.
