Robot Reinforcement Learning on the Constraint Manifold

Last update: Dec 05, 2022

Related tags

Deep Learning rl_on_manifold

Overview

Acting on the Tangent Space of the Constraint Manifold

Implementation of "Robot Reinforcement Learning on the Constraint Manifold"

[paper] [website]

Install

pip install -e .

Run Examples

cd examples

CircularMotion Environment.

Environment options [A, E, T], algorithms options [TRPO, PPO, SAC, DDPG, TD3]

python circle_exp.py --render --env A --alg TRPO

PlanarAirHockey Environment.

Environment options [H, D, UH, UD], algorithms options [TRPO, PPO, SAC, DDPG, TD3]

python planar_air_hockey_exp.py --debug-gui --env H --alg SAC

IiwaAirHockey Environment.

Environment options [7H, RMP], algorithms options [TRPO, PPO, SAC, DDPG, TD3]

python iiwa_air_hockey_exp.py --debug-gui --env 7H --alg SAC

CollisionAvoidance Environment.

Environment options [C], algorithms options [TRPO, PPO, SAC, DDPG, TD3]

python collision_avoidance_exp.py --render --env C --alg SAC

Bibtex

@inproceedings{CORL_2021_Learning_on_the_Manifold,
  author =      "Liu, P. and  Tateo D. and  Bou-Ammar, H. and  Peters, J.",
  year =        "2021",
  title =       "Robot Reinforcement Learning on the Constraint Manifold",
  booktitle =   "Proceedings of the Conference on Robot Learning (CoRL)",
  key =	        "robot learning, constrained reinforcement learning, safe exploration",
}

Robot Reinforcement Learning on the Constraint Manifold

Related tags

Overview

Acting on the Tangent Space of the Constraint Manifold

Install

Run Examples

CircularMotion Environment.

PlanarAirHockey Environment.

IiwaAirHockey Environment.

CollisionAvoidance Environment.

Bibtex

Owner

Pre-Trained Image Processing Transformer (IPT)

Head2Toe: Utilizing Intermediate Representations for Better OOD Generalization

Implementation of paper: "Image Super-Resolution Using Dense Skip Connections" in PyTorch

Official code for "Towards An End-to-End Framework for Flow-Guided Video Inpainting" (CVPR2022)

Awesome Remote Sensing Toolkit based on PaddlePaddle.

Repo for paper "Dynamic Placement of Rapidly Deployable Mobile Sensor Robots Using Machine Learning and Expected Value of Information"

PyTorch implementations of algorithms for density estimation

Reinforcement Learning via Supervised Learning

Official implementation of Monocular Quasi-Dense 3D Object Tracking

Implementation of "Debiasing Item-to-Item Recommendations With Small Annotated Datasets" (RecSys '20)

Code for ICLR2018 paper: Improving GAN Training via Binarized Representation Entropy (BRE) Regularization - Y. Cao · W Ding · Y.C. Lui · R. Huang

PyTorch code accompanying the paper "Landmark-Guided Subgoal Generation in Hierarchical Reinforcement Learning" (NeurIPS 2021).

A Temporal Extension Library for PyTorch Geometric

Ranking Models in Unlabeled New Environments （iccv21）

Neural Turing Machine (NTM) & Differentiable Neural Computer (DNC) with pytorch & visdom

Official implementation of UTNet: A Hybrid Transformer Architecture for Medical Image Segmentation

Enhancing Aspect-Based Sentiment Analysis with Supervised Contrastive Learning.

Solve a Rubiks Cube using Python Opencv and Kociemba module

Official implementation of our paper "Learning to Bootstrap for Combating Label Noise"

Exadel CompreFace is a free and open-source face recognition GitHub project