RRL: Resnet as representation for Reinforcement Learning

Last update: Dec 07, 2022

Related tags

Overview

Quick Links

RRL: Resnet as representation for Reinforcement Learning

Resnet as representation for Reinforcement Learning (RRL) is a simple yet effective approach for training behaviors directly from visual inputs. We demonstrate that features learned by standard image classification models are general towards different task, robust to visual distractors, and when used in conjunction with standard Imitation Learning or Reinforcement Learning pipelines can efficiently acquire behaviors directly from proprioceptive inputs.

Final Behaviors acquired using RRL on ADROIT benchmark tasks (left to right) (a) Opening a door (b) Hammering a nail (c) Pen-twirling (d)) Object relocation

Setup

RRL codebase can be installed by cloning this repository. Note that it uses git submodules to resolve dependencies. Please follow the steps as below to install correctly.

Clone this repository along with the submodules

git clone --recursive https://github.com/facebookresearch/RRL.git

Install the package using conda. The dependencies (apart from mujoco_py) are listed in env.yml
```
conda env create -f env.yml

conda activate rrl
```
The environment require MuJoCo as a dependency. You may need to obtain a license and follow the setup instructions for mujoco_py. Setting up mujoco_py with GPU support is highly recommended.

Install mj_envs and mjrl repositories.

cd RRL
pip install -e mjrl/.
pip install -e mj_envs/.
pip install -e .

Additionally, it requires the demonstrations published by hand_dapg

Running Instructions

First step is to convert the observations of demonstrations provided by hand_dapg to the encoder feature space. An example script is provided here. Note the script saves the demonstrations in a .pickle format inside the rrl/demonstrations directory.

For the mj_envs tasks :

python convertDemos.py --env_name hammer-v0 --encoder_type resnet34 -c top -d

python convertDemos.py --env_name door-v0 --encoder_type resnet34 -c top -d

python convertDemos.py --env_name pen-v0 --encoder_type resnet34 -c vil_camera -d

python convertDemos.py --env_name relocate-v0 --encoder_type resnet34 -c cam1 -c cam2 -c cam3 -d

Launching RRL experiments using DAPG.

An example launching script is provided job_script.py in the examples/ directory and the configs used are stored in the examples/config/ directory. Note : Hydra configs are used.

python job_script.py  demo_file=
     
       --config-name hammer_dapg

python job_script.py  demo_file=
     
       --config-name door_dapg

python job_script.py  demo_file=
     
       --config-name pen_dapg

python job_script.py  demo_file=
     
       --config-name relocate_dapg

RRL: Resnet as representation for Reinforcement Learning

Related tags

Overview

Quick Links

RRL: Resnet as representation for Reinforcement Learning

Setup

Running Instructions

Owner

Meta Research

codes for Image Inpainting with External-internal Learning and Monochromic Bottleneck

Lightweight tool to perform MITM attack on local network

Definition of a business problem according to Wilson Lower Bound Score and Time Based Average Rating

Official codebase for Decision Transformer: Reinforcement Learning via Sequence Modeling.

STEAL - Learning Semantic Boundaries from Noisy Annotations (CVPR 2019)

Data loaders and abstractions for text and NLP

DeepRec is a recommendation engine based on TensorFlow.

Resilience from Diversity: Population-based approach to harden models against adversarial attacks

A parametric soroban written with CADQuery.

Beyond a Gaussian Denoiser: Residual Learning of Deep CNN for Image Denoising

A PyTorch implementation of NeRF (Neural Radiance Fields) that reproduces the results.

A sample pytorch Implementation of ACL 2021 research paper "Learning Span-Level Interactions for Aspect Sentiment Triplet Extraction".

Official implementation for (Refine Myself by Teaching Myself : Feature Refinement via Self-Knowledge Distillation, CVPR-2021)

Dense Gaussian Processes for Few-Shot Segmentation

In this project we investigate the performance of the SetCon model on realistic video footage. Therefore, we implemented the model in PyTorch and tested the model on two example videos.

Source code for CVPR 2020 paper "Learning to Forget for Meta-Learning"

List some popular DeepFake models e.g. DeepFake, FaceSwap-MarekKowal, IPGAN, FaceShifter, FaceSwap-Nirkin, FSGAN, SimSwap, CihaNet, etc.

ActNN: Reducing Training Memory Footprint via 2-Bit Activation Compressed Training

Code release for the paper “Worldsheet Wrapping the World in a 3D Sheet for View Synthesis from a Single Image”, ICCV 2021.

PyTorch implementation of Deep HDR Imaging via A Non-Local Network (TIP 2020).