AAAI-22 paper: SimSR: Simple Distance-based State Representation for Deep Reinforcement Learning

Last update: Dec 19, 2022

SimSR

Code and dataset for the paper SimSR: Simple Distance-based State Representation for Deep Reinforcement Learning (AAAI-22).

Requirements

We assume you have access to a GPU that can run CUDA 11. All of the dependencies are listed in the conda_env.yml file. Create the environment with:

conda env create -f conda_env.yml

After the installation finishes, you can activate the environment with:

conda activate simsr

Instructions

To train a SimSR agent on the cartpole swingup task from image-based observations, run bash run.sh from the root of this directory. The run.sh file contains the following command, which you can modify to try different environments or hyperparameters.

DOMAIN=cartpole
TASK=swingup
SEED=1

MUJOCO_GL="egl" CUDA_VISIBLE_DEVICES=0 nohup python -u train.py \
	--domain_name ${DOMAIN} \
	--task_name ${TASK} \
	--encoder_type pixel \
	--action_repeat 4 \
	--pre_transform_image_size 84 \
	--image_size 84 \
	--work_dir ./tmp \
	--agent simsr_sac \
	--frame_stack 3 \
	--seed ${SEED} \
	--critic_lr 1e-3 \
	--actor_lr 1e-3 \
	--eval_freq 10000 \
	--batch_size 128 \
	--num_train_steps 260000 > ${DOMAIN}_${TASK}_${SEED}.log &
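To sweep several seeds or tasks without editing run.sh by hand, the same command can be assembled programmatically. The following is a minimal sketch, not part of the repository: build_train_command is a hypothetical helper, and it assumes train.py accepts exactly the flags shown in run.sh above.

```python
import shlex

# Flag values copied from run.sh above. The helper below is hypothetical
# and only illustrates how the command line is put together.
DEFAULT_FLAGS = {
    "encoder_type": "pixel",
    "action_repeat": 4,
    "pre_transform_image_size": 84,
    "image_size": 84,
    "work_dir": "./tmp",
    "agent": "simsr_sac",
    "frame_stack": 3,
    "critic_lr": "1e-3",
    "actor_lr": "1e-3",
    "eval_freq": 10000,
    "batch_size": 128,
    "num_train_steps": 260000,
}


def build_train_command(domain, task, seed, **overrides):
    """Return the train.py argument list for a single run."""
    flags = {"domain_name": domain, "task_name": task, **DEFAULT_FLAGS, "seed": seed}
    # Reject names that run.sh does not use, so typos fail early.
    unknown = set(overrides) - set(flags)
    if unknown:
        raise ValueError(f"unknown flags: {sorted(unknown)}")
    flags.update(overrides)
    cmd = ["python", "-u", "train.py"]
    for name, value in flags.items():
        cmd += [f"--{name}", str(value)]
    return cmd


if __name__ == "__main__":
    # Print one shell-ready command per seed.
    for seed in (1, 2, 3):
        print(shlex.join(build_train_command("cartpole", "swingup", seed)))
```

Each printed line can be pasted into a shell or passed to a job scheduler. The MUJOCO_GL and CUDA_VISIBLE_DEVICES variables still need to be set in the environment, as in run.sh.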

Note that the MuJoCo Python bindings support three different OpenGL rendering backends: "glfw", "egl", or "osmesa". You can select a particular backend by setting the MUJOCO_GL environment variable to one of them.
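When launching from Python rather than from run.sh, the backend can be chosen by setting the variable in the process environment. The sketch below assumes the variable is read when the MuJoCo-based rendering library is first imported, so it has to be set before that import. select_mujoco_backend is a hypothetical helper, not something the repository provides.

```python
import os

# Backend names as listed in the note above.
VALID_BACKENDS = ("glfw", "egl", "osmesa")


def select_mujoco_backend(backend):
    """Set MUJOCO_GL for this process and return the chosen backend.

    Call this before importing any MuJoCo-based library.
    """
    if backend not in VALID_BACKENDS:
        raise ValueError(f"MUJOCO_GL must be one of {VALID_BACKENDS}, got {backend!r}")
    os.environ["MUJOCO_GL"] = backend
    return backend


# Example: use the same backend as run.sh.
select_mujoco_backend("egl")
```

Validating the name up front turns a misspelled backend into an immediate error instead of a rendering failure later in training.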

To visualize training progress with TensorBoard, run:

tensorboard --logdir ./path/to/your/log --port 6006

References

Please cite the paper SimSR: Simple Distance-based State Representation for Deep Reinforcement Learning if you find the resources in this repository useful.

