Discovering and Achieving Goals via World Models

Last update: Dec 22, 2022

[Project Website] [Benchmark Code] [Video (2min)] [Oral Talk (13min)] [Paper]

Russell Mendonca*¹, Oleh Rybkin*², Kostas Daniilidis², Danijar Hafner³,⁴, Deepak Pathak¹
(* equal contribution, random order)

¹Carnegie Mellon University
²University of Pennsylvania
³Google Research, Brain Team
⁴University of Toronto

Official implementation of the LEXA agent from the paper "Discovering and Achieving Goals via World Models".

Setup

Create the conda environment by running:

conda env create -f environment.yml

Clone the lexa-benchmark repo and update the Python path:
export PYTHONPATH= /lexa:

Export the following variables for rendering:
export MUJOCO_RENDERER=egl; export MUJOCO_GL=egl
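When training is launched from a Python script or notebook rather than a shell, the same two variables can be set from Python. The sketch below is one possible way to do that, not part of the repo: `apply_render_env` is a hypothetical helper, and it assumes the variables are set before any rendering library is imported, since such libraries typically read them at import time.

```python
import os

# Variable names and values copied from the export line above.
RENDER_ENV = {"MUJOCO_RENDERER": "egl", "MUJOCO_GL": "egl"}


def apply_render_env(environ=os.environ):
    """Set the rendering variables, keeping any value that is already set.

    Hypothetical helper for illustration; returns the effective values.
    """
    for name, value in RENDER_ENV.items():
        # setdefault leaves an existing user choice (e.g. osmesa) untouched.
        environ.setdefault(name, value)
    return {name: environ[name] for name in RENDER_ENV}


if __name__ == "__main__":
    # Call this before importing anything that renders.
    print(apply_render_env())
```

Using `setdefault` rather than plain assignment means a value exported in the shell still takes precedence over the defaults in the script.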

Training

First, activate the environment: source activate lexa

For training, run:

export CUDA_VISIBLE_DEVICES=<gpu_id>
python train.py --configs defaults <method> --task <task> --logdir <logdir>

where the method can be lexa_temporal, lexa_cosine, ddl, diayn, or gcsl.
Supported tasks are dmc_walker_walk, dmc_quadruped_run, robobin, kitchen, and joint.
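To compare several methods on one task, it can help to generate the command lines programmatically and catch misspelled names before a run starts. The following is a minimal sketch under stated assumptions: `build_train_command` is a hypothetical helper that is not part of the repo, the method name is assumed to be passed as an extra entry after `--configs defaults`, and the log directory layout is made up for illustration.

```python
import shlex

# Method and task names copied from the lists above.
METHODS = ("lexa_temporal", "lexa_cosine", "ddl", "diayn", "gcsl")
TASKS = ("dmc_walker_walk", "dmc_quadruped_run", "robobin", "kitchen", "joint")


def build_train_command(method, task, logdir):
    """Return a train.py invocation as an argument list.

    Hypothetical helper; assumes the method is given as an additional
    config name after `--configs defaults`.
    """
    if method not in METHODS:
        raise ValueError(f"unknown method {method!r}; expected one of {METHODS}")
    if task not in TASKS:
        raise ValueError(f"unknown task {task!r}; expected one of {TASKS}")
    return [
        "python", "train.py",
        "--configs", "defaults", method,
        "--task", task,
        "--logdir", logdir,
    ]


if __name__ == "__main__":
    # Print one command per method for a single task; nothing is executed.
    for method in METHODS:
        # The logs/<task>/<method> layout is an illustrative assumption.
        cmd = build_train_command(method, "kitchen", f"logs/kitchen/{method}")
        print(shlex.join(cmd))
```

Returning a list rather than a string keeps the arguments safe to pass to `subprocess.run` without shell quoting, while `shlex.join` gives a copy-pasteable line for manual runs.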

To view the graphs and GIFs during training, run tensorboard --logdir <logdir>

Bibtex

If you find this code useful, please cite:

@misc{lexa2021,
    title={Discovering and Achieving Goals via World Models},
    author={Mendonca, Russell and Rybkin, Oleh and
    Daniilidis, Kostas and Hafner, Danijar and Pathak, Deepak},
    year={2021},
    booktitle={NeurIPS}
}

Acknowledgements

This code was developed using Dreamer V2 and Plan2Explore.

Owner

Oleg Rybkin
