Discovering and Achieving Goals via World Models

Last update: Dec 22, 2022

Related tags

Overview

Discovering and Achieving Goals via World Models

[Project Website] [Benchmark Code] [Video (2min)] [Oral Talk (13min)] [Paper]

Russell Mendonca*¹, Oleh Rybkin*², Kostas Daniilidis², Danijar Hafner^3,4, Deepak Pathak¹
(* equal contribution, random order)

¹Carnegie Mellon University
²University of Pennsylvania
³Google Research, Brain Team
⁴University of Toronto

Official implementation of the Lexa agent from the paper Discovering and Achieving Goals via World Models.

Setup

Create the conda environment by running :

conda env create -f environment.yml

Clone the lexa-benchmark repo, and modify the python path
export PYTHONPATH= /lexa:

Export the following variables for rendering
export MUJOCO_RENDERER=egl; export MUJOCO_GL=egl

Training

First source the environment : source activate lexa

For training, run :

export CUDA_VISIBLE_DEVICES=
   
      
python train.py --configs defaults 
    
      --task 
     
       --logdir

where method can be lexa_temporal, lexa_cosine, ddl, diayn or gcsl
Supported tasks are dmc_walker_walk, dmc_quadruped_run, robobin, kitchen, joint

To view the graphs and gifs during training, run tensorboard --logdir

Bibtex

If you find this code useful, please cite:

@misc{lexa2021,
    title={Discovering and Achieving Goals via World Models},
    author={Mendonca, Russell and Rybkin, Oleh and
    Daniilidis, Kostas and Hafner, Danijar and Pathak, Deepak},
    year={2021},
    Booktitle={NeurIPS}
}

Acknowledgements

This code was developed using Dreamer V2 and Plan2Explore.

Discovering and Achieving Goals via World Models

Related tags

Overview

Discovering and Achieving Goals via World Models

[Project Website] [Benchmark Code] [Video (2min)] [Oral Talk (13min)] [Paper]

Setup

Training

Bibtex

Acknowledgements

Owner

Oleg Rybkin

Membership Inference Attack against Graph Neural Networks

🌾 PASTIS 🌾 Panoptic Agricultural Satellite TIme Series

GEP (GDB Enhanced Prompt) - a GDB plug-in for GDB command prompt with fzf history search, fish-like autosuggestions, auto-completion with floating window, partial string matching in history, and more!

Linescanning - Package for (pre)processing of anatomical and (linescanning) fMRI data

Interpretation of T cell states using reference single-cell atlases

This library provides an abstraction to perform Model Versioning using Weight & Biases.

Final project for machine learning (CSC 590). Detection of hepatitis C and progression through blood samples.

PyElastica is the Python implementation of Elastica, an open-source software for the simulation of assemblies of slender, one-dimensional structures using Cosserat Rod theory.

Taming Transformers for High-Resolution Image Synthesis

Code for the ICML 2021 paper "Bridging Multi-Task Learning and Meta-Learning: Towards Efficient Training and Effective Adaptation", Haoxiang Wang, Han Zhao, Bo Li.

Effective Use of Transformer Networks for Entity Tracking

PCACE: A Statistical Approach to Ranking Neurons for CNN Interpretability

Source code for paper: Knowledge Inheritance for Pre-trained Language Models

"Moshpit SGD: Communication-Efficient Decentralized Training on Heterogeneous Unreliable Devices", official implementation

A Python package for causal inference using Synthetic Controls

PiRapGenerator - Make anyone rap the digits of pi

Baselines for TrajNet++

基于Paddle框架的fcanet复现

🔎 Super-scale your images and run experiments with Residual Dense and Adversarial Networks.

The repository offers the official implementation of our paper in PyTorch.