Visual Adversarial Imitation Learning using Variational Models (VMAIL)

This is the official implementation of the NeurIPS 2021 paper.

Method

VMAIL simultaneously learns a variational dynamics model and trains an on-policy adversarial imitation learning algorithm in the latent space using only model-based rollouts. This allows for stable and sample efficient training, as well as zero-shot imitation learning by transfering the learned dynamics model

Instructions

Get dependencies:

conda env create -f vmail.yml
conda activate vmail
cd robel_claw/robel
pip install -e .

To train agents for each environmnet download the expert data from the provided link and run:

python3 -u vmail.py --logdir .logdir --expert_datadir expert_datadir

The training will generate tensorabord plots and GIFs in the log folder:

tensorboard --logdir ./logdir

Citation

If you find this code useful, please reference in your paper:

@article{rafailov2021visual,
      title={Visual Adversarial Imitation Learning using Variational Models}, 
      author={Rafael Rafailov and Tianhe Yu and Aravind Rajeswaran and Chelsea Finn},
      year={2021},
      journal={Neural Information Processing Systems}
}

Visual Adversarial Imitation Learning using Variational Models (VMAIL)

Related tags

Overview

Visual Adversarial Imitation Learning using Variational Models (VMAIL)

Method

Instructions

Citation

Owner

LegoDNN: a block-grained scaling tool for mobile vision systems

This project provides a stock market environment using OpenGym with Deep Q-learning and Policy Gradient.

Code repo for EMNLP21 paper "Zero-Shot Information Extraction as a Unified Text-to-Triple Translation"

Code for Private Recommender Systems: How Can Users Build Their Own Fair Recommender Systems without Log Data? (SDM 2022)

Volsdf - Volume Rendering of Neural Implicit Surfaces

This is the repository for our paper Ditch the Gold Standard: Re-evaluating Conversational Question Answering

MediaPipeのPythonパッケージのサンプルです。2020/12/11時点でPython実装のある4機能(Hands、Pose、Face Mesh、Holistic)について用意しています。

This code is part of the reproducibility package for the SANER 2022 paper "Generating Clarifying Questions for Query Refinement in Source Code Search".

Machine Unlearning with SISA

A crossplatform menu bar application using mpv as DLNA Media Renderer.

COIN the currently largest dataset for comprehensive instruction video analysis.

Stroke-predictions-ml-model - Machine learning model to predict individuals chances of having a stroke

Code for reproducing our paper: LMSOC: An Approach for Socially Sensitive Pretraining

PiCIE: Unsupervised Semantic Segmentation using Invariance and Equivariance in clustering (CVPR2021)

A simple baseline for the 2022 IEEE GRSS Data Fusion Contest (DFC2022)

Open-source codebase for EfficientZero, from "Mastering Atari Games with Limited Data" at NeurIPS 2021.

PyTorch implementation for "Mining Latent Structures with Contrastive Modality Fusion for Multimedia Recommendation"

Multi-modal Text Recognition Networks: Interactive Enhancements between Visual and Semantic Features

Code for NeurIPS2021 submission "A Surrogate Objective Framework for Prediction+Programming with Soft Constraints"

PyTorch Implementation of NCSOFT's FastPitchFormant: Source-filter based Decomposed Modeling for Speech Synthesis