(NeurIPS '21 Spotlight) IQ-Learn: Inverse Q-Learning for Imitation

Last update: Dec 20, 2022

Related tags

Overview

Inverse Q-Learning (IQ-Learn)

Official code base for IQ-Learn: Inverse soft-Q Learning for Imitation, NeurIPS '21 Spotlight

IQ-Learn is an easy-to-use algorithm that's a drop-in replacement to methods like Behavior Cloning and GAIL, to boost your imitation learning pipelines!
Update: IQ-Learn was recently used to create the best AI agent for playing Minecraft. Placing #1 in NeurIPS MineRL Basalt Challenge using only human demos (Overall Leaderboard Rank #2)

[Project Page]

We introduce Inverse Q-Learning (IQ-Learn), a state-of-the-art novel framework for Imitation Learning (IL), that directly learns soft-Q functions from expert data. IQ-Learn enables non-adverserial imitation learning, working on both offline and online IL settings. It is performant even with very sparse expert data, and scales to complex image-based environments, surpassing prior methods by more than 3x. It is very simple to implement requiring ~15 lines of code on top of existing RL methods.

Inverse Q-Learning is theoretically equivalent to Inverse Reinforcement learning, i.e. learning rewards from expert data. However, it is much more powerful in practice. It admits very simple non-adverserial training and works on complete offline IL settings (without any access to the environment), greatly exceeding Behavior Cloning.

IQ-Learn is the successor to Adversarial Imitation Learning methods like GAIL (coming from the same lab).
It extends the theoretical framework for Inverse RL to non-adverserial and scalable learning, for the first-time showing guaranteed convergence.

Citation

@inproceedings{garg2021iqlearn,
title={IQ-Learn: Inverse soft-Q Learning for Imitation},
author={Divyansh Garg and Shuvam Chakraborty and Chris Cundy and Jiaming Song and Stefano Ermon},
booktitle={Thirty-Fifth Conference on Neural Information Processing Systems},
year={2021},
url={https://openreview.net/forum?id=Aeo-xqtb5p}
}

Key Advantages

✅ Drop-in replacement to Behavior Cloning
✅ Non-adverserial online IL (Successor to GAIL & AIRL)
✅ Simple to implement
✅ Performant with very sparse data (single expert demo)
✅ Scales to Complex Image Envs (SOTA on Atari and playing Minecraft)
✅ Recover rewards from envs

Usage

To install and use IQ-Learn check the instructions provided in the iq_learn folder.

Imitation

Reaching human-level performance on Atari with pure imitation:

Rewards

Recovering environment rewards on GridWorld:

Questions

Please feel free to email us if you have any questions.

Div Garg ([email protected])

(NeurIPS '21 Spotlight) IQ-Learn: Inverse Q-Learning for Imitation

Related tags

Overview

Inverse Q-Learning (IQ-Learn)

Citation

Key Advantages

Usage

Imitation

Rewards

Questions

Owner

Divyansh Garg

A pytorch &keras implementation and demo of Fastformer.

PyTorch implementation DRO: Deep Recurrent Optimizer for Structure-from-Motion

PyContinual (An Easy and Extendible Framework for Continual Learning)

Large-scale Hyperspectral Image Clustering Using Contrastive Learning, CIKM 21 Workshop

Unsupervised Semantic Segmentation by Contrasting Object Mask Proposals.

Survival analysis in Python

Tutorial in Python targeted at Epidemiologists. Will discuss the basics of analysis in Python 3

A map update dataset and benchmark

A visualization tool to show a TensorFlow's graph like TensorBoard

Hypernetwork-Ensemble Learning of Segmentation Probability for Medical Image Segmentation with Ambiguous Labels

Lab course materials for IEMBA 8/9 course "Coding and Artificial Intelligence"

A simple Rock-Paper-Scissors game using CV in python

Disagreement-Regularized Imitation Learning

Using OpenAI's CLIP to upscale and enhance images

BOVText: A Large-Scale, Multidimensional Multilingual Dataset for Video Text Spotting

A collection of metrics for evaluating timbre dissimilarity using the TorchMetrics API

fastgradio is a python library to quickly build and share gradio interfaces of your trained fastai models.

The code for 'Deep Residual Fourier Transformation for Single Image Deblurring'

Cancer-and-Tumor-Detection-Using-Inception-model - In this repo i am gonna show you how i did cancer/tumor detection in lungs using deep neural networks, specifically here the Inception model by google.

Customizable RecSys Simulator for OpenAI Gym