Offline Reinforcement Learning with Implicit Q-Learning

This repository contains the official implementation of "Offline Reinforcement Learning with Implicit Q-Learning" by Ilya Kostrikov, Ashvin Nair, and Sergey Levine.

If you use this code for your research, please consider citing the paper:

@article{kostrikov2021iql,
    title={Offline Reinforcement Learning with Implicit Q-Learning},
    author={Ilya Kostrikov and Ashvin Nair and Sergey Levine},
    year={2021},
    archivePrefix={arXiv},
    primaryClass={cs.LG}
}

How to run the code

Install dependencies

pip install -r requirements.txt

For GPU support, see the JAX installation instructions for CUDA.

Run training

Locomotion

python train_offline.py --env_name=halfcheetah-medium-expert-v2 --config=configs/mujoco_config.py

AntMaze

python train_offline.py --env_name=antmaze-large-play-v0 --config=configs/antmaze_config.py --eval_episodes=100 --eval_interval=100000

Kitchen and Adroit

python train_offline.py --env_name=pen-human-v0 --config=configs/kitchen_config.py
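
The three commands above differ only in the environment name, the config file, and (for AntMaze) two evaluation flags. As a minimal sketch, assuming you want to assemble these command lines programmatically, the helper below rebuilds them from a small table. The names `build_command`, `CONFIGS`, `EXTRA_FLAGS`, and the family labels are hypothetical and not part of this repository; only the flags and config paths shown in the commands above are used.

```python
import shlex

# Config file per benchmark family, copied from the commands above.
# The family labels themselves are hypothetical names chosen for this sketch.
CONFIGS = {
    "locomotion": "configs/mujoco_config.py",
    "antmaze": "configs/antmaze_config.py",
    "kitchen_adroit": "configs/kitchen_config.py",
}

# Extra flags that the AntMaze command above passes explicitly.
EXTRA_FLAGS = {
    "antmaze": ["--eval_episodes=100", "--eval_interval=100000"],
}


def build_command(family, env_name, python="python"):
    """Return the argument list for one train_offline.py run (hypothetical helper)."""
    if family not in CONFIGS:
        raise ValueError(f"unknown benchmark family: {family!r}")
    return [
        python,
        "train_offline.py",
        f"--env_name={env_name}",
        f"--config={CONFIGS[family]}",
        *EXTRA_FLAGS.get(family, []),
    ]


if __name__ == "__main__":
    runs = [
        ("locomotion", "halfcheetah-medium-expert-v2"),
        ("antmaze", "antmaze-large-play-v0"),
        ("kitchen_adroit", "pen-human-v0"),
    ]
    for family, env_name in runs:
        # Print only; pass the list to subprocess.run(...) to actually launch a run.
        print(shlex.join(build_command(family, env_name)))
```

The sketch only prints the commands, so it can be checked without the dependencies installed. Other environment names should work the same way as long as they are paired with the matching config file.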

Misc

The implementation is based on JAXRL.

