Offline Reinforcement Learning with Implicit Q-Learning

This repository contains the official implementation of Offline Reinforcement Learning with Implicit Q-Learning by Ilya Kostrikov, Ashvin Nair, and Sergey Levine.

If you use this code for your research, please consider citing the paper:

@article{kostrikov2021iql,
    title={Offline Reinforcement Learning with Implicit Q-Learning},
    author={Ilya Kostrikov and Ashvin Nair and Sergey Levine},
    year={2021},
    archivePrefix={arXiv},
    primaryClass={cs.LG}
}

How to run the code

Install dependencies

pip install -r requirements.txt

See instructions for CUDA.

Run training

Locomotion

python train_offline.py --env_name=halfcheetah-medium-expert-v2 --config=configs/mujoco_config.py

AntMaze

python train_offline.py --env_name=antmaze-large-play-v0 --config=configs/antmaze_config.py --eval_episodes=100 --eval_interval=100000

Kitchen and Adroit

python train_offline.py --env_name=pen-human-v0 --config=configs/kitchen_config.py

Misc

The implementation is based on JAXRL.

Offline Reinforcement Learning with Implicit Q-Learning

Related tags

Overview

Offline Reinforcement Learning with Implicit Q-Learning

How to run the code

Install dependencies

Run training

Misc

Owner

Ilya Kostrikov

Object Detection with YOLOv3

SANet: A Slice-Aware Network for Pulmonary Nodule Detection

A library for Deep Learning Implementations and utils

A Kitti Road Segmentation model implemented in tensorflow.

Organseg dags - The repository contains the codebase for multi-organ segmentation with directed acyclic graphs (DAGs) in CT.

HTSeq is a Python library to facilitate processing and analysis of data from high-throughput sequencing (HTS) experiments.

'Solving the sampling problem of the Sycamore quantum supremacy circuits

Simple codebase for flexible neural net training

Official implementation of "Accelerating Reinforcement Learning with Learned Skill Priors", Pertsch et al., CoRL 2020

Pop-Out Motion: 3D-Aware Image Deformation via Learning the Shape Laplacian (CVPR 2022)

Irrigation controller for Home Assistant

QTool: A Low-bit Quantization Toolbox for Deep Neural Networks in Computer Vision

JUSTICE: A Benchmark Dataset for Supreme Court’s Judgment Prediction

Interactive Terraform visualization. State and configuration explorer.

Dataset VSD4K includes 6 popular categories: game, sport, dance, vlog, interview and city.

ANN model for prediction a spatio-temporal distribution of supercooled liquid in mixed-phase clouds using Doppler cloud radar spectra.

UT-Sarulab MOS prediction system using SSL models

Compact Bilinear Pooling for PyTorch

This is a collection of all challenges in HKCERT CTF 2021

A vision library for performing sliced inference on large images/small objects