NovelD: A Simple yet Effective Exploration Criterion

Intro

This is an implementation of the method proposed in "NovelD: A Simple yet Effective Exploration Criterion" and "BeBold: Exploration Beyond the Boundary of Explored Regions".
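
For orientation, the criterion in the two papers rewards the agent for the clipped difference in novelty between the next state and the current one, and only the first time the next state is visited in an episode. The block below is a minimal sketch of that idea and is not the code in this repository. The names `noveld_reward`, `CountNovelty`, and `episode_rewards` are hypothetical, the count-based novelty `1/sqrt(N(s))` is a simple stand-in assumed here in place of a learned novelty estimate, and the default `alpha=0.5` is an assumption of the sketch.

```python
from collections import defaultdict


def noveld_reward(novelty_next, novelty_curr, first_visit_in_episode, alpha=0.5):
    """Clipped novelty difference, paid only on the first episodic visit.

    alpha is a scaling factor; the default here is an assumption of this sketch.
    """
    if not first_visit_in_episode:
        return 0.0
    return max(novelty_next - alpha * novelty_curr, 0.0)


class CountNovelty:
    """Toy novelty 1/sqrt(N(s)) from lifetime visit counts (assumed stand-in)."""

    def __init__(self):
        self.counts = defaultdict(int)

    def visit(self, state):
        self.counts[state] += 1

    def __call__(self, state):
        # Unvisited states are treated as if they had been seen once.
        return 1.0 / (max(self.counts.get(state, 0), 1) ** 0.5)


def episode_rewards(states, novelty, alpha=0.5):
    """Intrinsic rewards for each transition of one episode of hashable states."""
    seen = {states[0]}
    novelty.visit(states[0])
    rewards = []
    for s, s_next in zip(states, states[1:]):
        novelty.visit(s_next)
        first = s_next not in seen  # episodic first-visit indicator
        seen.add(s_next)
        rewards.append(noveld_reward(novelty(s_next), novelty(s), first, alpha))
    return rewards


if __name__ == "__main__":
    print(episode_rewards(["a", "b", "a", "c"], CountNovelty()))
```

In this toy episode the return to `"a"` earns nothing because it was already visited in the episode, while the step into `"c"` earns more than the step into `"b"` because `"a"` has become less novel by then.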

Citation

If you use this code in your own work, please cite our papers:

@article{zhang2021noveld,
  title={NovelD: A Simple yet Effective Exploration Criterion},
  author={Zhang, Tianjun and Xu, Huazhe and Wang, Xiaolong and Wu, Yi and Keutzer, Kurt and Gonzalez, Joseph E and Tian, Yuandong},
  journal={Advances in Neural Information Processing Systems},
  volume={34},
  year={2021}
}

@article{zhang2020bebold,
  title={BeBold: Exploration Beyond the Boundary of Explored Regions},
  author={Zhang, Tianjun and Xu, Huazhe and Wang, Xiaolong and Wu, Yi and Keutzer, Kurt and Gonzalez, Joseph E and Tian, Yuandong},
  journal={arXiv preprint arXiv:2012.08621},
  year={2020}
}

Installation

# Install Instructions
conda create -n noveld python=3.7
conda activate noveld
git clone git@github.com:tianjunz/NovelD.git
cd NovelD
pip install -r requirements.txt

Train NovelD on MiniGrid

OMP_NUM_THREADS=1 python main.py --model bebold --env MiniGrid-ObstructedMaze-2Dlhb-v0 --total_frames 500000000 --intrinsic_reward_coef 0.05 --entropy_cost 0.0005

Acknowledgements

Our vanilla RL algorithm is based on RIDE.

License

This code is under the CC-BY-NC 4.0 (Attribution-NonCommercial 4.0 International) license.
