Dynamic Bottleneck for Robust Self-Supervised Exploration

Last update: Nov 14, 2022

Dynamic Bottleneck

Introduction

This is a TensorFlow-based implementation of our paper "Dynamic Bottleneck for Robust Self-Supervised Exploration" (NeurIPS 2021).

Prerequisites

Python 3.6 or 3.7, tensorflow-gpu 1.x, tensorflow-probability, OpenAI Baselines, OpenAI Gym

Installation and Usage

Atari games

The following command should train a pure exploration agent on "Breakout" with default experiment parameters.

python run.py --env BreakoutNoFrameskip-v4

Atari games with Random-Box noise

The following command should train a pure exploration agent on "Breakout" with Random-Box noise.

python run.py --env BreakoutNoFrameskip-v4 --randomBoxNoise
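
For intuition only, here is a minimal sketch of what box-style observation noise could look like. It assumes that Random-Box noise means overwriting randomly placed rectangular regions of the frame with random pixel values. The function name `add_random_box_noise` and its parameters are hypothetical and are not taken from this repository, and plain Python lists stand in for image arrays to keep the example self-contained.

```python
import random


def add_random_box_noise(frame, num_boxes=1, box_h=2, box_w=2, rng=None):
    """Return a copy of `frame` with randomly placed boxes of random pixels.

    `frame` is a list of rows of 0-255 ints (a grayscale image).
    All names and defaults here are illustrative assumptions.
    """
    rng = rng or random.Random()
    height, width = len(frame), len(frame[0])
    noisy = [list(row) for row in frame]  # copy so the input stays untouched
    for _ in range(num_boxes):
        # Pick the top-left corner so the whole box fits inside the frame.
        top = rng.randint(0, height - box_h)
        left = rng.randint(0, width - box_w)
        for y in range(top, top + box_h):
            for x in range(left, left + box_w):
                noisy[y][x] = rng.randint(0, 255)
    return noisy


clean = [[0] * 6 for _ in range(6)]
noisy = add_random_box_noise(clean, rng=random.Random(0))
```

The pixels outside the box are unchanged, so the agent still sees most of the game state while a small region carries no usable information.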

Atari games with Gaussian noise

The following command should train a pure exploration agent on "Breakout" with Gaussian noise.

python run.py --env BreakoutNoFrameskip-v4 --pixelNoise
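
As a rough illustration of pixel-level Gaussian noise, the sketch below adds independent zero-mean Gaussian noise to every pixel and clips the result to the valid range. The function name `add_gaussian_pixel_noise` and the `sigma` default are assumptions made for this example and may differ from what this repository does. Plain Python lists are used to keep it self-contained; a real pipeline would more likely operate on NumPy arrays.

```python
import random


def add_gaussian_pixel_noise(frame, sigma=10.0, rng=None):
    """Return a copy of `frame` with i.i.d. Gaussian noise added per pixel.

    `frame` is a list of rows of 0-255 ints; `sigma` is the noise standard
    deviation in pixel units (an illustrative default, not a repo setting).
    """
    rng = rng or random.Random()
    noisy = []
    for row in frame:
        noisy_row = []
        for px in row:
            value = int(round(px + rng.gauss(0.0, sigma)))
            noisy_row.append(min(255, max(0, value)))  # clip to [0, 255]
        noisy.append(noisy_row)
    return noisy


frame = [[128] * 4 for _ in range(4)]
noisy = add_gaussian_pixel_noise(frame, sigma=10.0, rng=random.Random(0))
```

Unlike box noise, this perturbs every pixel, so a purely prediction-error-based curiosity signal can never be driven to zero on such observations.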

Atari games with sticky actions

The following command should train a pure exploration agent on "sticky Breakout", where the previous action is repeated with a probability of 0.25.

python run.py --env BreakoutNoFrameskip-v4 --stickyAtari
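
The sticky-action mechanism can be sketched as follows: at each step, with probability `p`, the environment ignores the agent's chosen action and repeats the previously executed one. The class name `StickyActions` and the choice of action 0 as the initial action are assumptions made for this example; the repository's own wrapper may be structured differently.

```python
import random


class StickyActions:
    """Repeats the previously executed action with probability `p`."""

    def __init__(self, p=0.25, seed=None):
        self.p = p
        self.rng = random.Random(seed)
        self.last_action = 0  # assumed initial action (often the no-op)

    def reset(self):
        """Forget the previous action at the start of an episode."""
        self.last_action = 0

    def step(self, action):
        """Return the action that is actually executed."""
        if self.rng.random() < self.p:
            action = self.last_action  # the agent's choice is ignored
        self.last_action = action
        return action


sticky = StickyActions(p=0.25, seed=0)
executed = [sticky.step(a) for a in [1, 2, 3, 1, 2, 3]]
```

Because the executed action is no longer fully determined by the agent's choice, the environment dynamics become stochastic, which is the setting this experiment is meant to probe.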

Baselines

ICM: We use the official code of "Curiosity-driven Exploration by Self-supervised Prediction, ICML 2017" and "Large-Scale Study of Curiosity-Driven Learning, ICLR 2019".
Disagreement: We use the official code of "Self-Supervised Exploration via Disagreement, ICML 2019".
CB: We use the official code of "Curiosity-Bottleneck: Exploration by Distilling Task-Specific Novelty, ICML 2019".

Owner

Bai Chenjia
