Multimodal Reinforcement Learning

JAX implementations of the following multimodal reinforcement learning approaches.

Dual-coding Episodic Memory from "Grounded Language Learning Fast and Slow"

The goal in this setting is for the agent to be presented with multiple objects with made up names following "This is a _____" statements and to then carry out an instruction such as "Move the wazzle to the table." This task requires the agent to learn long-term language and vision representations for concepts like "This is a" and objects that carry over between episodes such as "table" while also being able to learn one-shot representations of novel objects and their names.

Usage

Start by setting up the environment locally by running

poetry install
poetry shell

The learning environment depends on Docker and requires that the Docker Desktop program is running (on Mac). Once that's done you can run the default environment (fast mapping with 3 objects from the paper).

python fast_slow_learning/main.py

Solving reinforcement learning tasks which require language and vision

Related tags

Overview

Multimodal Reinforcement Learning

Usage

Owner

Henry Prior

(JMLR' 19) A Python Toolbox for Scalable Outlier Detection (Anomaly Detection)

Recurrent Neural Network Tutorial, Part 2 - Implementing a RNN in Python and Theano

A study project using the AA-RMVSNet to reconstruct buildings from multiple images

Reducing Information Bottleneck for Weakly Supervised Semantic Segmentation (NeurIPS 2021)

Code for the paper "How Attentive are Graph Attention Networks?"

IAST: Instance Adaptive Self-training for Unsupervised Domain Adaptation (ECCV 2020)

Exploring Classification Equilibrium in Long-Tailed Object Detection, ICCV2021

ELSED: Enhanced Line SEgment Drawing

PyTorch Implementation of Small Lesion Segmentation in Brain MRIs with Subpixel Embedding (ORAL, MICCAIW 2021)

Moon-patrol - A faithful recreation of the 1983 hit classic Moon Patrol for the Atari 2600 created using the Pygame library for Python

Keras implementation of Normalizer-Free Networks and SGD - Adaptive Gradient Clipping

A PyTorch implementation of "SelfGNN: Self-supervised Graph Neural Networks without explicit negative sampling"

codebase for "A Theory of the Inductive Bias and Generalization of Kernel Regression and Wide Neural Networks"

a reimplementation of Holistically-Nested Edge Detection in PyTorch

Lightweight stereo matching network based on MobileNetV1 and MobileNetV2

Doge-Prediction - Coding Club prediction ig

A minimalist environment for decision-making in autonomous driving

Source code for "Taming Visually Guided Sound Generation" (Oral at the BMVC 2021)

Code for the paper: Fighting Fake News: Image Splice Detection via Learned Self-Consistency