Multimodal Reinforcement Learning

JAX implementations of the following multimodal reinforcement learning approaches.

Dual-coding Episodic Memory from "Grounded Language Learning Fast and Slow"

The goal in this setting is for the agent to be presented with multiple objects with made up names following "This is a _____" statements and to then carry out an instruction such as "Move the wazzle to the table." This task requires the agent to learn long-term language and vision representations for concepts like "This is a" and objects that carry over between episodes such as "table" while also being able to learn one-shot representations of novel objects and their names.

Usage

Start by setting up the environment locally by running

poetry install
poetry shell

The learning environment depends on Docker and requires that the Docker Desktop program is running (on Mac). Once that's done you can run the default environment (fast mapping with 3 objects from the paper).

python fast_slow_learning/main.py

Solving reinforcement learning tasks which require language and vision

Related tags

Overview

Multimodal Reinforcement Learning

Usage

Owner

Henry Prior

Implementation of CVPR'2022:Surface Reconstruction from Point Clouds by Learning Predictive Context Priors

Noise Conditional Score Networks (NeurIPS 2019, Oral)

Weakly Supervised Posture Mining with Reverse Cross-entropy for Fine-grained Classification

(CVPR2021) ClassSR: A General Framework to Accelerate Super-Resolution Networks by Data Characteristic

CaFM-pytorch ICCV ACCEPT Introduction of dataset VSD4K

Quantization library for PyTorch. Support low-precision and mixed-precision quantization, with hardware implementation through TVM.

Consumer Fairness in Recommender Systems: Contextualizing Definitions and Mitigations

This is an official implementation of our CVPR 2021 paper "Bottom-Up Human Pose Estimation Via Disentangled Keypoint Regression" (https://arxiv.org/abs/2104.02300)

On the Analysis of French Phonetic Idiosyncrasies for Accent Recognition

details on efforts to dump the Watermelon Games Paprium cart

Example for AUAV 2022 with obstacle avoidance.

Live training loss plot in Jupyter Notebook for Keras, PyTorch and others

Additional environments compatible with OpenAI gym

DeFMO: Deblurring and Shape Recovery of Fast Moving Objects (CVPR 2021)

CoTr: Efficiently Bridging CNN and Transformer for 3D Medical Image Segmentation

Robust Lane Detection via Expanded Self Attention (WACV 2022)

This repository contains the code and models necessary to replicate the results of paper: How to Robustify Black-Box ML Models? A Zeroth-Order Optimization Perspective

Keras like implementation of Deep Learning architectures from scratch using numpy.

Official repository with code and data accompanying the NAACL 2021 paper "Hurdles to Progress in Long-form Question Answering" (https://arxiv.org/abs/2103.06332).

An implementation of DeepMind's Relational Recurrent Neural Networks in PyTorch.