Attention-driven Robotic Manipulation (ARM)

This codebase is home to:

Q-attention: Enabling Efficient Learning for Vision-based Robotic Manipulation

Installation

ARM is trained using the YARR framework. Head to the YARR github page and follow installation instructions.

ARM is evaluated on RLBench 1.1.0. Head to the RLBench github page and follow installation instructions.

Now install project requirements:

pip install -r requirements.txt

Running experiments

Be sure to have RLBench demos saved on your machine before proceeding. To generate demos for a task, go to the tools directory in RLBench (rlbench/tools), and run:

python dataset_generator.py --save_path=/mnt/my/save/dir --tasks=take_lid_off_saucepan --image_size=128,128 \
--renderer=opengl --episodes_per_task=100 --variations=1 --processes=1

Experiments are launched via Hydra. To start training an agent to accomplish take_lid_off_saucepan with the default parameters on gpu 0, then run:

python launch.py method=ARM rlbench.task=take_lid_off_saucepan rlbench.demo_path=/mnt/my/save/dir framework.gpu=0

Attention-driven Robot Manipulation (ARM) which includes Q-attention

Related tags

Overview

Attention-driven Robotic Manipulation (ARM)

Installation

Running experiments

Owner

Stephen James

Code for ICCV 2021 paper: ARAPReg: An As-Rigid-As Possible Regularization Loss for Learning Deformable Shape Generators..

Mesh TensorFlow: Model Parallelism Made Easier

Building a real-time environment using webcam frame division in OpenCV and classify cropped images using a fine-tuned vision transformers on hybryd datasets samples for facial emotion recognition.

PGPortfolio: Policy Gradient Portfolio, the source code of "A Deep Reinforcement Learning Framework for the Financial Portfolio Management Problem"(https://arxiv.org/pdf/1706.10059.pdf).

Unsupervised Learning of Probably Symmetric Deformable 3D Objects from Images in the Wild

Python Library for learning (Structure and Parameter) and inference (Statistical and Causal) in Bayesian Networks.

Computer Vision application in the web

Official PyTorch implementation of "BlendGAN: Implicitly GAN Blending for Arbitrary Stylized Face Generation" (NeurIPS 2021)

这是一个deeplabv3-plus-pytorch的源码，可以用于训练自己的模型。

[ICCV 2021] Official Pytorch implementation for Discriminative Region-based Multi-Label Zero-Shot Learning SOTA results on NUS-WIDE and OpenImages

PFFDTD is an open-source FDTD simulator for 3D room acoustics

Neighbor2Seq: Deep Learning on Massive Graphs by Transforming Neighbors to Sequences

Official Repository for our ICCV2021 paper: Continual Learning on Noisy Data Streams via Self-Purified Replay

Vision-and-Language Navigation in Continuous Environments using Habitat

A program that can analyze videos according to the weights you select

Official implementation of "SegFormer: Simple and Efficient Design for Semantic Segmentation with Transformers"

The `rtdl` library + The official implementation of the paper

RepVGG: Making VGG-style ConvNets Great Again

This project deploys a yolo fastest model in the form of tflite on raspberry 3b+. The model is from another repository of mine called -Trash-Classification-Car

Unified Pre-training for Self-Supervised Learning and Supervised Learning for ASR