[ICRA 2022] CaTGrasp: Learning Category-Level Task-Relevant Grasping in Clutter from Simulation

Last update: Jan 04, 2023

Overview

This is the official implementation of our paper:

Bowen Wen, Wenzhao Lian, Kostas Bekris, and Stefan Schaal. "CaTGrasp: Learning Category-Level Task-Relevant Grasping in Clutter from Simulation." IEEE International Conference on Robotics and Automation (ICRA) 2022.

Abstract

Task-relevant grasping is critical for industrial assembly, where downstream manipulation tasks constrain the set of valid grasps. Learning how to perform this task, however, is challenging, since task-relevant grasp labels are hard to define and annotate. There is also yet no consensus on proper representations for modeling or off-the-shelf tools for performing task-relevant grasps. This work proposes a framework to learn task-relevant grasping for industrial objects without the need of time-consuming real-world data collection or manual annotation. To achieve this, the entire framework is trained solely in simulation, including supervised training with synthetic label generation and self-supervised, hand-object interaction. In the context of this framework, this paper proposes a novel, object-centric canonical representation at the category level, which allows establishing dense correspondence across object instances and transferring task-relevant grasps to novel instances. Extensive experiments on task-relevant grasping of densely-cluttered industrial objects are conducted in both simulation and real-world setups, demonstrating the effectiveness of the proposed framework.

Bibtex

@article{wen2021catgrasp,
  title={CaTGrasp: Learning Category-Level Task-Relevant Grasping in Clutter from Simulation},
  author={Wen, Bowen and Lian, Wenzhao and Bekris, Kostas and Schaal, Stefan},
  journal={ICRA 2022},
  year={2022}
}

Supplementary Video

Click to watch

ICRA 2022 Presentation Video

Quick Setup

We provide a Docker environment, so setup takes only a few lines.

If you haven't installed Docker, install it first (https://docs.docker.com/get-docker/).

Run

docker pull wenbowen123/catgrasp:latest

To enter the container, run:
```
cd docker && bash run_container.sh
cd /home/catgrasp && bash build.sh
```
Now the environment is ready to run training or testing.

Data

Download the object models and pretrained network weights from here. Then extract them and replace the corresponding files in this repo, so that the layout looks like:

  catgrasp
  ├── artifacts
  ├── data
  └── urdf
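
Before running anything, it can help to confirm that the extracted files landed where the scripts expect them. The following is a minimal sketch and not part of the repo: the helper name `missing_dirs` is hypothetical, and it checks only the three top-level folders shown above.

```python
from pathlib import Path

# Top-level folders taken from the layout shown above.
EXPECTED_DIRS = ("artifacts", "data", "urdf")


def missing_dirs(repo_root):
    """Return the expected folders that are absent under repo_root (hypothetical helper)."""
    root = Path(repo_root)
    return [name for name in EXPECTED_DIRS if not (root / name).is_dir()]


if __name__ == "__main__":
    # Assumes the script is launched from the catgrasp repo root.
    missing = missing_dirs(".")
    if missing:
        print("Missing folders:", ", ".join(missing))
    else:
        print("Data layout looks complete.")
```

An empty result only means the folders exist; it does not verify their contents.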

Testing

python run_grasp_simulation.py

You should see the demo start. You can adjust the settings in config_run.yml, for example switching to a different object instance within the category while using the same framework.

Training

In the following, we use the nut category as an example to walk through the training steps.

Compute the signed distance function for all objects of the category
```
python make_sdf.py --class_name nut
```
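
For readers unfamiliar with the term, a signed distance function returns the distance from a query point to an object's surface, with the sign indicating whether the point lies inside or outside. The toy sketch below uses an analytic sphere and the common negative-inside convention. Both are assumptions chosen for illustration, and the sketch does not reflect how make_sdf.py processes the object models.

```python
import math


def sphere_sdf(point, center=(0.0, 0.0, 0.0), radius=1.0):
    """Signed distance from point to a sphere surface (negative inside, positive outside)."""
    return math.dist(point, center) - radius


# Query points inside, on, and outside a unit sphere centered at the origin.
for p in [(0.0, 0.0, 0.0), (1.0, 0.0, 0.0), (2.0, 0.0, 0.0)]:
    print(p, sphere_sdf(p))
```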
Pre-compute offline grasps for the training objects. This generates grasps and evaluates their quality regardless of task relevance. To visualize and debug the grasp quality evaluation, change to --debug 1
```
python generate_grasp.py --class_name nut --debug 0
```
Self-supervised task-relevance discovery in simulation
```
python pybullet_env/env_semantic_grasp.py --class_name nut --debug 0
```
Changing --debug 0 to --debug 1 lets you debug and visualize the process.

The affordance results will be saved in data/object_models. The heatmap file XXX_affordance_vis can be visualized, with warmer areas indicating regions of higher task-relevant grasping probability P(T|G).

Make the canonical model that stores category-level knowledge
```
python make_canonical.py --class_name nut
```

Generate training data of piles

python generate_pile_data.py --class_name nut

Process training data, including generating ground-truth labels
```
python tool.py
```
To train the NUNOCS net, examine the settings in config_nunocs.yml, then run
```
python train_nunocs.py
```
To train the grasping-Q net, examine the settings in config_grasp.yml, then run
```
python train_grasp.py
```
To train the instance segmentation net, examine the settings in PointGroup/config/config_pointgroup.yaml, then run
```
python train_pointgroup.py
```
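
The training steps above are run one after another, so they can be collected into a small driver. This is a hedged sketch rather than a script shipped with the repo: the function names `training_commands` and `run_pipeline` are hypothetical, the commands and flags are copied from the steps above, and it assumes everything is launched from the repo root inside the container.

```python
import subprocess


def training_commands(class_name, debug=0):
    """Return the training steps listed above, in order, as argv lists (hypothetical helper)."""
    dbg = str(debug)
    return [
        ["python", "make_sdf.py", "--class_name", class_name],
        ["python", "generate_grasp.py", "--class_name", class_name, "--debug", dbg],
        ["python", "pybullet_env/env_semantic_grasp.py", "--class_name", class_name, "--debug", dbg],
        ["python", "make_canonical.py", "--class_name", class_name],
        ["python", "generate_pile_data.py", "--class_name", class_name],
        ["python", "tool.py"],
        ["python", "train_nunocs.py"],
        ["python", "train_grasp.py"],
        ["python", "train_pointgroup.py"],
    ]


def run_pipeline(class_name, debug=0, dry_run=True):
    """Print each step; execute it only when dry_run is False, stopping at the first failure."""
    for cmd in training_commands(class_name, debug):
        print(" ".join(cmd))
        if not dry_run:
            # check=True raises CalledProcessError if a step exits with a non-zero status.
            subprocess.run(cmd, check=True)


if __name__ == "__main__":
    # Dry run by default so nothing is launched by accident.
    run_pipeline("nut", debug=0, dry_run=True)
```

The three training scripts read their settings from the config files named above, so review those files before switching off `dry_run`.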
