UMPNet: Universal Manipulation Policy Network for Articulated Objects

Last update: Dec 03, 2022

Overview

UMPNet: Universal Manipulation Policy Network for Articulated Objects

Zhenjia Xu, Zhanpeng He, Shuran Song
Columbia University
Robotics and Automation Letters (RA-L) / ICRA 2022

Project Page | Video | arXiv

Overview

This repo contains the PyTorch implementation for paper "UMPNet: Universal Manipulation Policy Network for Articulated Objects".

Prerequisites

The code is built with Python 3.6. Libraries are listed in requirements.txt and can be installed with pip by:

pip install -r requirements.txt

Data Preparation

Prepare object URDF and pretrained model.

mobility_dataset: URDF of 12 training and 10 testing object categories.
pretrained: pretrained model.

Download, unzip, and organize as follows:

/umpnet
    /mobility_dataset
    /pretrained
    ...

Testing

Test with GUI

There are also two modes of testing: exploration and manipulation.

# Open-ended state exploration
python test_gui.py --mode exploration --category CATEGORY

# Goal conditioned manipulation
python test_gui.py --mode manipulation --category CATEGORY

Here CATEGORY can be chosen from:

training categories]: Refrigerator, FoldingChair, Laptop, Stapler, TrashCan, Microwave, Toilet, Window, StorageFurniture, Switch, Kettle, Toy
[Testing categories]: Box, Phone, Dishwasher, Safe, Oven, WashingMachine, Table, KitchenPot, Bucket, Door

Quantitative Evaluation

There are also two modes of testing: exploration and manipulation.

# Open-ended state exploration
python test_quantitative.py --mode exploration

# Goal conditioned manipulation
python test_quantitative.py --mode manipulation

By default, it will run quantitative evaluation for each category. You can modify pool_list(L91) to run evaluation for a specific category.

Training

Hyper-parameters mentioned in paper are provided in default arguments.

python train.py --exp EXP_NAME

Then a directory will be created at exp/EXP_NAME, in which checkpoints, visualization, and replay buffer will be stored.

BibTeX

@article{xu2022umpnet,
  title={UMPNet: Universal manipulation policy network for articulated objects},
  author={Xu, Zhenjia and Zhanpeng, He and Song, Shuran},
  journal={IEEE Robotics and Automation Letters},
  year={2022},
  publisher={IEEE}
}

License

This repository is released under the MIT license. See LICENSE for additional details.

Acknowledgement

The code for spherical sampling is modified from area-beamforming.
The code for UNet is modified from Pytorch-UNet.

UMPNet: Universal Manipulation Policy Network for Articulated Objects

Related tags

Overview

UMPNet: Universal Manipulation Policy Network for Articulated Objects

Project Page | Video | arXiv

Overview

Content

Prerequisites

Data Preparation

Testing

Test with GUI

Quantitative Evaluation

Training

BibTeX

License

Acknowledgement

Owner

Columbia Artificial Intelligence and Robotics Lab

Prososdy Morph: A python library for manipulating pitch and duration in an algorithmic way, for resynthesizing speech.

A framework for multi-step probabilistic time-series/demand forecasting models

Bilinear attention networks for visual question answering

[CVPR 2021] MiVOS - Scribble to Mask module

Neural network for stock price prediction

FrankMocap: A Strong and Easy-to-use Single View 3D Hand+Body Pose Estimator

Robust Instance Segmentation through Reasoning about Multi-Object Occlusion [CVPR 2021]

QI-Q RoboMaster2022 CV Algorithm

The source code and data of the paper "Instance-wise Graph-based Framework for Multivariate Time Series Forecasting".

BARF: Bundle-Adjusting Neural Radiance Fields 🤮 (ICCV 2021 oral)

DABO: Data Augmentation with Bilevel Optimization

Re-implementation of the vector capsule with dynamic routing

3D dataset of humans Manipulating Objects in-the-Wild (MOW)

Quasi-Dense Similarity Learning for Multiple Object Tracking, CVPR 2021 (Oral)

3D2Unet: 3D Deformable Unet for Low-Light Video Enhancement (PRCV2021)

Learning Time-Critical Responses for Interactive Character Control

Author's PyTorch implementation of TD3+BC, a simple variant of TD3 for offline RL

API for RL algorithm design & testing of BCA (Building Control Agent) HVAC on EnergyPlus building energy simulator by wrapping their EMS Python API

Optimized Gillespie algorithm for simulating Stochastic sPAtial models of Cancer Evolution (OG-SPACE)

ROSITA: Enhancing Vision-and-Language Semantic Alignments via Cross- and Intra-modal Knowledge Integration