Research code for CVPR 2021 paper "End-to-End Human Pose and Mesh Reconstruction with Transformers"

Last update: Dec 31, 2022

Overview

MeshTransformer ✨

This is our research code for End-to-End Human Pose and Mesh Reconstruction with Transformers.

MEsh TRansfOrmer is a simple yet effective transformer-based method for human pose and mesh reconstruction from an input image. In this repository, we provide our research code for training and testing our proposed method on the following tasks:

Human pose and mesh reconstruction
Hand pose and mesh reconstruction

Installation

Check INSTALL.md for installation instructions.

Model Zoo and Download

Please download our pre-trained models and the other files needed to run our code.

Check DOWNLOAD.md for details.

Quick demo

We provide demo code to run end-to-end inference on the test images.

Check DEMO.md for details.

Experiments

We provide Python code for training and evaluation.

Check EXP.md for details.

Contributing

We welcome contributions and suggestions. Please check CONTRIBUTE and CODE_OF_CONDUCT for details.

Citations

If you find our work useful in your research, please consider citing:

@inproceedings{lin2021end-to-end,
author = {Lin, Kevin and Wang, Lijuan and Liu, Zicheng},
title = {End-to-End Human Pose and Mesh Reconstruction with Transformers},
booktitle = {CVPR},
year = {2021},
}

License

Our research code is released under the MIT license. See LICENSE for details.

We use submodules from third parties, such as huggingface/transformers and hassony2/manopth. Please see NOTICE for details.

We note that any use of SMPL models and MANO models is subject to the Software Copyright License for non-commercial scientific research purposes. See SMPL-Model License and MANO License for details.

Acknowledgments

Our implementation and experiments are built on top of open-source GitHub repositories. We thank all the authors who made their code public, which tremendously accelerated our progress. If you find these works helpful, please consider citing them as well.

huggingface/transformers

HRNet/HRNet-Image-Classification

hongsukchoi/Pose2Mesh_RELEASE

mks0601/I2L-MeshNet_RELEASE

open-mmlab/mmdetection
