BABEL: Bodies, Action and Behavior with English Labels [CVPR 2021]

Last update: Dec 28, 2022

Related tags

Overview

BABEL: Bodies, Action and Behavior with English Labels [CVPR 2021]

Abhinanda R. Punnakkal*, Arjun Chandrasekaran*, Nikos Athanasiou, Alejandra Quiros-Ramirez, Michael J. Black. * denotes equal contribution

Project Website | Paper | Video | Poster

BABEL is a large dataset with language labels describing the actions being performed in mocap sequences. BABEL labels about 43 hours of mocap sequences from AMASS [1] with action labels. Sequences have action labels at two possible levels of abstraction:

Sequence labels which describe the overall action in the sequence
Frame labels which describe all actions in every frame of the sequence. Each frame label is precisely aligned with the duration of the corresponding action in the mocap sequence, and multiple actions can overlap.

To download the BABEL action labels, visit our 'Data' page. You can download the mocap sequences from AMASS.

Tutorials

We release some helper code in Jupyter notebooks to load the BABEL dataset, visualize mocap sequences and their action labels, search BABEL for sequences containing specific actions, etc.

See notebooks/ for more details.

Action Recognition

We provide features, training and inference code, and pre-trained checkpoints for 3D skeleton-based action recognition.

Please see action_recognition/ for more details.

Acknowledgements

The notebooks in this repo are inspired by the those provided by AMASS. The Action Recognition code is based on the 2s-AGCN implementation.

References

[1] Mahmood, Naureen, et al. "AMASS: Archive of motion capture as surface shapes." Proceedings of the IEEE/CVF International Conference on Computer Vision. 2019.

License

Software Copyright License for non-commercial scientific research purposes. Please read carefully the terms and conditions and any accompanying documentation before you download and/or use the AMASS dataset, and software, (the "Model & Software"). By downloading and/or using the Model & Software (including downloading, cloning, installing, and any other use of this GitHub repository), you acknowledge that you have read these terms and conditions, understand them, and agree to be bound by them. If you do not agree with these terms and conditions, you must not download and/or use the Model & Software. Any infringement of the terms of this agreement will automatically terminate your rights under this License.

Contact

The code in this repository is developed by Abhinanda Punnakkal and Arjun Chandrasekaran.

If you have any questions you can contact us at [email protected].

BABEL: Bodies, Action and Behavior with English Labels [CVPR 2021]

Related tags

Overview

BABEL: Bodies, Action and Behavior with English Labels [CVPR 2021]

Tutorials

Action Recognition

Acknowledgements

References

License

Contact

Owner

TJU Deep Learning & Neural Network

The official implementation code of "PlantStereo: A Stereo Matching Benchmark for Plant Surface Dense Reconstruction."

PyTorch implementation of CVPR'18 - Perturbative Neural Networks

The PyTorch implementation of DiscoBox: Weakly Supervised Instance Segmentation and Semantic Correspondence from Box Supervision.

Official PyTorch implementation of "IntegralAction: Pose-driven Feature Integration for Robust Human Action Recognition in Videos", CVPRW 2021

The codebase for our paper "Generative Occupancy Fields for 3D Surface-Aware Image Synthesis" (NeurIPS 2021)

Api for getting bin info and getting encrypted card details for adyen.

The final project for "Applying AI to Wearable Device Data" course from "AI for Healthcare" - Udacity.

Rename Images with Auto Generated Neural Image Captions

Estimating Example Difficulty using Variance of Gradients

WarpRNNT loss ported in Numba CPU/CUDA for Pytorch

High-performance moving least squares material point method (MLS-MPM) solver.

Towards Fine-Grained Reasoning for Fake News Detection

Official implementation for: Blended Diffusion for Text-driven Editing of Natural Images.

LeViT a Vision Transformer in ConvNet's Clothing for Faster Inference

This repository contains a pytorch implementation of "StereoPIFu: Depth Aware Clothed Human Digitization via Stereo Vision".

Photo2cartoon - 人像卡通化探索项目 (photo-to-cartoon translation project)

AdvStyle - Official PyTorch Implementation

Code for STFT Transformer used in BirdCLEF 2021 competition.

Official code for 'Weakly-supervised Video Anomaly Detection with Robust Temporal Feature Magnitude Learning' [ICCV 2021]