A large-scale video dataset for the training and evaluation of 3D human pose estimation models

Last update: Jun 20, 2021

Overview

ASPset-510

ASPset-510 (Australian Sports Pose Dataset) is a large-scale video dataset for the training and evaluation of 3D human pose estimation models. It contains 17 different amateur subjects performing 30 sports-related actions each, for a total of 510 action clips.

This repository contains Python code for working with ASPset-510.

If you don't want to use these scripts and would prefer to directly download the data yourself, ASPset-510 is available on the Internet Archive at https://archive.org/details/aspset510.

Requirements

Core

$ conda env create -f environment.yml

python >= 3.6
numpy
ezc3d
posekit

GUI (Optional)

$ conda env update -f environment-gui.yml

PyOpenGL
glfw
matplotlib

PyTorch (Optional)

$ conda env update -f environment-torch.yml

Scripts

Downloading the dataset

download_data.py downloads and extracts ASPset-510 data.

Example usage:

$ python src/aspset510/bin/download_data.py --data-dir=./data

Note that by default the original archive files will be downloaded and kept in the archives subdirectory of whichever path you set using --data-dir. To set a different path for the archives, use the --archive-dir option. To download the archives without extracting them, use the --skip-extraction option.

Browsing clips from the dataset

browse_clips.py provides a graphical user interface for browsing clips from ASPset-510.

Example usage:

$ python src/aspset510/bin/browse_clips.py --data-dir=./data

Acknowledgments and license

ASPset-510 is brought to you by La Trobe University and the Australian Institute of Sport. It is dedicated to the public domain under the CC0 1.0 license.

If you find this dataset useful for your own work, please cite the following paper:

@article{nibali2021aspset,
  title={{ASPset}: An Outdoor Sports Pose Video Dataset With {3D} Keypoint Annotations},
  author={Nibali, Aiden and Millward, Joshua and He, Zhen and Morgan, Stuart},
  journal={Image and Vision Computing},
  pages={104196},
  year={2021},
  issn={0262-8856},
  doi={https://doi.org/10.1016/j.imavis.2021.104196},
  url={https://www.sciencedirect.com/science/article/pii/S0262885621001013},
  publisher={Elsevier}
}

A large-scale video dataset for the training and evaluation of 3D human pose estimation models

Related tags

Overview

ASPset-510

Requirements

Core

GUI (Optional)

PyTorch (Optional)

Scripts

Downloading the dataset

Browsing clips from the dataset

Acknowledgments and license

Owner

Aiden Nibali

Digitalizing-Prescription-Image - PIRDS - Prescription Image Recognition and Digitalizing System is a OCR make with Tensorflow

Scaling Vision with Sparse Mixture of Experts

Official PyTorch implementation of SyntaSpeech (IJCAI 2022)

A python implementation of Deep-Image-Analogy based on pytorch.

Projecting interval uncertainty through the discrete Fourier transform

Implementation of PyTorch-based multi-task pre-trained models

The object detection pipeline is based on Ultralytics YOLOv5

[ICCV 2021] Encoder-decoder with Multi-level Attention for 3D Human Shape and Pose Estimation

Keras Image Embeddings using Contrastive Loss

Compositional and Parameter-Efficient Representations for Large Knowledge Graphs

TensorFlow implementation of the paper "Hierarchical Attention Networks for Document Classification"

Code for the Convolutional Vision Transformer (ConViT)

The official implementation of You Only Compress Once: Towards Effective and Elastic BERT Compression via Exploit-Explore Stochastic Nature Gradient.

Source code for Fathony, Sahu, Willmott, & Kolter, "Multiplicative Filter Networks", ICLR 2021.

Crawl & visualize ICLR papers and reviews

《Where am I looking at? Joint Location and Orientation Estimation by Cross-View Matching》(CVPR 2020)

🍅🍅🍅YOLOv5-Lite: lighter, faster and easier to deploy. Evolved from yolov5 and the size of model is only 1.7M (int8) and 3.3M (fp16). It can reach 10+ FPS on the Raspberry Pi 4B when the input size is 320×320~

Synthetic LiDAR sequential point cloud dataset with point-wise annotations

A Kitti Road Segmentation model implemented in tensorflow.

The Hailo Model Zoo includes pre-trained models and a full building and evaluation environment