Code for paper: Towards Tokenized Human Dynamics Representation

Last update: May 31, 2022

Overview

Video Tokneization

Codebase for video tokenization, based on our paper Towards Tokenized Human Dynamics Representation.

Prerequisites (tested under Python 3.8 and CUDA 11.1)

apt-get install ffmpeg  
pip install torch==1.8  
pip install torchvision  
pip install pytorch-lightning  
pip install pytorch-lightning-bolts  
pip install aniposelib wandb gym test-tube ffmpeg-python matplotlib easydict scikit-learn

Data Preparation

Make a directory besides this repo and name it aistplusplus
Download from AIST++ website until it looks like

├── annotations
│   ├── cameras
│   ├── ignore_list.txt
│   ├── keypoints2d
│   ├── keypoints3d
│   ├── motions
│   └── splits
└── video_list.txt

How to run

Write one configuration file, e.g., configs/tan.yaml.
Run python pretrain.py --cfg configs/tan.yaml with GPU, which will create a folder under logs for this run. Folder name specified by the NAME in configuration file. Then run python cluster.py --cfg configs/tan.yaml (CPU-only) and check results in demo.ipynb.
Or you can download and unzip my training result into logs folder from here.

Code for paper: Towards Tokenized Human Dynamics Representation

Related tags

Overview

Video Tokneization

Prerequisites (tested under Python 3.8 and CUDA 11.1)

Data Preparation

How to run

Owner

Kenneth Li

Nerf pl - NeRF (Neural Radiance Fields) and NeRF in the Wild using pytorch-lightning

Convert ONNX model graph to Keras model format.

[CVPR 2021] Monocular depth estimation using wavelets for efficiency

A set of tools for creating and testing machine learning features, with a scikit-learn compatible API

An architecture that makes any doodle realistic, in any specified style, using VQGAN, CLIP and some basic embedding arithmetics.

Repositório para arquivos sobre o Módulo 1 do curso Top Coders da Let's Code + Safra

Pyramid Grafting Network for One-Stage High Resolution Saliency Detection. CVPR 2022

New AidForBlind - Various Libraries used like OpenCV and other mentioned in Requirements.txt

Video2x - A lossless video/GIF/image upscaler achieved with waifu2x, Anime4K, SRMD and RealSR.

Just Go with the Flow: Self-Supervised Scene Flow Estimation

REGTR: End-to-end Point Cloud Correspondences with Transformers

Transformers are Graph Neural Networks!

Code release for The Devil is in the Channels: Mutual-Channel Loss for Fine-Grained Image Classification (TIP 2020)

Flexible-Modal Face Anti-Spoofing: A Benchmark

SIMULEVAL A General Evaluation Toolkit for Simultaneous Translation

Source code related to the article submitted to the International Conference on Computational Science ICCS 2022 in London

PyTorch META-DATASET (Few-shot classification benchmark)

NeuroFind - A solution to the to the Task given by the Oberseminar of Messtechnik Institute of TU Dresden in 2021

HuSpaCy: industrial-strength Hungarian natural language processing

The 1st Place Solution of the Facebook AI Image Similarity Challenge (ISC21) : Descriptor Track.