Contrastively Disentangled Sequential Variational Autoencoder

Last update: Dec 24, 2022


Contrastively Disentangled Sequential Variational Autoencoder (C-DSVAE)

Overview

This repository contains the implementation of C-DSVAE, a self-supervised method for learning disentangled sequential representations.

Requirements

Python 3
PyTorch 1.7
NumPy 1.18.5

Dataset

Sprites

We provide the raw Sprites .npy files. One can also find the dataset on a third-party repo.

For each split (train/test), each sequence sample is expected to provide the following components:

x: raw sample of shape [8, 3, 64, 64]
c_aug: content augmentation of shape [8, 3, 64, 64]
m_aug: motion augmentation of shape [8, 3, 64, 64]
motion factors: action (3 classes), direction (3 classes)
content factors: skin, tops, pants, hair (each with 6 classes)
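
The layout above can be checked before training with a small validator. This is only a sketch under stated assumptions: it assumes each sample is a dict, that the array entries use the names x, c_aug and m_aug from the list, and that each factor is stored as an integer class index starting at 0. The label key names (action, direction, skin, tops, pants, hair) and the helper check_sample are hypothetical and may not match how the provided .npy files are organized.

```python
# Expected per-sample layout, taken from the component list above.
# Assumption: a sample is a dict; the label key names are hypothetical.
SEQ_SHAPE = (8, 3, 64, 64)
ARRAY_KEYS = ("x", "c_aug", "m_aug")
NUM_CLASSES = {
    # motion factors
    "action": 3,
    "direction": 3,
    # content factors
    "skin": 6,
    "tops": 6,
    "pants": 6,
    "hair": 6,
}


def check_sample(sample):
    """Raise ValueError on a shape or label mismatch.

    Array entries only need a ``.shape`` attribute, so NumPy arrays and
    PyTorch tensors both work. A missing key raises KeyError.
    """
    for key in ARRAY_KEYS:
        shape = tuple(sample[key].shape)
        if shape != SEQ_SHAPE:
            raise ValueError(f"{key}: expected shape {SEQ_SHAPE}, got {shape}")
    for name, n_classes in NUM_CLASSES.items():
        # Assumption: labels are integer class indices in [0, n_classes).
        label = int(sample[name])
        if not 0 <= label < n_classes:
            raise ValueError(f"{name}: label {label} outside [0, {n_classes})")
```

Calling check_sample on the first few items of each split is usually enough to catch a transposed axis or an off-by-one label encoding. If the labels are stored one-hot rather than as indices, the label check would need to be adapted.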

Running

Train

./run_cdsvae.sh

Test

./run_test_sprite.sh

Classification Judge

The judge classifiers are pretrained separately with full supervision.

Sprites judge

C-DSVAE Checkpoints

We provide a sample Sprites checkpoint. Checkpoint parameters can be found in ./run_test_sprite.sh.

Paper

If you are inspired by our work, please cite the following paper:

@inproceedings{bai2021contrastively,
  title={Contrastively Disentangled Sequential Variational Autoencoder},
  author={Bai, Junwen and Wang, Weiran and Gomes, Carla},
  booktitle={Advances in Neural Information Processing Systems},
  volume={},
  year={2021}
}

Owner

Junwen Bai
