This is a repository of our model for weakly-supervised video dense anticipation.

Last update: Apr 09, 2022

Related tags

Deep Learning WSLVideoDenseAnticipation

Overview

Introduction

This is a repository of our model for weakly-supervised video dense anticipation.

More results on GTEA, Epic-Kitchens etc. will come soon and publish here.

Please refer to our paper Weakly-Supervised Dense ActionAnticipation, published in The British Machine Vision Conference (BMVC), 2021. Paper link: http://arxiv.org/abs/2111.07593

How to use the code

python main.py --dataset YOURDATASET --feature_type YOURFEATURETYPE --n_classes NUMBEROFCLASSES --observation OBSERVEPERCENTAGE --prediction PREDICTPERCENTAGE --fps VIDEOFPS --batch BATCH --model PATHTOSAVEMODEL

Please refer to main.py for the meaning of each argument.

The code is written on the basis of the ECCV 2020 paper Temporal Aggregate Representations for Long-Range Video Understanding, which is one of the backbones we used in our paper. Please refer to this repository https://github.com/dipika-singhania/multi-scale-action-banks for the original code. The default arguments in main.py follow this paper.

Please contact the authors of the above ECCV paper if you need the original data. If you want to use your own data, please format it as the original data, or edit data_preprocessing.py and data_loader.py.

This is a repository of our model for weakly-supervised video dense anticipation.

Related tags

Overview

Introduction

How to use the code

Owner

Seg-Torch for Image Segmentation with Torch

Dashboard for the COVID19 spread

Applying CLIP to Point Cloud Recognition.

BERT model training impelmentation using 1024 A100 GPUs for MLPerf Training v1.1

Collective Multi-type Entity Alignment Between Knowledge Graphs (WWW'20)

A collection of inference modules for fastai2

RP-GAN: Stable GAN Training with Random Projections

Perturb-and-max-product: Sampling and learning in discrete energy-based models

Align and Prompt: Video-and-Language Pre-training with Entity Prompts

Research into Forex price prediction from price history using Deep Sequence Modeling with Stacked LSTMs.

reimpliment of DFANet: Deep Feature Aggregation for Real-Time Semantic Segmentation

The Balloon Learning Environment - flying stratospheric balloons with deep reinforcement learning.

Code and real data for the paper "Counterfactual Temporal Point Processes", available at arXiv.

Clinica is a software platform for clinical research studies involving patients with neurological and psychiatric diseases and the acquisition of multimodal data

CoReD: Generalizing Fake Media Detection with Continual Representation using Distillation (ACMMM'21 Oral Paper)

Official implementation of the paper: "LDNet: Unified Listener Dependent Modeling in MOS Prediction for Synthetic Speech"

Tensorflow 2 Object Detection API kurulumu, GPU desteği, custom model hazırlama

The MLOps platform for innovators 🚀

Differentiable simulation for system identification and visuomotor control

Karate Club: An API Oriented Open-source Python Framework for Unsupervised Learning on Graphs (CIKM 2020)