Modeling Temporal Concept Receptive Field Dynamically for Untrimmed Video Analysis

Last update: Jul 08, 2021

Related tags

Overview

Modeling Temporal Concept Receptive Field Dynamically for Untrimmed Video Analysis

This is a PyTorch implementation of the model described in our paper:

Z. Qi, S. Wang, C. Su, L. Su, W. Zhang, and Q. Huang. Modeling Temporal Concept Receptive Field Dynamically for Untrimmed Video Analysis. ACM MM 2020.

Dependencies

Pytorch 1.2.0
Cuda 9.2.148
Cudnn 7.6.2
Opencv-python 4.2.0.34
Python 3.6.9

Data

Dataset Prepare

Download the pre-trained concept detector weights from Baidu passward 'wv0e' or Google Grive and put them in folder weights/
Download the FCVID dataset from http://bigvid.fudan.edu.cn/FCVID/.
The annotation information of each dataset is provided in folder data/FCVID/video_labels.
Extract the video frames for each video and put the extracted frames in folder data/FCVID/frames/.

For ActivityNet dataset ( http://activity-net.org/. ) , we use the latest released version of the dataset (v1.3).

Train

python main.py --gpu_ids 0,1 --model_name tdcmn_si_soa --dataset FCVID --no_test

for other hyperparameters, please refer to opts.py file.

Test

Pretrained model weigths are avaiable in Baidu passward 'szlk' or Google Grive
Download the pre-trained weights and put them in folder results/
python main.py --gpu_ids 0,1 --model_name tdcmn_si_soa --dataset FCVID --resume_path pretrained_model/tdcmn_si_soa.pth --no_train --test_crop_number 1

Citation

Please cite our paper if you use this code in your own work:

@inproceedings{qi2020modeling,
  title={Modeling Temporal Concept Receptive Field Dynamically for Untrimmed Video Analysis},
  author={Qi, Zhaobo and Wang, Shuhui and Su, Chi and Su, Li and Zhang, Weigang and Huang, Qingming},
  booktitle={Proceedings of the 28th ACM International Conference on Multimedia},
  pages={3798--3806},
  year={2020}
}

Contcat

If you have any problem about our code, feel free to contact

[email protected]

Modeling Temporal Concept Receptive Field Dynamically for Untrimmed Video Analysis

Related tags

Overview

Modeling Temporal Concept Receptive Field Dynamically for Untrimmed Video Analysis

Dependencies

Data

Dataset Prepare

Train

Test

Citation

Contcat

Owner

qzhb

Large-scale open domain KNOwledge grounded conVERsation system based on PaddlePaddle

Pytorch implementation of four neural network based domain adaptation techniques: DeepCORAL, DDC, CDAN and CDAN+E. Evaluated on benchmark dataset Office31.

Python implementation of ADD: Frequency Attention and Multi-View based Knowledge Distillation to Detect Low-Quality Compressed Deepfake Images, AAAI2022.

[CVPR 2021] Anycost GANs for Interactive Image Synthesis and Editing

A fast, dataset-agnostic, deep visual search engine for digital art history

The codes and models in 'Gaze Estimation using Transformer'.

Simple Tensorflow implementation of Toward Spatially Unbiased Generative Models (ICCV 2021)

Official implementation of "GS-WGAN: A Gradient-Sanitized Approach for Learning Differentially Private Generators" (NeurIPS 2020)

Machine Learning with JAX Tutorials

This is a Deep Leaning API for classifying emotions from human face and human audios.

DCA - Official Python implementation of Delaunay Component Analysis algorithm

Sample code from the Neural Networks from Scratch book.

A PyTorch Extension: Tools for easy mixed precision and distributed training in Pytorch

MADT: Offline Pre-trained Multi-Agent Decision Transformer

Code Release for ICCV 2021 (oral), "AdaFit: Rethinking Learning-based Normal Estimation on Point Clouds"

Semantic Segmentation Architectures Implemented in PyTorch

A Robust Non-IoU Alternative to Non-Maxima Suppression in Object Detection

Repositorio oficial del curso IIC2233 Programación Avanzada 🚀✨

Wider or Deeper: Revisiting the ResNet Model for Visual Recognition

Code for "LASR: Learning Articulated Shape Reconstruction from a Monocular Video". CVPR 2021.