Official implementation for Multi-Modal Interaction Graph Convolutional Network for Temporal Language Localization in Videos

Last update: Oct 18, 2022

Overview

Multi-Modal Interaction Graph Convolutional Network for Temporal Language Localization in Videos

Model Pipeline

Usage

Environment Settings

We use the PyTorch framework.

Python version: 3.7.0
PyTorch version: 1.4.0
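
To confirm that an environment matches the versions listed above before training, the version strings can be compared directly. The following is a minimal sketch; the helper names `matches_expected` and `report`, and the policy of ignoring local build suffixes such as `+cu100`, are assumptions for illustration and are not part of the repository.

```python
import sys

# Versions stated in this README.
EXPECTED_PYTHON = "3.7.0"
EXPECTED_TORCH = "1.4.0"


def matches_expected(found: str, expected: str) -> bool:
    """Return True if the release part of `found` equals `expected`.

    Anything after a "+" (a local build suffix) is ignored; this is an
    illustrative policy, not a requirement stated by the authors.
    """
    release = found.split("+", 1)[0]
    return release == expected


def report() -> None:
    """Print whether the running Python and PyTorch match the README."""
    python_found = "{}.{}.{}".format(*sys.version_info[:3])
    status = "ok" if matches_expected(python_found, EXPECTED_PYTHON) else "differs"
    print("Python", python_found, status)
    try:
        import torch  # imported lazily so the check also runs without PyTorch
    except ImportError:
        print("PyTorch is not installed")
        return
    status = "ok" if matches_expected(torch.__version__, EXPECTED_TORCH) else "differs"
    print("PyTorch", torch.__version__, status)


if __name__ == "__main__":
    report()
```

A "differs" line does not necessarily mean the code will fail; it only signals that the environment is not the one the authors report using.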

Get Code

Clone the repository:

git clone https://github.com/zmzhang2000/MIGCN.git
cd MIGCN

Data Preparation

Charades-STA

Download the preprocessed annotations and features of Charades-STA with I3D features.
Save them in data/charades.

ActivityNet

Download the preprocessed annotations of ActivityNet.
Download the C3D features of ActivityNet.
Process the C3D features according to process_activitynet_c3d() in data/preprocess/preprocess.py.
Save them in data/activitynet.

Pre-trained Models

Download the checkpoints of Charades-STA and ActivityNet.
Save them in checkpoints.
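
The download steps above place files in three directories: data/charades, data/activitynet, and checkpoints. A short layout check can catch a misplaced download before a long run starts. This is a sketch under the assumption that it is run from the repository root; the helper name `missing_dirs` is hypothetical, and the check only tests that the directories exist, not that their contents are complete.

```python
from pathlib import Path

# Directories named in this README, relative to the repository root.
EXPECTED_DIRS = ("data/charades", "data/activitynet", "checkpoints")


def missing_dirs(repo_root="."):
    """Return the expected directories that do not exist under repo_root."""
    root = Path(repo_root)
    return [d for d in EXPECTED_DIRS if not (root / d).is_dir()]


if __name__ == "__main__":
    for d in missing_dirs():
        print("missing:", d)
```

If only one dataset is needed, a missing directory for the other dataset can be ignored.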

Data Generation

We provide the generation procedure of all MIGCN data.

The raw data is listed in data/raw_data/download.sh.
The preprocessing code is in data/preprocess.

Training

Train MIGCN on Charades-STA with I3D features:

python main.py --dataset charades --feature i3d

Train MIGCN on ActivityNet with C3D features:

python main.py --dataset activitynet --feature c3d

Testing

Test MIGCN on Charades-STA with I3D features:

python main.py --dataset charades --feature i3d --test --model_load_path checkpoints/$MODEL_CHECKPOINT

Test MIGCN on ActivityNet with C3D features:

python main.py --dataset activitynet --feature c3d --test --model_load_path checkpoints/$MODEL_CHECKPOINT
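
The training and testing commands above differ only in the dataset, the feature type, and the presence of --test with --model_load_path. When scripting several runs, the argument list can be built in one place. The following is a minimal sketch that uses only the flags shown in this README; the function names `build_command` and `run` are hypothetical, and the script is assumed to be started from the repository root.

```python
import subprocess
import sys


def build_command(dataset, feature, checkpoint=None):
    """Build the argv list for main.py using only flags shown in the README."""
    cmd = [sys.executable, "main.py", "--dataset", dataset, "--feature", feature]
    if checkpoint is not None:
        # Testing mode: add --test and point --model_load_path at the checkpoint.
        cmd += ["--test", "--model_load_path", checkpoint]
    return cmd


def run(dataset, feature, checkpoint=None):
    """Run main.py in the current directory and raise if it exits non-zero."""
    subprocess.run(build_command(dataset, feature, checkpoint), check=True)
```

Calling `run("charades", "i3d")` corresponds to the Charades-STA training command, and passing a checkpoint path as the third argument corresponds to the matching test command.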

Other Hyper-parameters

List other hyper-parameters by:

python main.py -h

Reference

Please cite the following paper if MIGCN is helpful for your research:

@ARTICLE{9547801,
  author={Zhang, Zongmeng and Han, Xianjing and Song, Xuemeng and Yan, Yan and Nie, Liqiang},
  journal={IEEE Transactions on Image Processing}, 
  title={Multi-Modal Interaction Graph Convolutional Network for Temporal Language Localization in Videos}, 
  year={2021},
  volume={30},
  number={},
  pages={8265-8277},
  doi={10.1109/TIP.2021.3113791}}

Owner

Zongmeng Zhang
