Activity image-based video retrieval

Last update: Oct 21, 2021

Related tags

Overview

Cross-modal-retrieval

Our approach is focus on Activity Image-to-Video Retrieval (AIVR) task. The compared methods are state-of-the-art single modality hashing methods, multiple modalities hashing methods and cross-modal retrieval methods.

Single modality hashing methods

Some hashing baselines for image retrieval can be found in https://github.com/willard-yuan/hashing-baseline-for-image-retrieval.

Multiple modalities hashing methods

More details refer to https://github.com/czxxjtu/Hash-Learning.github.io. Some details about hashing methods are in hashing-baseline-for-image-retrieval-master folder.

Cross-modal retrieval methods

The compared cross-modal retrieval methods are according to the paper:

Datasets

THUMOS'14 Dataset:

https://pan.baidu.com/s/1H6c8nh_Hs7gVkhESpxtvAg 提取码：qp26

ActivityNet Dataset:

https://pan.baidu.com/s/1P0jRecEmplCPaTPwFoOpVQ 提取码：pnw9

Bibtex

When using images from our dataset, please cite our paper using the following BibTeX[PDF]：

@article{pba2020,
author    = {Ruicong Xu and Li Niu and Jianfu Zhang and Liqing Zhang},
title     = {A Proposal-based Approach for Activity Image-to-Video Retrieval},
journal   = {AAAI},
year      = {2020}}

Activity image-based video retrieval

Related tags

Overview

Cross-modal-retrieval

Single modality hashing methods

Multiple modalities hashing methods

Cross-modal retrieval methods

Datasets

THUMOS'14 Dataset:

ActivityNet Dataset:

Bibtex

Owner

BCMI

Implementation for Curriculum DeepSDF

Official Pytorch implementation of "Learning to Estimate Robust 3D Human Mesh from In-the-Wild Crowded Scenes", CVPR 2022

Context Axial Reverse Attention Network for Small Medical Objects Segmentation

This repository contains the implementation of the HealthGen model, a generative model to synthesize realistic EHR time series data with missingness

This is an open-source toolkit for Heterogeneous Graph Neural Network(OpenHGNN) based on DGL [Deep Graph Library] and PyTorch.

Codes of paper "Unseen Object Amodal Instance Segmentation via Hierarchical Occlusion Modeling"

Uncertainty Estimation via Response Scaling for Pseudo-mask Noise Mitigation in Weakly-supervised Semantic Segmentation

AQP is a modular pipeline built to enable the comparison and testing of different quality metric configurations.

The code written during my Bachelor Thesis "Classification of Human Whole-Body Motion using Hidden Markov Models".

HEAM: High-Efficiency Approximate Multiplier Optimization for Deep Neural Networks

Multi-Scale Progressive Fusion Network for Single Image Deraining

This repository contains codes of ICCV2021 paper: SO-Pose: Exploiting Self-Occlusion for Direct 6D Pose Estimation

Code Repository for The Kaggle Book, Published by Packt Publishing

PyTorch common framework to accelerate network implementation, training and validation

Air Pollution Prediction System using Linear Regression and ANN

Frequency Domain Image Translation: More Photo-realistic, Better Identity-preserving

Python library containing BART query generation and BERT-based Siamese models for neural retrieval.

Incorporating Transformer and LSTM to Kalman Filter with EM algorithm

🐥A PyTorch implementation of OpenAI's finetuned transformer language model with a script to import the weights pre-trained by OpenAI

Identify the emotion of multiple speakers in an Audio Segment