Activity image-based video retrieval

Last update: Oct 21, 2021

Related tags

Overview

Cross-modal-retrieval

Our approach is focus on Activity Image-to-Video Retrieval (AIVR) task. The compared methods are state-of-the-art single modality hashing methods, multiple modalities hashing methods and cross-modal retrieval methods.

Single modality hashing methods

Some hashing baselines for image retrieval can be found in https://github.com/willard-yuan/hashing-baseline-for-image-retrieval.

Multiple modalities hashing methods

More details refer to https://github.com/czxxjtu/Hash-Learning.github.io. Some details about hashing methods are in hashing-baseline-for-image-retrieval-master folder.

Cross-modal retrieval methods

The compared cross-modal retrieval methods are according to the paper:

Datasets

THUMOS'14 Dataset:

https://pan.baidu.com/s/1H6c8nh_Hs7gVkhESpxtvAg 提取码：qp26

ActivityNet Dataset:

https://pan.baidu.com/s/1P0jRecEmplCPaTPwFoOpVQ 提取码：pnw9

Bibtex

When using images from our dataset, please cite our paper using the following BibTeX[PDF]：

@article{pba2020,
author    = {Ruicong Xu and Li Niu and Jianfu Zhang and Liqing Zhang},
title     = {A Proposal-based Approach for Activity Image-to-Video Retrieval},
journal   = {AAAI},
year      = {2020}}

Activity image-based video retrieval

Related tags

Overview

Cross-modal-retrieval

Single modality hashing methods

Multiple modalities hashing methods

Cross-modal retrieval methods

Datasets

THUMOS'14 Dataset:

ActivityNet Dataset:

Bibtex

Owner

BCMI

Traffic4D: Single View Reconstruction of Repetitious Activity Using Longitudinal Self-Supervision

It is a simple library to speed up CLIP inference up to 3x (K80 GPU)

A general framework for inferring CNNs efficiently. Reduce the inference latency of MobileNet-V3 by 1.3x on an iPhone XS Max without sacrificing accuracy.

Python script that allows you to automatically setup your Growtopia server.

[Link]mareteutral - pars tradg wth M []

RealFormer-Pytorch Implementation of RealFormer using pytorch

Implementation of "A Deep Learning Loss Function based on Auditory Power Compression for Speech Enhancement" by pytorch

Experiments with differentiable stacks and queues in PyTorch

Code of the paper "Performance-Efficiency Trade-offs in Unsupervised Pre-training for Speech Recognition"

HDMapNet: A Local Semantic Map Learning and Evaluation Framework

The official MegEngine implementation of the ICCV 2021 paper: GyroFlow: Gyroscope-Guided Unsupervised Optical Flow Learning

Joint Gaussian Graphical Model Estimation: A Survey

Example how to deploy deep learning model with aiohttp.

An implementation of Geoffrey Hinton's paper "How to represent part-whole hierarchies in a neural network" in Pytorch.

DeRF: Decomposed Radiance Fields

Collection of tasks for fast prototyping, baselining, finetuning and solving problems with deep learning.

This repository is a series of notebooks that show solutions for the projects at Dataquest.io.

A Python toolbox to create adversarial examples that fool neural networks in PyTorch, TensorFlow, and JAX

MonoRCNN is a monocular 3D object detection method for automonous driving

Deep Learning: Architectures & Methods Project: Deep Learning for Audio Super-Resolution