MARS: Learning Modality-Agnostic Representation for Scalable Cross-media Retrieval

Last update: Aug 24, 2022

Overview

Introduction

This is the source code of our TCSVT 2021 paper "MARS: Learning Modality-Agnostic Representation for Scalable Cross-media Retrieval". Please cite the following paper if you use our code.

Yunbo Wang and Yuxin Peng, "MARS: Learning Modality-Agnostic Representation for Scalable Cross-media Retrieval", IEEE Transactions on Circuits and Systems for Video Technology (TCSVT), 2021.

Preparation

We use Python 3.7.2, PyTorch 1.1.0, and CUDA 9.0, and evaluate on Ubuntu 16.04.12.

1. Install Anaconda, downloaded from https://repo.anaconda.com/archive, and create a new environment:

    sh Anaconda3-2018.12-Linux-x86_64.sh
    conda create -n MARS python=3.7.2
    conda activate MARS

2. Run the following commands:

    conda install pytorch==1.1.0 torchvision==0.3.0 cudatoolkit=9.0 -c pytorch
    pip install -r requirements.txt
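Once the environment is set up, a quick check that the installed versions match the ones listed in this section can catch mismatches early. The following is a minimal sketch, not part of the released code; the helper names `version_matches` and `check_environment` are hypothetical.

```python
import importlib
import sys

# Versions stated in the Preparation section of this README.
EXPECTED = {"python": "3.7.2", "torch": "1.1.0", "torchvision": "0.3.0"}


def version_matches(found, expected):
    """Compare the first three version components, ignoring a local
    build suffix such as '+cu90'."""
    core = found.split("+")[0]
    return core.split(".")[:3] == expected.split(".")[:3]


def check_environment(expected=EXPECTED):
    """Return {name: (found_version_or_None, matches)} for each entry."""
    report = {}
    py = ".".join(str(part) for part in sys.version_info[:3])
    report["python"] = (py, version_matches(py, expected["python"]))
    for name in ("torch", "torchvision"):
        try:
            module = importlib.import_module(name)
        except ImportError:
            # Package is not installed in the active environment.
            report[name] = (None, False)
            continue
        found = getattr(module, "__version__", "")
        report[name] = (found, version_matches(found, expected[name]))
    return report
```

Calling `check_environment()` inside the activated MARS environment returns, for each package, the version that was found and whether it matches the expected one. Other versions may also work; the README only states the ones that were used.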

Training and evaluation

We use the Wikipedia dataset as an example; its data is placed in ./datasets/Wiki. The XMedia and XMediaNet datasets can be obtained via http://59.108.48.34/tiki/XMediaNet/, and the NUS-WIDE dataset via https://lms.comp.nus.edu.sg/wp-content/uploads/2019/research/nuswide/NUS-WIDE.html.

Run the following command for training and evaluation; the configuration can be found in main_MARS.py.

    python main_MARS.py --datasets wiki --output_shape 128 --batch_size 64 --epochs 50 --lr [1e-4, 5e-4] # for Wikipedia
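If `[1e-4, 5e-4]` is read as a range of learning rates to try (an assumption; the README does not say how the bracket is meant to be interpreted), the runs can be scripted. The sketch below only builds the command lines from the flags shown above and does not launch anything; `build_command` and the candidate values are hypothetical, and it assumes `--lr` accepts a single float.

```python
import shlex


def build_command(lr, dataset="wiki", output_shape=128, batch_size=64, epochs=50):
    """Build the argument list for one run of main_MARS.py.

    Flag names are copied from the command in this README; passing a
    single float to --lr is an assumption.
    """
    return [
        "python", "main_MARS.py",
        "--datasets", dataset,
        "--output_shape", str(output_shape),
        "--batch_size", str(batch_size),
        "--epochs", str(epochs),
        "--lr", str(lr),
    ]


# Hypothetical candidate values inside the stated range.
CANDIDATE_LRS = (1e-4, 2e-4, 5e-4)

for lr in CANDIDATE_LRS:
    cmd = build_command(lr)
    # Print a shell-safe version of the command instead of running it.
    print(" ".join(shlex.quote(arg) for arg in cmd))
```

To actually launch a run, the list returned by `build_command` could be passed to `subprocess.run(cmd, check=True)` from the repository root.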

The common representations can be found in the "features" folder.
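Common representations of this kind are typically scored by ranking one modality against the other and computing mean average precision (MAP). The following is a minimal, dependency-free sketch of that metric under stated assumptions: features are already loaded as lists of floats with one class label each (the file format in "features" is not described here), similarity is cosine, and all function names are hypothetical. It is not the evaluation code shipped in main_MARS.py.

```python
import math


def cosine_similarity(a, b):
    """Cosine similarity of two equal-length vectors (0.0 if either is zero)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def average_precision(query, query_label, gallery, gallery_labels):
    """Average precision of one query against a gallery of the other modality."""
    # Rank gallery items from most to least similar to the query.
    ranked = sorted(
        range(len(gallery)),
        key=lambda i: cosine_similarity(query, gallery[i]),
        reverse=True,
    )
    hits, precision_sum = 0, 0.0
    for rank, idx in enumerate(ranked, start=1):
        if gallery_labels[idx] == query_label:
            hits += 1
            precision_sum += hits / rank  # precision at this relevant rank
    return precision_sum / hits if hits else 0.0


def mean_average_precision(queries, query_labels, gallery, gallery_labels):
    """Mean of the per-query average precisions."""
    scores = [
        average_precision(q, label, gallery, gallery_labels)
        for q, label in zip(queries, query_labels)
    ]
    return sum(scores) / len(scores) if scores else 0.0


# Toy image-to-text example with made-up 2-D features.
image_feats, image_labels = [[1.0, 0.0], [0.0, 1.0]], [0, 1]
text_feats, text_labels = [[0.9, 0.1], [0.2, 0.8], [0.8, 0.3]], [0, 1, 0]
print(mean_average_precision(image_feats, image_labels, text_feats, text_labels))
```

For real feature matrices, a vectorized NumPy version would be much faster; the pure-Python form is used here only to keep the sketch self-contained.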

For any questions, feel free to contact us ([email protected]).

Welcome to our Laboratory Homepage for more information.

