Code for "Contextual Non-Local Alignment over Full-Scale Representation for Text-Based Person Search"

Last update: Dec 03, 2022

Overview

Contextual Non-Local Alignment over Full-Scale Representation for Text-Based Person Search

This is an implementation for our paper Contextual Non-Local Alignment over Full-Scale Representation for Text-Based Person Search. The code is modified from Github repositoty "pytorch implementation for ECCV2018 paper Deep Cross-Modal Projection Learning for Image-Text Matching".

Requirement

Python 3.7
Pytorch 1.0.0 & torchvision 0.2.1
numpy
matplotlib (not necessary unless the need for the result figure)
scipy 1.2.1
pytorch_transformers

Usage

Data Preparation

Please download CUHK-PEDES dataset .
Put reid_raw.json under project_directory/data/
run data.sh
Copy files test_reid.json, train_reid.json and val_reid.json under CUHK-PEDES/data/ to project_directory/data/processed_data/
Download pretrained Resnet50 model, bert-base-uncased model and vocabulary to project_directory/pretrained/

Training & Testing

You should firstly change the parameter BASE_ROOT to your current directory and IMAGE_DIR to the directory of CUHK-PEDES dataset. Run command sh scripts/train.sh to train the model. Run command sh scripts/test.sh to evaluate the model.

Code for "Contextual Non-Local Alignment over Full-Scale Representation for Text-Based Person Search"

Related tags

Overview

Contextual Non-Local Alignment over Full-Scale Representation for Text-Based Person Search

Requirement

Usage

Data Preparation

Training & Testing

Model Framework

Model Performance

Owner

Tencent YouTu Research

AdamW optimizer for bfloat16 models in pytorch.

Contrastive Learning of Structured World Models

Code for approximate graph reduction techniques for cardinality-based DSFM, from paper

GUI for TOAD-GAN, a PCG-ML algorithm for Token-based Super Mario Bros. Levels.

Implementation for Curriculum DeepSDF

Digital Twin Mobility Profiling: A Spatio-Temporal Graph Learning Approach

[ WSDM '22 ] On Sampling Collaborative Filtering Datasets

LogAvgExp - Pytorch Implementation of LogAvgExp

This repository is to support contributions for tools for the Project CodeNet dataset hosted in DAX

Code for ICCV 2021 paper "Distilling Holistic Knowledge with Graph Neural Networks"

GLNet for Memory-Efficient Segmentation of Ultra-High Resolution Images

Computer vision - fun segmentation experience using classic and deep tools :)

The official PyTorch code for NeurIPS 2021 ML4AD Paper, "Does Thermal data make the detection systems more reliable?"

ChatBot-Pytorch - A GPT-2 ChatBot implemented using Pytorch and Huggingface-transformers

An Implementation of Transformer in Transformer in TensorFlow for image classification, attention inside local patches

Official code repository for A Simple Long-Tailed Rocognition Baseline via Vision-Language Model.

PyTorch Implementation of Small Lesion Segmentation in Brain MRIs with Subpixel Embedding (ORAL, MICCAIW 2021)

Physical Anomalous Trajectory or Motion (PHANTOM) Dataset

Maximum Spatial Perturbation for Image-to-Image Translation (Official Implementation)

PyTorch implementation(s) of various ResNet models from Twitch streams.