Code of paper Interact, Embed, and EnlargE (IEEE): Boosting Modality-specific Representations for Multi-Modal Person Re-identification.

Last update: Dec 12, 2022

Related tags

Deep Learning AAAI2022-IEEE-for-MMReID

Overview

Interact, Embed, and EnlargE (IEEE): Boosting Modality-specific Representations for Multi-Modal Person Re-identification

We provide the codes for reproducing result of our paper Interact, Embed, and EnlargE (IEEE): Boosting Modality-specific Representations for Multi-Modal Person Re-identification.

Installation

Basic environments: python3.6, pytorch1.8.0, cuda11.1.
Our codes structure is based on Torchreid. (More details can be found in link: https://github.com/KaiyangZhou/deep-person-reid , you can download the packages according to Torchreid requirements.)

# create environment
cd AAAI2022_IEEE/
conda create --name ieeeReid python=3.6
conda activate ieeeReid

# install dependencies
# make sure `which python` and `which pip` point to the correct path
pip install -r requirements.txt

# install torch and torchvision (select the proper cuda version to suit your machine)
conda install pytorch==1.8.0 torchvision==0.9.0 torchaudio==0.8.0 cudatoolkit=11.1 -c pytorch -c conda-forge

# install torchreid (don't need to re-build it if you modify the source code)
python setup.py develop

Get start

You can use the setting in im_r50_softmax_256x128_amsgrad_RGBNT_ieee_part_margin.yaml to get the results of full IEEE.

python ./scripts/mainMultiModal.py --config-file ./configs/im_r50_softmax_256x128_amsgrad_RGBNT_ieee_part_margin.yaml --seed 40

You can run other methods by using following configuration file:

# MLFN
./configs/im_r50_softmax_256x128_amsgrad_RGBNT_mlfn.yaml

# HACNN
./configs/im_r50_softmax_256x128_amsgrad_RGBNT_hacnn.yaml

# OSNet
./configs/im_r50_softmax_256x128_amsgrad_RGBNT_osnet.yaml

# HAMNet
./configs/im_r50_softmax_256x128_amsgrad_RGBNT_hamnet.yaml

# PFNet
./configs/im_r50_softmax_256x128_amsgrad_RGBNT_hamnet.yaml

# full IEEE
./configs/im_r50_softmax_256x128_amsgrad_RGBNT_ieee_part_margin.yaml

Details

The details of our Cross-modal Interacting Module (CIM) and Relation-based Embedding Module (REM) can be found in .\torchreid\models\ieee3modalPart.py. The design of Multi-modal Margin Loss(3M loss) can be found in .\torchreid\losses\multi_modal_margin_loss_new.py.

Ablation study settings.

You can control these two modules and the loss by change the corresponding codes.

Cross-modal Interacting Module (CIM) and Relation-based Embedding Module (REM)

# change the code in .\torchreid\models\ieee3modalPart.py

class IEEE3modalPart(nn.Module):
    def __init__(···
    ):
        modal_number = 3
        fc_dims = [128]
        pooling_dims = 768
        super(IEEE3modalPart, self).__init__()
        self.loss = loss
        self.parts = 6
        
        self.backbone = nn.ModuleList(···
        )
		
		  # using Cross-modal Interacting Module (CIM)
        self.interaction = True
        # using channel attention in CIM
        self.attention = True
        
        # using Relation-based Embedding Module (REM)
        self.using_REM = True
        
        ···

Multi-modal Margin Loss(3M loss)

# change the code in .\configs\your_config_file.yaml

# using Multi-modal Margin Loss(3M loss), you can change the margin by modify the parameter of "ieee_margin".
···
loss:
  name: 'margin'
  softmax:
    label_smooth: True
  ieee_margin: 1
  weight_m: 1.0
  weight_x: 1.0
···

# using only CE loss
···
loss:
  name: 'softmax'
  softmax:
    label_smooth: True
  weight_x: 1.0
···

Code of paper Interact, Embed, and EnlargE (IEEE): Boosting Modality-specific Representations for Multi-Modal Person Re-identification.

Related tags

Overview

Interact, Embed, and EnlargE (IEEE): Boosting Modality-specific Representations for Multi-Modal Person Re-identification

Installation

Get start

Details

Owner

Alternatives to Deep Neural Networks for Function Approximations in Finance

Delta Conformity Sociopatterns Analysis - Delta Conformity Sociopatterns Analysis

CL-Gym: Full-Featured PyTorch Library for Continual Learning

Code for our EMNLP 2021 paper "Learning Kernel-Smoothed Machine Translation with Retrieved Examples"

Learning from Guided Play: A Scheduled Hierarchical Approach for Improving Exploration in Adversarial Imitation Learning Source Code

TextWorld is a sandbox learning environment for the training and evaluation of reinforcement learning (RL) agents on text-based games.

Pytorch implement of 'Unmixing based PAN guided fusion network for hyperspectral imagery'

A pre-trained language model for social media text in Spanish

The open-source and free to use Python package miseval was developed to establish a standardized medical image segmentation evaluation procedure

[CVPR-2021] UnrealPerson: An adaptive pipeline for costless person re-identification

RM Operation can equivalently convert ResNet to VGG, which is better for pruning; and can help RepVGG perform better when the depth is large.

A brand new hub for Scene Graph Generation methods based on MMdetection (2021). The pipeline of from detection, scene graph generation to downstream tasks (e.g., image cpationing) is supported. Pytorch version implementation of HetH (ECCV 2020) and TopicSG (ICCV 2021) is included.

FedML: A Research Library and Benchmark for Federated Machine Learning

A small demonstration of using WebDataset with ImageNet and PyTorch Lightning

Base pretrained models and datasets in pytorch (MNIST, SVHN, CIFAR10, CIFAR100, STL10, AlexNet, VGG16, VGG19, ResNet, Inception, SqueezeNet)

The code repository for "PyCIL: A Python Toolbox for Class-Incremental Learning" in PyTorch.

Reproducing Results from A Hybrid Approach to Targeting Social Assistance

Predict stock movement with Machine Learning and Deep Learning algorithms

Kinetics-Data-Preprocessing

A flexible and extensible framework for gait recognition.

Code of paper Interact, Embed, and EnlargE (IEEE): Boosting Modality-specific Representations for Multi-Modal Person Re-identification.

Related tags

Overview

Interact, Embed, and EnlargE (IEEE): Boosting Modality-specific Representations for Multi-Modal Person Re-identification

Installation

Get start

Details

Owner

Alternatives to Deep Neural Networks for Function Approximations in Finance

Delta Conformity Sociopatterns Analysis - Delta Conformity Sociopatterns Analysis

CL-Gym: Full-Featured PyTorch Library for Continual Learning

Code for our EMNLP 2021 paper "Learning Kernel-Smoothed Machine Translation with Retrieved Examples"

Learning from Guided Play: A Scheduled Hierarchical Approach for Improving Exploration in Adversarial Imitation Learning Source Code

​TextWorld is a sandbox learning environment for the training and evaluation of reinforcement learning (RL) agents on text-based games.

Pytorch implement of 'Unmixing based PAN guided fusion network for hyperspectral imagery'

A pre-trained language model for social media text in Spanish

The open-source and free to use Python package miseval was developed to establish a standardized medical image segmentation evaluation procedure

[CVPR-2021] UnrealPerson: An adaptive pipeline for costless person re-identification

RM Operation can equivalently convert ResNet to VGG, which is better for pruning; and can help RepVGG perform better when the depth is large.

A brand new hub for Scene Graph Generation methods based on MMdetection (2021). The pipeline of from detection, scene graph generation to downstream tasks (e.g., image cpationing) is supported. Pytorch version implementation of HetH (ECCV 2020) and TopicSG (ICCV 2021) is included.

FedML: A Research Library and Benchmark for Federated Machine Learning

A small demonstration of using WebDataset with ImageNet and PyTorch Lightning

Base pretrained models and datasets in pytorch (MNIST, SVHN, CIFAR10, CIFAR100, STL10, AlexNet, VGG16, VGG19, ResNet, Inception, SqueezeNet)

The code repository for "PyCIL: A Python Toolbox for Class-Incremental Learning" in PyTorch.

Reproducing Results from A Hybrid Approach to Targeting Social Assistance

Predict stock movement with Machine Learning and Deep Learning algorithms

Kinetics-Data-Preprocessing

A flexible and extensible framework for gait recognition.

TextWorld is a sandbox learning environment for the training and evaluation of reinforcement learning (RL) agents on text-based games.