Global-Local Context Network for Person Search

Last update: Oct 17, 2022

Related tags

Overview

Global-Local Context Network for Person Search

Abstract:

Person search aims to jointly localize and identify a query person from natural, uncropped images, which has been actively studied in the computer vision community over the past few years. In this paper, we delve into the rich context information globally and locally surrounding the target person, which we refer to scene and group context,respectively. Unlike previous works that treat the two types of context individually, we exploit them in a unified global-local context network (GLCNet) with the intuitive aim of feature enhancement. Specifically, re-ID embeddings and context features are enhanced simultaneously in a multi-stage fashion, ultimately leading to enhanced, discriminative features for person search. We conduct the experiments on two person search benchmarks (i.e., CUHK-SYSU and PRW) as well as extend our approach to a more challenging setting (i.e., character search on MovieNet). Extensive experimental results demonstrate the consistent improvement of the proposed GLCNet over the state-of-the-art methods on the three datasets.

Overall architecture of our GLCNet:

Performance

Datasets	CUHK-SYSU	CUHK-SYSU	PRW	PRW
Methods	mAP	top-1	mAP	top-1
OIM	75.5	78.7	21.3	49.4
NAE+	92.1	92.9	44.0	81.1
TCTS	93.9	95.1	46.8	87.5
AlignPS+	94.0	94.5	46.1	82.1
SeqNet+CBGM	94.8	95.7	47.6	87.6
GLCNet	95.7	96.3	46.9	85.1
GLCNet+CBGM	96.0	96.3	47.6	88.0

Different gallery size on CUHK-SYSU:

Qualitative Results:

Train

sh ./run_${DATASET}.sh

Test

sh ./test_${DATASET}.sh

Inference

Run the demo.py to make inference on given images. GLCNet runs at 10.3 fps on a single Tesla V100 GPU with batch_size 3.

MovieNet-CS

To extend person search framework to a more challenging task, i.e., character search (CS). We borrow the character detection and ID annotations from the MovieNet dataset to organize MovieNet-CS, and set different levels of training set and different gallery size same as CUHK-SYSU. MovieNet-CS is saved exactly the same format and structure as CUHK-SYSU, which could be of great convenience to further research and experiments. If you want to use MovieNet-CS, please download movie frames on the official website of MovieNet and our reorganized annotations here(TBD).

Acknowledgement

Thanks to the solid codebase from SeqNet.

Citation

@ARTICLE{2021arXiv211202500Z,
    author   = {Peng Zheng and
                Jie Qin and
                Yichao Yan and
                Shengcai Liao and
                Bingbing Ni and
                Xiaogang Cheng and
                Ling Shao},
    title    = {Global-Local Context Network for Person Search},
    journal  = {arXiv e-prints},
    volume   = {abs/2109.00211},
    year     = {2021}
}

Global-Local Context Network for Person Search

Related tags

Overview

Global-Local Context Network for Person Search

Performance

Train

Test

Inference

MovieNet-CS

Acknowledgement

Citation

Owner

Peng Zheng

SCU OlympicsRunning Baseline

An official implementation of "SFNet: Learning Object-aware Semantic Correspondence" (CVPR 2019, TPAMI 2020) in PyTorch.

GUI for TOAD-GAN, a PCG-ML algorithm for Token-based Super Mario Bros. Levels.

Implementation for the EMNLP 2021 paper "Interactive Machine Comprehension with Dynamic Knowledge Graphs".

[CVPR 21] Vectorization and Rasterization: Self-Supervised Learning for Sketch and Handwriting, IEEE Conf. on Computer Vision and Pattern Recognition (CVPR), 2021.

Flexible Option Learning - NeurIPS 2021

Easy-to-use,Modular and Extendible package of deep-learning based CTR models .

A very simple tool to rewrite parameters such as attributes and constants for OPs in ONNX models. Simple Attribute and Constant Modifier for ONNX.

DiffStride: Learning strides in convolutional neural networks

Repository for MeshTalk supplemental material and code once the (already approved) 16 GHS captures our lab will make publicly available are released.

Pytorch Implementation for Dilated Continuous Random Field

Pytorch tutorials for Neural Style transfert

IMBENS: class-imbalanced ensemble learning in Python.

A bare-bones Python library for quality diversity optimization.

Multi-Stage Progressive Image Restoration

A scikit-learn compatible neural network library that wraps PyTorch

TensorFlow (v2.7.0) benchmark results on an M1 Macbook Air 2020 laptop (macOS Monterey v12.1).

Learning from Synthetic Shadows for Shadow Detection and Removal [Inoue+, IEEE TCSVT 2020].

A high-performance anchor-free YOLO. Exceeding yolov3~v5 with ONNX, TensorRT, NCNN, and Openvino supported.

PolyphonicFormer: Unified Query Learning for Depth-aware Video Panoptic Segmentation