Code for AAAI 2021 paper: Sequential End-to-end Network for Efficient Person Search

Last update: Dec 31, 2022

Related tags

Overview

This repository hosts the source code of our paper: [AAAI 2021]Sequential End-to-end Network for Efficient Person Search. SeqNet achieves the state-of-the-art performance on two widely used benchmarks and runs at 11.5 FPS on a single GPU. You can find a brief Chinese introduction at zhihu.

Performance profile:

Dataset	mAP	Top-1	Model
CUHK-SYSU	94.8	95.7	model
PRW	47.6	87.6	model

The network structure is simple and suitable as baseline:

Installation

Run pip install -r requirements.txt in the root directory of the project.

Quick Start

Let's say $ROOT is the root directory.

Download CUHK-SYSU and PRW datasets, and unzip them to $ROOT/data

$ROOT/data
├── CUHK-SYSU
└── PRW

Following the link in the above table, download our pretrained model to anywhere you like, e.g., $ROOT/exp_cuhk
Evaluate its performance by specifing the paths of checkpoint and corresponding configuration file.

python train.py --cfg $ROOT/exp_cuhk/config.yaml --eval --ckpt $ROOT/exp_cuhk/epoch_19.pth

Training

Pick one configuration file you like in $ROOT/configs, and run with it.

python train.py --cfg configs/cuhk_sysu.yaml

Note: At present, our script only supports single GPU training, but distributed training will be also supported in future. By default, the batch size and the learning rate during training are set to 5 and 0.003 respectively, which requires about 28GB of GPU memory. If your GPU cannot provide the required memory, try smaller batch size and learning rate (performance may degrade). Specifically, your setting should follow the Linear Scaling Rule: When the minibatch size is multiplied by k, multiply the learning rate by k. For example:

python train.py --cfg configs/cuhk_sysu.yaml INPUT.BATCH_SIZE_TRAIN 2 SOLVER.BASE_LR 0.0012

Tip: If the training process stops unexpectedly, you can resume from the specified checkpoint.

python train.py --cfg configs/cuhk_sysu.yaml --resume --ckpt /path/to/your/checkpoint

Test

Suppose the output directory is $ROOT/exp_cuhk. Test the trained model:

python train.py --cfg $ROOT/exp_cuhk/config.yaml --eval --ckpt $ROOT/exp_cuhk/epoch_19.pth

Test with Context Bipartite Graph Matching algorithm:

python train.py --cfg $ROOT/exp_cuhk/config.yaml --eval --ckpt $ROOT/exp_cuhk/epoch_19.pth EVAL_USE_CBGM True

Test the upper bound of the person search performance by using GT boxes:

python train.py --cfg $ROOT/exp_cuhk/config.yaml --eval --ckpt $ROOT/exp_cuhk/epoch_19.pth EVAL_USE_GT True

Pull Request

Pull request is welcomed! Before submitting a PR, DO NOT forget to run ./dev/linter.sh that provides syntax checking and code style optimation.

Citation

@inproceedings{li2021sequential,
  title={Sequential End-to-end Network for Efficient Person Search},
  author={Li, Zhengjia and Miao, Duoqian},
  booktitle={Proceedings of the AAAI conference on artificial intelligence},
  year={2021}
}

Code for AAAI 2021 paper: Sequential End-to-end Network for Efficient Person Search

Related tags

Overview

Installation

Quick Start

Training

Test

Pull Request

Citation

Owner

Zj Li

Unofficial implementation of "TableNet: Deep Learning model for end-to-end Table detection and Tabular data extraction from Scanned Document Images"

SceneCollisionNet This repo contains the code for "Object Rearrangement Using Learned Implicit Collision Functions", an ICRA 2021 paper. For more info

Code for the "Sensing leg movement enhances wearable monitoring of energy expenditure" paper.

TextBoxes re-implement using tensorflow

Motion detector, Full body detection, Upper body detection, Cat face detection, Smile detection, Face detection (haar cascade), Silverware detection, Face detection (lbp), and Sending email notifications

Face Detection with DLIB

Pre-Recognize Library - library with algorithms for improving OCR quality.

Code related to "Have Your Text and Use It Too! End-to-End Neural Data-to-Text Generation with Semantic Fidelity" paper

OpenCVを用いたカメラキャリブレーションのサンプルです。2021/06/21時点でPython実装のある3種類(通常カメラ向け、魚眼レンズ向け(fisheyeモジュール)、全方位カメラ向け(omnidirモジュール))について用意しています。

SemTorch

Smart computer vision application

Give a solution to recognize MaoYan font.

This repository contains codes on how to handle mouse event using OpenCV

TextField: Learning A Deep Direction Field for Irregular Scene Text Detection (TIP 2019)

Code for AAAI 2021 paper: Sequential End-to-end Network for Efficient Person Search

Shape Detection - It's a shape detection project with OpenCV and Python.

Code for the paper "DewarpNet: Single-Image Document Unwarping With Stacked 3D and 2D Regression Networks" (ICCV '19)

Thresholding-and-masking-using-OpenCV - Image Thresholding is used for image segmentation

Zoom , GoogleMeets에서 Vtuber 데뷔하기

ISI's Optical Character Recognition (OCR) software for machine-print and handwriting data