Code for ACM MM 2020 paper "NOH-NMS: Improving Pedestrian Detection by Nearby Objects Hallucination"

Last update: Nov 11, 2022

Related tags

Overview

NOH-NMS: Improving Pedestrian Detection by Nearby Objects Hallucination

The offical implementation for the "NOH-NMS: Improving Pedestrian Detection by Nearby Objects Hallucination" which is published in ACM MM 2020.

We propose Nearby Objects Hallucinator (NOH), which pinpoints the objects nearby each proposal with a Gaussian distribution, together with NOH-NMS, which dynamically eases the suppression for the space that might contain other objects with a high likelihood.

This work has won the first place at the CrowdHuman Challenge, 2020.

This repo is implemented based on detectron2.

Performance

Model	Backbone	AP	Recall	MR	Weights
Faster RCNN	ResNet-50	85.0	87.5	44.5	faster_rcnn_model_final.pth
NOH-NMS	ResNet-50	88.8	92.6	43.7	noh_nms_model_final.pth

Prepare Datasets

Download the CrowdHuman Datasets from http://www.crowdhuman.org/, and then move them under the directory like:

./data/crowdhuman
├── annotations
│   └── annotation_train.odgt
│   └── annotation_val.odgt
├── images
│   └── train
│   └── val

Installation

  cd detectron2
  pip install -e . 
  #or rebuild
  sh build.sh

Quick Start

See GETTING_STARTED.md in detectron2

Acknowledgement

detectron2

Citation

if you find this project useful for your research, please cite:

@inproceedings{zhou2020noh,
  title={NOH-NMS: Improving Pedestrian Detection by Nearby Objects Hallucination},
  author={Zhou, Penghao and Zhou, Chong and Peng, Pai and Du, Junlong and Sun, Xing and Guo, Xiaowei and Huang, Feiyue},
  booktitle={Proceedings of the 28th ACM International Conference on Multimedia},
  pages={1967--1975},
  year={2020}
}

Code for ACM MM 2020 paper "NOH-NMS: Improving Pedestrian Detection by Nearby Objects Hallucination"

Related tags

Overview

NOH-NMS: Improving Pedestrian Detection by Nearby Objects Hallucination

Performance

Prepare Datasets

Installation

Quick Start

Acknowledgement

Citation

Owner

Tencent YouTu Research

AITom is an open-source platform for AI driven cellular electron cryo-tomography analysis.

Principled Detection of Out-of-Distribution Examples in Neural Networks

RealTime Emotion Recognizer for Machine Learning Study Jam's demo

OpenL3: Open-source deep audio and image embeddings

This repository contains the code for TABS, a 3D CNN-Transformer hybrid automated brain tissue segmentation algorithm using T1w structural MRI scans

MonoScene: Monocular 3D Semantic Scene Completion

Cross-Modal Contrastive Learning for Text-to-Image Generation

Implementation of 🦩 Flamingo, state-of-the-art few-shot visual question answering attention net out of Deepmind, in Pytorch

An addon uses SMPL's poses and global translation to drive cartoon character in Blender.

YOLOv3 in PyTorch > ONNX > CoreML > TFLite

Adaptive, interpretable wavelets across domains (NeurIPS 2021)

Code accompanying "Dynamic Neural Relational Inference" from CVPR 2020

Train Dense Passage Retriever (DPR) with a single GPU

Text Summarization - WCN — Weighted Contextual N-gram method for evaluation of Text Summarization

Code repository for paper `Skeleton Merger: an Unsupervised Aligned Keypoint Detector`.

Implementation of CSRL from the AAAI2022 paper: Constraint Sampling Reinforcement Learning: Incorporating Expertise For Faster Learning

Codebase of deep learning models for inferring stability of mRNA molecules

[ICME 2021 Oral] CORE-Text: Improving Scene Text Detection with Contrastive Relational Reasoning

Hypersearch weight debugging and losses tutorial

FAVD: Featherweight Assisted Vulnerability Discovery