git《Beta R-CNN: Looking into Pedestrian Detection from Another Perspective》(NeurIPS 2020) GitHub:[fig3]

Last update: Sep 08, 2021

Related tags

Overview

Beta R-CNN: Looking into Pedestrian Detection from Another Perspective

This is the pytorch implementation of our paper "[Beta R-CNN: Looking into Pedestrian Detection from Another Perspective]", published in Neurips 2020.

Our method aiming at detecting highly occluded and highly-overlapped instances in crowded scenes especially for pedestrian detection.

Codes are prepared to release here. Due to the experiments are conducted with internal framework, we need some time to rewrite and clean the code. We will release the complete code soon.

Abstract

Recently significant progress has been made in pedestrian detection, but it remains challenging to achieve high performance in occluded and crowded scenes. It could be mostly attributed to the widely used representation of pedestrians, i.e., 2Daxis-aligned bounding box, which just describes the approximate location and size of the object. Bounding box models the object as a uniform distribution within the boundary, making pedestrians indistinguishable in occluded and crowded scenes due to much noise. To eliminate the problem, we propose a novel representation based on 2D beta distribution, named Beta Representation. It pictures a pedestrian by explicitly constructing the relationship between full-body and visible boxes,and emphasizes the center of visual mass by assigning different probability values to pixels. As a result, Beta Representation is much better for distinguishing highly-overlapped instances in crowded scenes with a new NMS strategy named BetaNMS. What’s more, to fully exploit Beta Representation, a novel pipeline Beta R-CNN equipped with BetaHead and BetaMask is proposed, leading to high detection performance in occluded and crowded scenes.

Method

The network structure and some visualization results are shown here:

Citation

@article{BetaRCNN,
  title={Beta R-CNN: Looking into Pedestrian Detection from Another Perspective},
  author={Xu, Zixuan and Li, Banghuai and Yuan, Ye and Dang, Anhong},
  journal={Advances in Neural Information Processing Systems},
  volume={33},
  year={2020}
}

Contact

If you have any questions, please do not hesitate to contact Zixuan Xu ([email protected]).

git《Beta R-CNN: Looking into Pedestrian Detection from Another Perspective》(NeurIPS 2020) GitHub:[fig3]

Related tags

Overview

Beta R-CNN: Looking into Pedestrian Detection from Another Perspective

Abstract

Method

Citation

Contact

Owner

PyTorch implementations of Top-N recommendation, collaborative filtering recommenders.

PyTorch implementation of Decoupling Value and Policy for Generalization in Reinforcement Learning

Digital Twin Mobility Profiling: A Spatio-Temporal Graph Learning Approach

GNN4Traffic - This is the repository for the collection of Graph Neural Network for Traffic Forecasting

A PyTorch implementation of the paper Mixup: Beyond Empirical Risk Minimization in PyTorch

FSL-Mate: A collection of resources for few-shot learning (FSL).

Pytorch Implementation of Spiking Neural Networks Calibration, ICML 2021

Accuracy Aligned. Concise Implementation of Swin Transformer

Official PyTorch implementation of the ICRA 2021 paper: Adversarial Differentiable Data Augmentation for Autonomous Systems.

Implementation of ToeplitzLDA for spatiotemporal stationary time series data.

Chinese Advertisement Board Identification(Pytorch)

Implementation of Bidirectional Recurrent Independent Mechanisms (Learning to Combine Top-Down and Bottom-Up Signals in Recurrent Neural Networks with Attention over Modules)

Awesome-google-colab - Google Colaboratory Notebooks and Repositories

Multi-modal co-attention for drug-target interaction annotation and Its Application to SARS-CoV-2

Towhee is a flexible machine learning framework currently focused on computing deep learning embeddings over unstructured data.

Open-source code for Generic Grouping Network (GGN, CVPR 2022)

TalkingHead-1KH is a talking-head dataset consisting of YouTube videos

A Neural Net Training Interface on TensorFlow, with focus on speed + flexibility

Platform-agnostic AI Framework 🔥

Official code release for "GRAF: Generative Radiance Fields for 3D-Aware Image Synthesis"