
Overview

Beta R-CNN: Looking into Pedestrian Detection from Another Perspective

This is the PyTorch implementation of our paper "Beta R-CNN: Looking into Pedestrian Detection from Another Perspective", published at NeurIPS 2020.

Our method aims at detecting highly occluded and highly overlapped instances in crowded scenes, especially for pedestrian detection.

The code will be released here. Since the experiments were conducted with an internal framework, we need some time to rewrite and clean the code. We will release the complete code soon.

Abstract

Recently significant progress has been made in pedestrian detection, but it remains challenging to achieve high performance in occluded and crowded scenes. It could be mostly attributed to the widely used representation of pedestrians, i.e., the 2D axis-aligned bounding box, which just describes the approximate location and size of the object. A bounding box models the object as a uniform distribution within the boundary, making pedestrians indistinguishable in occluded and crowded scenes due to much noise. To eliminate the problem, we propose a novel representation based on the 2D beta distribution, named Beta Representation. It pictures a pedestrian by explicitly constructing the relationship between the full-body and visible boxes, and emphasizes the center of visual mass by assigning different probability values to pixels. As a result, Beta Representation is much better for distinguishing highly-overlapped instances in crowded scenes with a new NMS strategy named BetaNMS. What's more, to fully exploit Beta Representation, a novel pipeline Beta R-CNN equipped with BetaHead and BetaMask is proposed, leading to high detection performance in occluded and crowded scenes.
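To make the Beta Representation idea more concrete, below is a minimal, illustrative sketch (not the authors' code) of how a 2D beta distribution can assign per-pixel probability values inside a full-body box. The function name `beta_pixel_map` and the parameter values are hypothetical; in the paper, the beta parameters are tied to the relationship between the full-body and visible boxes so that probability mass concentrates on the visible region rather than being uniform over the box.

```python
# Illustrative sketch only: per-pixel probabilities from a separable 2D beta
# distribution over a full-body box. Parameter values here are hypothetical,
# not the fitting procedure used in the paper.
import numpy as np
from scipy.stats import beta


def beta_pixel_map(h, w, a_y=2.0, b_y=2.0, a_x=2.0, b_x=2.0):
    """Return an (h, w) probability map for a box of height h and width w."""
    ys = (np.arange(h) + 0.5) / h        # normalized pixel centers along height
    xs = (np.arange(w) + 0.5) / w        # normalized pixel centers along width
    py = beta.pdf(ys, a_y, b_y)          # 1D beta density along the vertical axis
    px = beta.pdf(xs, a_x, b_x)          # 1D beta density along the horizontal axis
    m = np.outer(py, px)                 # separable 2D density over the box
    return m / m.sum()                   # normalize so the map sums to 1


# Example: a pedestrian whose right half is occluded could be given a map
# skewed toward the visible left side, e.g. beta_pixel_map(64, 32, a_x=2, b_x=5).
```

Under this view, two detections are compared as distributions rather than as uniform boxes, which is what the BetaNMS strategy mentioned in the abstract exploits when deciding whether to suppress one of them.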

Method

The network structure and some visualization results are shown here:

[Figure: Beta R-CNN network structure]

[Figure: visualization results]

Citation

@article{BetaRCNN,
  title={Beta R-CNN: Looking into Pedestrian Detection from Another Perspective},
  author={Xu, Zixuan and Li, Banghuai and Yuan, Ye and Dang, Anhong},
  journal={Advances in Neural Information Processing Systems},
  volume={33},
  year={2020}
}

Contact

If you have any questions, please do not hesitate to contact Zixuan Xu ([email protected]).
