Bayesian Multi-Scale Neural Network for Crowd Counting
Abhinav Sagar

TL;DR
This paper introduces a Bayesian multi-scale neural network architecture for crowd counting that effectively handles scale and perspective variations, providing accurate density maps and uncertainty estimates in congested scenes.
Contribution
The work presents a novel deep learning model with a Perspective-aware Aggregation Module and Bayesian inference for improved crowd counting and uncertainty quantification.
Findings
Achieves state-of-the-art performance on benchmark datasets
Provides reliable uncertainty estimates for crowd counts
Effectively handles scale and perspective variations
Abstract
Crowd counting is a challenging yet critical task in computer vision with applications ranging from public safety to urban planning. Recent advances using Convolutional Neural Networks (CNNs) that estimate density maps have shown significant success. However, accurately counting individuals in highly congested scenes remains an open problem due to severe occlusions, scale variations, and perspective distortions, where people appear at drastically different sizes across the image. In this work, we propose a novel deep learning architecture that effectively addresses these challenges. Our network integrates a ResNet-based feature extractor for capturing rich hierarchical representations, followed by a downsampling block employing dilated convolutions to preserve spatial resolution while expanding the receptive field. An upsampling block using transposed convolutions reconstructs the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVideo Surveillance and Tracking Methods · Anomaly Detection Techniques and Applications · Human Pose and Action Recognition
MethodsResidual Block · Convolution · Batch Normalization · Average Pooling · 1x1 Convolution · Residual Connection · Global Average Pooling · Kaiming Initialization · *Communicated@Fast*How Do I Communicate to Expedia? · Max Pooling
