Redesigning Multi-Scale Neural Network for Crowd Counting
Zhipeng Du, Miaojing Shi, Jiankang Deng, Stefanos Zafeiriou

TL;DR
This paper presents a novel hierarchical multi-scale neural network for crowd counting that effectively merges multi-scale density maps using density experts, pixel-wise gating, and a relative local counting loss, achieving state-of-the-art results.
Contribution
The work introduces a hierarchical mixture of density experts with pixel-wise soft gating and a relative local counting loss for improved crowd counting accuracy.
Findings
Achieves state-of-the-art performance on five public datasets.
Effectively handles perspective distortions and crowd variations.
Outperforms previous multi-scale methods in accuracy.
Abstract
Perspective distortions and crowd variations make crowd counting a challenging task in computer vision. To tackle it, many previous works have used multi-scale architecture in deep neural networks (DNNs). Multi-scale branches can be either directly merged (e.g. by concatenation) or merged through the guidance of proxies (e.g. attentions) in the DNNs. Despite their prevalence, these combination methods are not sophisticated enough to deal with the per-pixel performance discrepancy over multi-scale density maps. In this work, we redesign the multi-scale neural network by introducing a hierarchical mixture of density experts, which hierarchically merges multi-scale density maps for crowd counting. Within the hierarchical structure, an expert competition and collaboration scheme is presented to encourage contributions from all scales; pixel-wise soft gating nets are introduced to provide…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVideo Surveillance and Tracking Methods · Anomaly Detection Techniques and Applications · Human Pose and Action Recognition
