Locate, Size and Count: Accurately Resolving People in Dense Crowds via Detection
Deepak Babu Sam, Skand Vishwanath Peri, Mukuntha Narayanan, Sundararaman, Amogh Kamath, R. Venkatesh Babu

TL;DR
This paper presents LSC-CNN, a detection-based framework for dense crowd counting that localizes and sizes individual heads, outperforming density regression methods in accuracy and localization.
Contribution
The paper introduces a novel detection architecture that localizes each person in dense crowds using only point annotations, with size estimation, improving over existing density regression approaches.
Findings
LSC-CNN achieves superior localization accuracy.
The method outperforms density regressors in counting.
Effective head size estimation from point annotations.
Abstract
We introduce a detection framework for dense crowd counting and eliminate the need for the prevalent density regression paradigm. Typical counting models predict crowd density for an image as opposed to detecting every person. These regression methods, in general, fail to localize persons accurate enough for most applications other than counting. Hence, we adopt an architecture that locates every person in the crowd, sizes the spotted heads with bounding box and then counts them. Compared to normal object or face detectors, there exist certain unique challenges in designing such a detection system. Some of them are direct consequences of the huge diversity in dense crowds along with the need to predict boxes contiguously. We solve these issues and develop our LSC-CNN model, which can reliably detect heads of people across sparse to dense crowds. LSC-CNN employs a multi-column…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVideo Surveillance and Tracking Methods · Human Mobility and Location-Based Analysis · Evacuation and Crowd Dynamics
