Crowd counting with crowd attention convolutional neural network
Jiwei Chen, Wen Su, Zengfu Wang

TL;DR
This paper introduces CAT-CNN, a novel end-to-end deep learning model that improves crowd counting accuracy by adaptively focusing on human heads using a confidence map, effectively handling scene complexity and scale variation.
Contribution
The paper presents a new crowd counting model that uses a confidence map to guide attention, enhancing density map accuracy and reducing errors caused by scene complexity.
Findings
Outperforms state-of-the-art methods on three challenging datasets.
Effectively reduces misjudgements caused by scene complexity.
Provides a refined density map for accurate crowd estimation.
Abstract
Crowd counting is a challenging problem due to the scene complexity and scale variation. Although deep learning has achieved great improvement in crowd counting, scene complexity affects the judgement of these methods and they usually regard some objects as people mistakenly; causing potentially enormous errors in the crowd counting result. To address the problem, we propose a novel end-to-end model called Crowd Attention Convolutional Neural Network (CAT-CNN). Our CAT-CNN can adaptively assess the importance of a human head at each pixel location by automatically encoding a confidence map. With the guidance of the confidence map, the position of human head in estimated density map gets more attention to encode the final density map, which can avoid enormous misjudgements effectively. The crowd count can be obtained by integrating the final density map. To encode a highly refined…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
