An Improved Dilated Convolutional Network for Herd Counting in Crowded Scenes
Soufien Hamrouni, Hakim Ghazzai, Hamid Menouar, Yahya Massoud

TL;DR
This paper introduces an improved crowd counting model using a dual deep learning architecture with dilated convolutions and genetic algorithm optimization, achieving faster convergence and higher accuracy in congested scenes.
Contribution
The paper presents a novel two-part CNN system with optimized dilated rates for crowd counting, outperforming existing methods in speed and accuracy.
Findings
30% faster convergence than state-of-the-art
20% lower MAE on Shanghai dataset
Effective in highly congested scenes
Abstract
Crowd management technologies that leverage computer vision are widespread in contemporary times. There exists many security-related applications of these methods, including, but not limited to: following the flow of an array of people and monitoring large gatherings. In this paper, we propose an accurate monitoring system composed of two concatenated convolutional deep learning architectures. The first part called Front-end, is responsible for converting bi-dimensional signals and delivering high-level features. The second part, called the Back-end, is a dilated Convolutional Neural Network (CNN) used to replace pooling layers. It is responsible for enlarging the receptive field of the whole network and converting the descriptors provided by the first network to a saliency map that will be utilized to estimate the number of people in highly congested images. We also propose to utilize…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
