NAS-Count: Counting-by-Density with Neural Architecture Search
Yutao Hu, Xiaolong Jiang, Xuhui Liu, Baochang Zhang, Jungong Han,, Xianbin Cao, David Doermann

TL;DR
This paper introduces NAS-Count, an automated neural architecture search method for crowd counting, resulting in an end-to-end encoder-decoder model that outperforms existing hand-designed models on multiple datasets.
Contribution
The paper presents AMSNet, a novel NAS-designed crowd counting architecture with a counting-specific search space and a new scale pyramid pooling loss, advancing automation in model design.
Findings
AMSNet achieves state-of-the-art results on four datasets.
The NAS approach outperforms hand-designed models.
The scale pyramid pooling loss improves multi-scale feature learning.
Abstract
Most of the recent advances in crowd counting have evolved from hand-designed density estimation networks, where multi-scale features are leveraged to address the scale variation problem, but at the expense of demanding design efforts. In this work, we automate the design of counting models with Neural Architecture Search (NAS) and introduce an end-to-end searched encoder-decoder architecture, Automatic Multi-Scale Network (AMSNet). Specifically, we utilize a counting-specific two-level search space. The encoder and decoder in AMSNet are composed of different cells discovered from micro-level search, while the multi-path architecture is explored through macro-level search. To solve the pixel-level isolation issue in MSE loss, AMSNet is optimized with an auto-searched Scale Pyramid Pooling Loss (SPPLoss) that supervises the multi-scale structural information. Extensive experiments on…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVideo Surveillance and Tracking Methods · Anomaly Detection Techniques and Applications · Human Mobility and Location-Based Analysis
MethodsSigmoid Activation · Tanh Activation · Softmax · Long Short-Term Memory
