Transformer-Based Dual-Optical Attention Fusion Crowd Head Point Counting and Localization Network
Fei Zhou, Yi Li, Mingqing Zhu

TL;DR
This paper introduces TAPNet, a dual-optical attention fusion network that enhances crowd head counting and localization in complex scenes by integrating infrared data and adaptive feature fusion, outperforming existing methods.
Contribution
The paper proposes a novel dual-optical attention fusion model with adaptive feature decomposition, improving accuracy and robustness in crowd counting under challenging conditions.
Findings
Outperforms existing methods on DroneRGBT and GAIIC2 datasets.
Effective in dense, low-light, and occluded crowd scenes.
Enhanced localization accuracy despite image misalignment.
Abstract
In this paper, the dual-optical attention fusion crowd head point counting model (TAPNet) is proposed to address the problem of the difficulty of accurate counting in complex scenes such as crowd dense occlusion and low light in crowd counting tasks under UAV view. The model designs a dual-optical attention fusion module (DAFP) by introducing complementary information from infrared images to improve the accuracy and robustness of all-day crowd counting. In order to fully utilize different modal information and solve the problem of inaccurate localization caused by systematic misalignment between image pairs, this paper also proposes an adaptive two-optical feature decomposition fusion module (AFDF). In addition, we optimize the training strategy to improve the model robustness through spatial random offset data augmentation. Experiments on two challenging public datasets, DroneRGBT and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVideo Surveillance and Tracking Methods · Fire Detection and Safety Systems · Advanced Neural Network Applications
MethodsSoftmax · Attention Is All You Need
