PS-ARM: An End-to-End Attention-aware Relation Mixer Network for Person Search
Mustansar Fiaz, Hisham Cholakkal, Sanath Narayan, Rao Muhammad Anwer,, and Fahad Shahbaz Khan

TL;DR
This paper introduces PS-ARM, an attention-aware relation mixer network that enhances person search by capturing discriminative features and improving robustness against appearance variations and occlusions, achieving state-of-the-art results.
Contribution
The paper proposes a novel ARM module with spatial and channel-wise relation mixing and spatio-channel attention, integrated into Faster R-CNN for improved person search performance.
Findings
Achieves state-of-the-art results on CUHKSYSU and PRW datasets.
Gains 5% mAP over SeqNet on PRW dataset.
Operates at a comparable speed to existing methods.
Abstract
Person search is a challenging problem with various real-world applications, that aims at joint person detection and re-identification of a query person from uncropped gallery images. Although, the previous study focuses on rich feature information learning, it is still hard to retrieve the query person due to the occurrence of appearance deformations and background distractors. In this paper, we propose a novel attention-aware relation mixer (ARM) module for person search, which exploits the global relation between different local regions within RoI of a person and make it robust against various appearance deformations and occlusion. The proposed ARM is composed of a relation mixer block and a spatio-channel attention layer. The relation mixer block introduces a spatially attended spatial mixing and a channel-wise attended channel mixing for effectively capturing discriminative…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVideo Surveillance and Tracking Methods · Face recognition and analysis · Human Mobility and Location-Based Analysis
MethodsConvolution · Softmax · RoIPool · Region Proposal Network · Faster R-CNN
