Part-Guided Attention Learning for Vehicle Instance Retrieval
Xinyu Zhang, Rufeng Zhang, Jiewei Cao, Dong Gong, Mingyu You, Chunhua, Shen

TL;DR
This paper introduces PGAN, a novel vehicle retrieval method combining part-guided bottom-up and top-down attention to improve discriminative feature learning, achieving state-of-the-art results on large datasets.
Contribution
The paper proposes a Part-Guided Attention Network that detects vehicle parts and salient regions, adaptively weights their importance, and integrates global and part features for enhanced retrieval accuracy.
Findings
Achieves new state-of-the-art performance on four large-scale datasets.
Effectively combines bottom-up and top-down attention mechanisms.
Improves discriminative feature learning for vehicle instance retrieval.
Abstract
Vehicle instance retrieval often requires one to recognize the fine-grained visual differences between vehicles. Besides the holistic appearance of vehicles which is easily affected by the viewpoint variation and distortion, vehicle parts also provide crucial cues to differentiate near-identical vehicles. Motivated by these observations, we introduce a Part-Guided Attention Network (PGAN) to pinpoint the prominent part regions and effectively combine the global and part information for discriminative feature learning. PGAN first detects the locations of different part components and salient regions regardless of the vehicle identity, which serve as the bottom-up attention to narrow down the possible searching regions. To estimate the importance of detected parts, we propose a Part Attention Module (PAM) to adaptively locate the most discriminative regions with high-attention weights and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Advanced Image and Video Retrieval Techniques · Multimodal Machine Learning Applications
