Fashion Image Retrieval with Multi-Granular Alignment
Jinkuan Zhu, Hao Huang, Qiao Deng, Xiyao Li

TL;DR
This paper introduces a novel fashion image retrieval approach that combines global and fine-grained features through multi-granular alignment, significantly improving retrieval accuracy on public datasets.
Contribution
The paper proposes a new method called Multi-Granular Alignment (MGA) that captures detailed clothing patterns and aligns features at multiple granularities, enhancing retrieval performance.
Findings
MGA outperforms state-of-the-art methods by 1.8% and 0.6% in R@1 on DeepFashion sub-tasks.
The proposed FGA effectively captures fine-grained clothing details.
ATA enables coarse-to-fine feature alignment, improving retrieval accuracy.
Abstract
Fashion image retrieval task aims to search relevant clothing items of a query image from the gallery. The previous recipes focus on designing different distance-based loss functions, pulling relevant pairs to be close and pushing irrelevant images apart. However, these methods ignore fine-grained features (e.g. neckband, cuff) of clothing images. In this paper, we propose a novel fashion image retrieval method leveraging both global and fine-grained features, dubbed Multi-Granular Alignment (MGA). Specifically, we design a Fine-Granular Aggregator(FGA) to capture and aggregate detailed patterns. Then we propose Attention-based Token Alignment (ATA) to align image features at the multi-granular level in a coarse-to-fine manner. To prove the effectiveness of our proposed method, we conduct experiments on two sub-tasks (In-Shop & Consumer2Shop) of the public fashion datasets DeepFashion.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · 3D Shape Modeling and Analysis · Image Retrieval and Classification Techniques
MethodsALIGN
