Dual Relation Alignment for Composed Image Retrieval
Xintong Jiang, Yaxiong Wang, Yujiao Wu, Meng Wang, Xueming Qian

TL;DR
This paper introduces a dual relation alignment framework for composed image retrieval that leverages both explicit and implicit relations among images and text, significantly improving retrieval accuracy.
Contribution
It proposes a novel dual relation alignment method that integrates explicit and implicit relations, addressing a key limitation of existing approaches.
Findings
Enhanced retrieval performance on CIRR and FashionIQ datasets.
Effective integration of implicit relation improves semantic understanding.
Significant performance gains over existing methods.
Abstract
Composed image retrieval, a task involving the search for a target image using a reference image and a complementary text as the query, has witnessed significant advancements owing to the progress made in cross-modal modeling. Unlike the general image-text retrieval problem with only one alignment relation, i.e., image-text, we argue for the existence of two types of relations in composed image retrieval. The explicit relation pertains to the reference image & complementary text-target image, which is commonly exploited by existing methods. Besides this intuitive relation, the observations during our practice have uncovered another implicit yet crucial relation, i.e., reference image & target image-complementary text, since we found that the complementary text can be inferred by studying the relation between the target image and the reference image. Regrettably, existing methods largely…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMultimodal Machine Learning Applications · Image Retrieval and Classification Techniques · Advanced Image and Video Retrieval Techniques
MethodsFocus
