TL;DR
The paper introduces PAN, a novel similarity network that leverages pairwise attribute information to improve image similarity measurement, especially when attribute annotations are incomplete or partially relevant.
Contribution
It proposes a joint representation approach that captures similarity conditions and relevance scores, outperforming prior attribute-based methods across multiple image similarity tasks.
Findings
4-9% improvement on clothing compatibility prediction
5% gain in few-shot image classification
Over 1% boost in Recall@1 for clothes retrieval
Abstract
Measuring similarity between two images often requires performing complex reasoning along different axes (e.g., color, texture, or shape). Insights into what might be important for measuring similarity can can be provided by annotated attributes, but prior work tends to view these annotations as complete, resulting in them using a simplistic approach of predicting attributes on single images, which are, in turn, used to measure similarity. However, it is impractical for a dataset to fully annotate every attribute that may be important. Thus, only representing images based on these incomplete annotations may miss out on key information. To address this issue, we propose the Pairwise Attribute-informed similarity Network (PAN), which breaks similarity learning into capturing similarity conditions and relevance scores from a joint representation of two images. This enables our model to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
