TL;DR
This paper introduces an Attribute-Specific Embedding Network (ASEN) that learns multiple attribute-focused embeddings for fine-grained fashion similarity prediction, enhancing accuracy for applications like fashion copyright protection.
Contribution
The paper proposes a novel ASEN model combining global and local branches with attention modules to improve attribute-specific similarity measurement in fashion images.
Findings
ASEN outperforms existing methods on FashionAI, DARN, and DeepFashion datasets.
The model effectively captures fine-grained attribute similarities.
ASEN demonstrates potential for fashion reranking applications.
Abstract
This paper strives to predict fine-grained fashion similarity. In this similarity paradigm, one should pay more attention to the similarity in terms of a specific design/attribute between fashion items. For example, whether the collar designs of the two clothes are similar. It has potential value in many fashion related applications, such as fashion copyright protection. To this end, we propose an Attribute-Specific Embedding Network (ASEN) to jointly learn multiple attribute-specific embeddings, thus measure the fine-grained similarity in the corresponding space. The proposed ASEN is comprised of a global branch and a local branch. The global branch takes the whole image as input to extract features from a global perspective, while the local branch takes as input the zoomed-in region-of-interest (RoI) w.r.t. the specified attribute thus able to extract more fine-grained features. As…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
