Semantic Granularity Metric Learning for Visual Search

Dipu Manandhar; Muhammet Bastan; Kim-Hui Yap

arXiv:1911.06047·cs.CV·May 17, 2021

Semantic Granularity Metric Learning for Visual Search

Dipu Manandhar, Muhammet Bastan, Kim-Hui Yap

PDF

TL;DR

This paper introduces a semantic granularity metric learning approach that leverages attribute semantic space and a novel loss function to improve visual search by capturing multiple levels of similarity.

Contribution

It proposes a new deep metric learning framework that models multiple granularities of visual similarity using attribute semantics and a specialized loss function.

Findings

01

Outperforms recent state-of-the-art methods by 1-4.5% in Recall@1 on DeepFashion In-Shop dataset.

02

Effectively captures multiple levels of visual similarity.

03

Demonstrates improved retrieval performance on benchmark datasets.

Abstract

Deep metric learning applied to various applications has shown promising results in identification, retrieval and recognition. Existing methods often do not consider different granularity in visual similarity. However, in many domain applications, images exhibit similarity at multiple granularities with visual semantic concepts, e.g. fashion demonstrates similarity ranging from clothing of the exact same instance to similar looks/design or a common category. Therefore, training image triplets/pairs used for metric learning inherently possess different degree of information. However, the existing methods often treats them with equal importance during training. This hinders capturing the underlying granularities in feature similarity required for effective visual search. In view of this, we propose a new deep semantic granularity metric learning (SGML) that develops a novel idea of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.