Towards Interpretable Deep Metric Learning with Structural Matching

Wenliang Zhao; Yongming Rao; Ziyi Wang; Jiwen Lu; Jie Zhou

arXiv:2108.05889·cs.CV·August 13, 2021·1 cites

Towards Interpretable Deep Metric Learning with Structural Matching

Wenliang Zhao, Yongming Rao, Ziyi Wang, Jiwen Lu, Jie Zhou

PDF

Open Access 1 Repo

TL;DR

This paper introduces a structural matching approach for deep metric learning that improves interpretability by explicitly aligning spatial features, enabling more transparent similarity assessments between images.

Contribution

The proposed DIML method incorporates explicit spatial alignment into deep metric learning, enhancing interpretability and compatibility with existing models.

Findings

01

Achieves better interpretability of image similarity

02

Substantially improves performance on benchmark datasets

03

Compatible with various backbone networks

Abstract

How do the neural networks distinguish two images? It is of critical importance to understand the matching mechanism of deep models for developing reliable intelligent systems for many risky visual applications such as surveillance and access control. However, most existing deep metric learning methods match the images by comparing feature vectors, which ignores the spatial structure of images and thus lacks interpretability. In this paper, we present a deep interpretable metric learning (DIML) method for more transparent embedding learning. Unlike conventional metric learning methods based on feature vector comparison, we propose a structural matching strategy that explicitly aligns the spatial embeddings by computing an optimal matching flow between feature maps of the two images. Our method enables deep models to learn metrics in a more human-friendly way, where the similarity of two…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

wl-zhao/diml
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsFace recognition and analysis · Video Surveillance and Tracking Methods · Human Pose and Action Recognition