Reconstruction Regularized Deep Metric Learning for Multi-label Image Classification
Changsheng Li, Chong Liu, Lixin Duan, Peng Gao, Kai Zheng

TL;DR
This paper introduces a deep metric learning approach with a reconstruction regularization for improved multi-label image classification, effectively capturing label correlations and image-label relationships in a learned embedding space.
Contribution
It proposes a novel two-way deep distance metric and a label reconstruction module, enhancing multi-label classification performance over existing methods.
Findings
Outperforms state-of-the-art methods on benchmark datasets.
Effectively models label correlations and image-label relationships.
End-to-end trainable framework with improved accuracy.
Abstract
In this paper, we present a novel deep metric learning method to tackle the multi-label image classification problem. In order to better learn the correlations among images features, as well as labels, we attempt to explore a latent space, where images and labels are embedded via two unique deep neural networks, respectively. To capture the relationships between image features and labels, we aim to learn a \emph{two-way} deep distance metric over the embedding space from two different views, i.e., the distance between one image and its labels is not only smaller than those distances between the image and its labels' nearest neighbors, but also smaller than the distances between the labels and other images corresponding to the labels' nearest neighbors. Moreover, a reconstruction module for recovering correct labels is incorporated into the whole framework as a regularization term, such…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image and Video Retrieval Techniques · Domain Adaptation and Few-Shot Learning · Image Retrieval and Classification Techniques
