Classification is a Strong Baseline for Deep Metric Learning

Andrew Zhai; Hao-Yu Wu

arXiv:1811.12649·cs.CV·August 6, 2019·37 cites


TL;DR

This paper demonstrates that classification-based methods are a competitive and scalable approach for deep metric learning tasks like image retrieval and face verification, challenging the dominance of triplet-based methods.

Contribution

The study shows that classification-based training is effective for deep metric learning across multiple retrieval datasets, and it explores class subsampling and binarization as ways to improve scalability and efficiency.

Findings

01 Classification-based approaches perform competitively on standard retrieval datasets.

02 Subsampling classes can improve scalability without sacrificing accuracy.

03 Binarization enables efficient storage and computation.
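
To make the binarization finding concrete, here is a minimal sketch, assuming a simple sign threshold at zero. This summary does not state the paper's exact thresholding rule, so the threshold, the 512-dimensional random embeddings, and the function names `binarize` and `hamming_distances` are all illustrative assumptions.

```python
import numpy as np

def binarize(embeddings):
    """Threshold float embeddings at zero (an assumed rule) and pack 8 bits per byte."""
    bits = (np.asarray(embeddings) > 0).astype(np.uint8)
    return np.packbits(bits, axis=1)

def hamming_distances(query_code, database_codes):
    """Count differing bits between one packed code and every database code."""
    xor = np.bitwise_xor(database_codes, query_code)
    return np.unpackbits(xor, axis=1).sum(axis=1)

# Hypothetical data: 1000 random 512-dimensional float32 embeddings.
rng = np.random.default_rng(0)
database = rng.standard_normal((1000, 512)).astype(np.float32)

# Each 512-float vector (2048 bytes) becomes a 64-byte binary code.
codes = binarize(database)

# A slightly perturbed copy of item 42 serves as the query.
query = binarize(database[42:43] + 0.01 * rng.standard_normal((1, 512)))
nearest = int(np.argmin(hamming_distances(query[0], codes)))
```

Under these assumptions the packed codes take 32 times less storage than the float32 vectors, and retrieval reduces to XOR plus bit counting rather than floating-point dot products.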

Abstract

Deep metric learning aims to learn a function mapping image pixels to embedding feature vectors that model the similarity between images. Two major applications of metric learning are content-based image retrieval and face verification. For the retrieval tasks, the majority of current state-of-the-art (SOTA) approaches use triplet-based non-parametric training. For the face verification tasks, however, recent SOTA approaches have adopted classification-based parametric training. In this paper, we look into the effectiveness of classification-based approaches on image retrieval datasets. We evaluate on several standard retrieval datasets such as CAR-196, CUB-200-2011, Stanford Online Product, and In-Shop datasets for image retrieval and clustering, and establish that our classification-based approach is competitive across different feature dimensions and base feature networks. We further…
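
The contrast between triplet-based and classification-based training can be illustrated with a small loss function. The following is a minimal sketch, not the authors' implementation: it assumes cosine-similarity logits between L2-normalized embeddings and one learned weight vector per class, scaled by a temperature. The abstract as shown here does not specify these details, so the normalization, the `temperature` value, and the name `normalized_softmax_loss` are assumptions.

```python
import numpy as np

def normalized_softmax_loss(embeddings, class_weights, labels, temperature=0.05):
    """Cross-entropy over cosine similarities to per-class weight vectors.

    The temperature value and the L2 normalization are illustrative assumptions.
    """
    e = embeddings / np.linalg.norm(embeddings, axis=1, keepdims=True)
    w = class_weights / np.linalg.norm(class_weights, axis=1, keepdims=True)
    logits = e @ w.T / temperature
    # Subtract the row maximum for a numerically stable log-softmax.
    logits = logits - logits.max(axis=1, keepdims=True)
    log_probs = logits - np.log(np.exp(logits).sum(axis=1, keepdims=True))
    return float(-log_probs[np.arange(len(labels)), labels].mean())

# Hypothetical data: 10 classes, 64-dimensional embeddings, batch of 32.
rng = np.random.default_rng(0)
weights = rng.standard_normal((10, 64))
labels = rng.integers(0, 10, size=32)

# Embeddings close to their class weight vector versus unrelated embeddings.
aligned = weights[labels] + 0.1 * rng.standard_normal((32, 64))
unrelated = rng.standard_normal((32, 64))

loss_aligned = normalized_softmax_loss(aligned, weights, labels)
loss_unrelated = normalized_softmax_loss(unrelated, weights, labels)
```

In this sketch the loss is low when each embedding points in the direction of its own class vector and high otherwise, which is what lets a classifier's penultimate features serve as a retrieval embedding without sampling triplets.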


Taxonomy

Topics: Advanced Image and Video Retrieval Techniques · Face recognition and analysis · Domain Adaptation and Few-Shot Learning

Methods: Average Pooling · Residual Connection · 1x1 Convolution · Batch Normalization · Bottleneck Residual Block · Global Average Pooling · Residual Block · Kaiming Initialization · Max Pooling