Heated-Up Softmax Embedding

Xu Zhang; Felix Xinnan Yu; Svebor Karaman; Wei Zhang; Shih-Fu Chang

arXiv:1809.04157·cs.LG·September 13, 2018·39 cites

Heated-Up Softmax Embedding

Xu Zhang, Felix Xinnan Yu, Svebor Karaman, Wei Zhang, Shih-Fu Chang

PDF

Open Access 1 Repo

TL;DR

This paper introduces a 'heating-up' strategy that progressively increases the temperature in softmax classifiers, resulting in more effective embeddings for metric learning tasks, achieving state-of-the-art results.

Contribution

The paper proposes a novel temperature scheduling method during training that enhances embedding quality for metric learning.

Findings

01

Higher softmax temperatures produce more spread-out features.

02

Heating-up training improves embedding compactness and separability.

03

State-of-the-art performance on multiple benchmarks.

Abstract

Metric learning aims at learning a distance which is consistent with the semantic meaning of the samples. The problem is generally solved by learning an embedding for each sample such that the embeddings of samples of the same category are compact while the embeddings of samples of different categories are spread-out in the feature space. We study the features extracted from the second last layer of a deep neural network based classifier trained with the cross entropy loss on top of the softmax layer. We show that training classifiers with different temperature values of softmax function leads to features with different levels of compactness. Leveraging these insights, we propose a "heating-up" strategy to train a classifier with increasing temperatures, leading the corresponding embeddings to achieve state-of-the-art performance on a variety of metric learning benchmarks.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

ColumbiaDVMM/Heated_Up_Softmax_Embedding
tfOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Memory and Neural Computing · Visual Attention and Saliency Detection · Advanced Neural Network Applications

MethodsSoftmax