TAPAS: Two-pass Approximate Adaptive Sampling for Softmax

Yu Bai; Sally Goldman; Li Zhang

arXiv:1707.03073·cs.LG·July 17, 2017·5 cites

TAPAS: Two-pass Approximate Adaptive Sampling for Softmax

Yu Bai, Sally Goldman, Li Zhang

PDF

Open Access

TL;DR

TAPAS introduces a two-pass adaptive sampling method for softmax models that efficiently approximates gradients, improving multi-class classification with large label spaces.

Contribution

It proposes a novel two-pass sampling strategy for softmax, with an efficient distributed implementation, enhancing performance on large-scale classification tasks.

Findings

01

Low computational overhead demonstrated on synthetic and real data.

02

Effective in minimizing rank loss for large label spaces.

03

Works well for multi-class classification problems.

Abstract

TAPAS is a novel adaptive sampling method for the softmax model. It uses a two pass sampling strategy where the examples used to approximate the gradient of the partition function are first sampled according to a squashed population distribution and then resampled adaptively using the context and current model. We describe an efficient distributed implementation of TAPAS. We show, on both synthetic data and a large real dataset, that TAPAS has low computational overhead and works well for minimizing the rank loss for multi-class classification problems with a very large label space.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning and Algorithms · Sparse and Compressive Sensing Techniques · Domain Adaptation and Few-Shot Learning

MethodsSoftmax