Binary Hypothesis Testing for Softmax Models and Leverage Score Models

Yuzhou Gu; Zhao Song; Junze Yin

arXiv:2405.06003·stat.ML·June 2, 2025

Binary Hypothesis Testing for Softmax Models and Leverage Score Models

Yuzhou Gu, Zhao Song, Junze Yin

PDF

Open Access 1 Video

TL;DR

This paper investigates the sample complexity of binary hypothesis testing for softmax and leverage score models, establishing asymptotic bounds and drawing analogies between these models in machine learning and linear algebra.

Contribution

It introduces the first analysis of sample complexity for hypothesis testing in softmax models and relates these results to leverage score models, providing new theoretical insights.

Findings

01

Sample complexity is asymptotically O(ε^{-2}) for softmax models.

02

Similar bounds are derived for leverage score models.

03

Establishes an analogy between softmax and leverage score models.

Abstract

Softmax distributions are widely used in machine learning, including Large Language Models (LLMs), where the attention unit uses softmax distributions. We abstract the attention unit as the softmax model, where given a vector input, the model produces an output drawn from the softmax distribution (which depends on the vector input). We consider the fundamental problem of binary hypothesis testing in the setting of softmax models. That is, given an unknown softmax model, which is known to be one of the two given softmax models, how many queries are needed to determine which one is the truth? We show that the sample complexity is asymptotically $O (ϵ^{- 2})$ where $ϵ$ is a certain distance between the parameters of the models. Furthermore, we draw an analogy between the softmax model and the leverage score model, an important tool for algorithm design in linear algebra and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Binary Hypothesis Testing for Softmax Models and Leverage Score Models· slideslive

Taxonomy

TopicsForecasting Techniques and Applications · Stock Market Forecasting Methods · Neural Networks and Applications

MethodsSoftmax