On the Complexity of Learning Sparse Functions with Statistical and Gradient Queries

Nirmit Joshi; Theodor Misiakiewicz; Nathan Srebro

arXiv:2407.05622 · cs.LG · July 9, 2024

TL;DR

This paper analyzes the complexity of gradient-based algorithms for learning sparse functions. It introduces Differentiable Learning Queries ($\textsf{DLQ}$) to model gradient queries and characterizes their query complexity across different loss functions.

Contribution

It introduces $\textsf{DLQ}$ as a new query model for gradient algorithms and provides a tight characterization of its query complexity for learning the support of sparse functions under various loss functions.

Findings

01

For the squared loss, $\textsf{DLQ}$ matches the complexity of $\textsf{CSQ}$.

02

For the $\ell_1$ loss, $\textsf{DLQ}$ has the same complexity as $\textsf{SQ}$.

03

The paper gives evidence that $\textsf{DLQ}$ captures the complexity of learning with gradient descent in neural networks.

Abstract

The goal of this paper is to investigate the complexity of gradient algorithms when learning sparse functions (juntas). We introduce a type of Statistical Queries ($\textsf{SQ}$), which we call Differentiable Learning Queries ($\textsf{DLQ}$), to model gradient queries on a specified loss with respect to an arbitrary model. We provide a tight characterization of the query complexity of $\textsf{DLQ}$ for learning the support of a sparse function over generic product distributions. This complexity crucially depends on the loss function. For the squared loss, $\textsf{DLQ}$ matches the complexity of Correlation Statistical Queries ($\textsf{CSQ}$), which is potentially much worse than $\textsf{SQ}$. But for other simple loss functions, including the $\ell_{1}$ loss, $\textsf{DLQ}$ always achieves the same complexity as $\textsf{SQ}$. We also provide evidence that $\textsf{DLQ}$ can indeed capture…
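The dependence on the loss can be seen in a small toy computation. The sketch below is an illustration only, not the paper's construction: the target `y = x1 * x2 * (1 + x3)`, the one-parameter model `f(x; w) = c + w * x_i`, and the offset `c = 1` are all assumptions chosen for this example. Differentiating at `w = 0` gives `(c - y) * x_i` for the squared loss `0.5 * (f - y)^2`, which depends on `y` only through a correlation, and `sign(c - y) * x_i` for the $\ell_1$ loss `|f - y|`, which is nonlinear in `y`.

```python
import itertools

import numpy as np


def target(x):
    # Hypothetical 3-sparse target on the hypercube: y = x1 * x2 * (1 + x3).
    # It has no degree-1 Fourier coefficients, so E[y * x_i] = 0 for every i.
    return x[0] * x[1] * (1.0 + x[2])


def population_gradients(d=5, c=1.0):
    """Exact population gradients at w = 0 for the model f(x; w) = c + w * x_i.

    Returns two length-d arrays, one entry per choice of coordinate i:
    the squared-loss gradient E[(c - y) * x_i] and the
    l1-loss gradient E[sign(c - y) * x_i].
    """
    # Enumerate the uniform distribution on {-1, +1}^d exactly.
    points = np.array(list(itertools.product([-1.0, 1.0], repeat=d)))
    y = np.array([target(x) for x in points])
    residual = c - y  # f(x; 0) - y, never zero for c = 1
    sq = (residual[:, None] * points).mean(axis=0)
    l1 = (np.sign(residual)[:, None] * points).mean(axis=0)
    return sq, l1


sq_grad, l1_grad = population_gradients()
print("squared loss:", sq_grad)
print("l1 loss:     ", l1_grad)
```

In this toy setting every squared-loss gradient is zero, so such correlation-type queries give no signal about the support, while the $\ell_1$ gradient for the third coordinate equals -0.5 and identifies it as relevant. Only coordinates 1 to 3 (indices 0 to 2) are in the support; the remaining two are included as irrelevant distractors.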

Videos

On the Complexity of Learning Sparse Functions with Statistical and Gradient Queries · SlidesLive

Taxonomy

Topics: Machine Learning and Algorithms · Machine Learning and Data Classification · Face and Expression Recognition