New Statistical and Computational Results for Learning Junta Distributions

Lorenzo Beretta

arXiv:2505.05819·cs.LG·July 15, 2025

TL;DR

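This paper shows that learning junta distributions is computationally equivalent to learning parity functions with noise (LPN), and gives a junta-learning algorithm whose statistical complexity is optimal up to polylogarithmic factors. Together, the two results imply that the algorithm cannot be significantly improved, statistically or computationally, without a breakthrough for LPN.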

Contribution

It proves that learning junta distributions and learning noisy parity functions are computationally equivalent, and presents a junta-learning algorithm whose statistical complexity is optimal up to polylogarithmic factors.

Findings

01

Learning junta distributions is computationally equivalent to learning noisy parity functions.

02

The proposed algorithm achieves statistical complexity that is optimal up to polylogarithmic factors.

03

The algorithm's computational complexity matches that of previous (non-sample-optimal) algorithms.
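
The noisy parity problem named in the first finding can be made concrete with a small sample generator. The following is a minimal sketch of the standard LPN setup, not code from the paper: the function name `noisy_parity_samples`, the noise-rate parameter `eta`, and the choice of uniformly random inputs are illustrative assumptions.

```python
import random


def noisy_parity_samples(n, secret, eta, m, rng):
    """Draw m labelled examples (x, y) for a hidden parity on n bits.

    secret: indices of the hidden parity set (hypothetical name); a
            k-parity has at most k indices.
    eta:    assumed probability that a label is flipped.
    """
    samples = []
    for _ in range(m):
        # Uniformly random input in {0, 1}^n.
        x = [rng.randrange(2) for _ in range(n)]
        # Noiseless label: XOR of the bits indexed by the secret set.
        parity = sum(x[i] for i in secret) % 2
        # Flip the label with probability eta.
        noise = 1 if rng.random() < eta else 0
        samples.append((x, parity ^ noise))
    return samples
```

A learner sees only the pairs `(x, y)` and must recover `secret`. With `eta = 0` the labels are exact parities; the difficulty comes from the label noise.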

Abstract

We study the problem of learning junta distributions on $\{0, 1\}^{n}$, where a distribution is a $k$-junta if its probability mass function depends on a subset of at most $k$ variables. We make two main contributions:

- We show that learning $k$-junta distributions is computationally equivalent to learning $k$-parity functions with noise (LPN), a landmark problem in computational learning theory.
- We design an algorithm for learning junta distributions whose statistical complexity is optimal, up to polylogarithmic factors. Computationally, our algorithm matches the complexity of previous (non-sample-optimal) algorithms.

Combined, our two contributions imply that our algorithm cannot be significantly improved, statistically or computationally, barring a breakthrough for LPN.
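
The definition of a junta distribution in the abstract can be illustrated with a short sketch. Because the probability mass function depends only on the relevant coordinates, it factors as a distribution `q` over those coordinates divided by the number of settings of the remaining ones, so the remaining bits are uniform. The names `junta_pmf`, `sample_junta`, `relevant`, and `q` below are hypothetical, chosen for illustration rather than taken from the paper.

```python
import itertools
import random


def junta_pmf(x, relevant, q):
    """Mass of x under a junta distribution: q[x restricted to relevant] / 2^(n - k)."""
    key = tuple(x[i] for i in relevant)
    return q[key] / 2 ** (len(x) - len(relevant))


def sample_junta(n, relevant, q, rng):
    """Draw one point: relevant bits follow q, all other bits are uniform."""
    keys = list(q)
    key = rng.choices(keys, weights=[q[k] for k in keys])[0]
    x = [rng.randrange(2) for _ in range(n)]
    for i, bit in zip(relevant, key):
        x[i] = bit
    return x


# Example: a 2-junta on {0, 1}^5 whose mass depends only on bits 0 and 3.
q = {(0, 0): 0.5, (0, 1): 0.25, (1, 0): 0.125, (1, 1): 0.125}
total = sum(junta_pmf(list(x), [0, 3], q)
            for x in itertools.product([0, 1], repeat=5))
```

Here `total` sums the mass over all 32 points and should come out to 1, and changing any bit outside `relevant` leaves `junta_pmf` unchanged. The learning problem is the reverse direction: given only samples such as those from `sample_junta`, estimate the distribution without knowing `relevant`.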
