New Statistical and Computational Results for Learning Junta Distributions
Lorenzo Beretta

TL;DR
This paper establishes the computational equivalence between learning junta distributions and learning noisy parity functions, and introduces an optimal statistical algorithm for junta learning, highlighting fundamental limits in the field.
Contribution
It proves the equivalence between learning junta distributions and noisy parity functions, and presents an optimal statistical algorithm for junta learning.
Findings
Learning junta distributions is computationally equivalent to learning noisy parity functions.
The proposed algorithm achieves near-optimal statistical complexity.
Computational complexity of the algorithm matches previous non-sample-optimal methods.
Abstract
We study the problem of learning junta distributions on , where a distribution is a -junta if its probability mass function depends on a subset of at most variables. We make two main contributions: - We show that learning -junta distributions is \emph{computationally} equivalent to learning -parity functions with noise (LPN), a landmark problem in computational learning theory. - We design an algorithm for learning junta distributions whose statistical complexity is optimal, up to polylogarithmic factors. Computationally, our algorithm matches the complexity of previous (non-sample-optimal) algorithms. Combined, our two contributions imply that our algorithm cannot be significantly improved, statistically or computationally, barring a breakthrough for LPN.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
