Optimal Learners for Realizable Regression: PAC Learning and Online Learning
Idan Attias, Steve Hanneke, Alkis Kalavasis, Amin Karbasi, Grigoris, Velegkas

TL;DR
This paper characterizes the statistical complexity of realizable regression in PAC and online learning, introducing new dimensions that determine learnability and designing optimal learners for both settings.
Contribution
It introduces a novel dimension for characterizing learnability, proposes a minimax instance optimal learner, and resolves an open question in online realizable regression.
Findings
A new dimension characterizes PAC learnability of real-valued predictor classes.
A combinatorial dimension related to the Graph dimension characterizes ERM learnability.
An optimal online learner for realizable regression is constructed, resolving prior open questions.
Abstract
In this work, we aim to characterize the statistical complexity of realizable regression both in the PAC learning setting and the online learning setting. Previous work had established the sufficiency of finiteness of the fat shattering dimension for PAC learnability and the necessity of finiteness of the scaled Natarajan dimension, but little progress had been made towards a more complete characterization since the work of Simon (SICOMP '97). To this end, we first introduce a minimax instance optimal learner for realizable regression and propose a novel dimension that both qualitatively and quantitatively characterizes which classes of real-valued predictors are learnable. We then identify a combinatorial dimension related to the Graph dimension that characterizes ERM learnability in the realizable setting. Finally, we establish a necessary condition for learnability based on a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsMachine Learning and Algorithms · Imbalanced Data Classification Techniques · Machine Learning and Data Classification
