Online Realizable Regression and Applications for ReLU Networks
Ilan Doron-Arad, Idan Mehalel, Elchanan Mossel

TL;DR
This paper introduces a new potential method to analyze realizable online regression with ReLU networks, providing bounds on cumulative loss based on entropy integrals and covering numbers, with applications to neural network complexity.
Contribution
It develops a generic potential approach to bound online regression loss using entropy integrals, and applies this to ReLU networks to distinguish between learnable and unlearnable cases.
Findings
Bounded cumulative loss for polynomial metric entropy classes.
Sharp dichotomy for Lipschitz regression based on q and d.
Finite loss for bounded-norm ReLU networks, infinite for certain classification cases.
Abstract
Realizable online regression can behave very differently from online classification. Even without any margin or stochastic assumptions, realizability may enforce horizon-free (finite) cumulative loss under metric-like losses, even when the analogous classification problem has an infinite mistake bound. We study realizable online regression in the adversarial model under losses that satisfy an approximate triangle inequality (approximate pseudo-metrics). Recent work of Attias et al. shows that the minimax realizable cumulative loss is characterized by the scaled Littlestone/online dimension , but this quantity can be difficult to analyze. Our main contribution is a generic potential method that upper bounds by a concrete Dudley-type entropy integral that depends only on covering numbers of the hypothesis class under the induced sup…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic Gradient Optimization Techniques · Advanced Bandit Algorithms Research · Adversarial Robustness in Machine Learning
