Online Realizable Regression and Applications for ReLU Networks

Ilan Doron-Arad; Idan Mehalel; Elchanan Mossel

arXiv:2602.19172·cs.LG·February 24, 2026

TL;DR

This paper introduces a potential-based method for analyzing realizable online regression. It bounds cumulative loss through entropy integrals and covering numbers, and applies the bounds to ReLU networks.

Contribution

The paper develops a generic potential method that bounds the realizable online regression loss by an entropy integral, then applies it to ReLU networks to separate learnable cases from unlearnable ones.

Findings

01. Bounded cumulative loss for polynomial metric entropy classes.

02. Sharp dichotomy for Lipschitz regression based on q and d.

03. Finite loss for bounded-norm ReLU networks, infinite for certain classification cases.

Abstract

Realizable online regression can behave very differently from online classification. Even without any margin or stochastic assumptions, realizability may enforce horizon-free (finite) cumulative loss under metric-like losses, even when the analogous classification problem has an infinite mistake bound. We study realizable online regression in the adversarial model under losses that satisfy an approximate triangle inequality (approximate pseudo-metrics). Recent work of Attias et al. shows that the minimax realizable cumulative loss is characterized by the scaled Littlestone/online dimension $D_{\mathrm{onl}}$, but this quantity can be difficult to analyze. Our main contribution is a generic potential method that upper bounds $D_{\mathrm{onl}}$ by a concrete Dudley-type entropy integral that depends only on covering numbers of the hypothesis class under the induced sup…
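A concrete loss may help make the phrase "approximate triangle inequality" tangible. The sketch below is an illustration only, not the paper's construction: it assumes the power loss |a - b|^q with q ≥ 1 and the constant c = 2^(q-1), both picked here as a standard example, and checks numerically on a small grid that loss(a, z) ≤ c · (loss(a, b) + loss(b, z)). The function names are hypothetical.

```python
import itertools


def power_loss(a: float, b: float, q: float) -> float:
    """Illustrative loss |a - b|^q (an assumption, not taken from the paper)."""
    return abs(a - b) ** q


def triangle_constant(q: float) -> float:
    """Constant c = 2^(q-1), valid for q >= 1 by convexity of t -> t^q."""
    return 2.0 ** (q - 1.0)


def satisfies_approx_triangle(points, q: float, tol: float = 1e-12) -> bool:
    """Check loss(a, z) <= c * (loss(a, b) + loss(b, z)) over all triples."""
    c = triangle_constant(q)
    return all(
        power_loss(a, z, q)
        <= c * (power_loss(a, b, q) + power_loss(b, z, q)) + tol
        for a, b, z in itertools.product(points, repeat=3)
    )


# Small grid of real-valued predictions/labels in [0, 1].
grid = [i / 10 for i in range(11)]

# Squared loss is not a metric: the exact triangle inequality fails here.
exact_triangle_fails = power_loss(0.0, 1.0, 2.0) > (
    power_loss(0.0, 0.5, 2.0) + power_loss(0.5, 1.0, 2.0)
)

# With the relaxation constant c = 2, the inequality holds on the grid.
squared_ok = satisfies_approx_triangle(grid, q=2.0)
absolute_ok = satisfies_approx_triangle(grid, q=1.0)
```

For q = 1 the constant is 1 and the check reduces to the ordinary triangle inequality for absolute loss; for q = 2 the squared loss violates the exact inequality but satisfies the relaxed one.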

Taxonomy

Topics: Stochastic Gradient Optimization Techniques · Advanced Bandit Algorithms Research · Adversarial Robustness in Machine Learning