Scale-free Unconstrained Online Learning for Curved Losses
Jack J. Mayo, H\'edi Hadiji, Tim van Erven

TL;DR
This paper demonstrates that in unconstrained online learning, adaptivity to comparator norm and gradient bounds can be achieved without additional costs for common losses like logistic and square loss, providing scale-free algorithms with matching lower bounds.
Contribution
It shows that no extra penalty for adaptivity exists for several supervised online learning losses, extending previous results and filling gaps with matching lower bounds.
Findings
No adaptivity cost for log loss, logistic regression, and square loss.
Provided scale-free algorithms invariant under data rescaling.
Achieved adaptive linear logistic regression with efficient algorithms.
Abstract
A sequence of works in unconstrained online convex optimisation have investigated the possibility of adapting simultaneously to the norm of the comparator and the maximum norm of the gradients. In full generality, matching upper and lower bounds are known which show that this comes at the unavoidable cost of an additive , which is not needed when either or is known in advance. Surprisingly, recent results by Kempka et al. (2019) show that no such price for adaptivity is needed in the specific case of -Lipschitz losses like the hinge loss. We follow up on this observation by showing that there is in fact never a price to pay for adaptivity if we specialise to any of the other common supervised online learning losses: our results cover log loss, (linear and non-parametric) logistic regression, square loss prediction, and (linear and non-parametric) least-squares…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSparse and Compressive Sensing Techniques · Stochastic Gradient Optimization Techniques · Machine Learning and Algorithms
MethodsLogistic Regression
