Adaptive Matrix Online Learning through Smoothing with Guarantees for Nonsmooth Nonconvex Optimization

Ruichen Jiang; Zakaria Mhammedi; Mehryar Mohri; Aryan Mokhtari

arXiv:2602.08232·math.OC·February 10, 2026

Adaptive Matrix Online Learning through Smoothing with Guarantees for Nonsmooth Nonconvex Optimization

Ruichen Jiang, Zakaria Mhammedi, Mehryar Mohri, Aryan Mokhtari

PDF

Open Access

TL;DR

This paper introduces adaptive matrix online learning algorithms that use smoothing techniques to achieve regret bounds comparable to existing methods but with reduced computational complexity, and extends guarantees to nonsmooth nonconvex optimization.

Contribution

It develops efficient adaptive algorithms for matrix online learning using smoothing, providing regret guarantees and convergence analysis in nonsmooth nonconvex settings.

Findings

01

Achieves regret bounds matching one-sided Shampoo with less computation.

02

Introduces two smoothing-based methods: FTPL with Gaussian smoothing and FAML with hyperbolic smoothing.

03

Provides convergence guarantees for the derived matrix optimizers in nonsmooth nonconvex optimization.

Abstract

We study online linear optimization with matrix variables constrained by the operator norm, a setting where the geometry renders designing data-dependent and efficient adaptive algorithms challenging. The best-known adaptive regret bounds are achieved by Shampoo-like methods, but they require solving a costly quadratic projection subproblem. To address this, we extend the gradient-based prediction scheme to adaptive matrix online learning and cast algorithm design as constructing a family of smoothed potentials for the nuclear norm. We define a notion of admissibility for such smoothings and prove any admissible smoothing yields a regret bound matching the best-known guarantees of one-sided Shampoo. We instantiate this framework with two efficient methods that avoid quadratic projections. The first is an adaptive Follow-the-Perturbed-Leader (FTPL) method using Gaussian stochastic…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Bandit Algorithms Research · Stochastic Gradient Optimization Techniques · Sparse and Compressive Sensing Techniques