A hierarchical Vovk-Azoury-Warmuth forecaster with discounting for online regression in RKHS

Dmitry B. Rokhlin

arXiv:2506.22631·cs.LG·July 1, 2025

A hierarchical Vovk-Azoury-Warmuth forecaster with discounting for online regression in RKHS

Dmitry B. Rokhlin

PDF

Open Access

TL;DR

This paper introduces a hierarchical, fully adaptive online regression algorithm in RKHS that learns the discount factor and number of features, achieving near-optimal dynamic regret with manageable computational complexity.

Contribution

It extends the discounted Vovk-Azoury-Warmuth framework to non-parametric RKHS using random features, with an adaptive algorithm that learns key parameters.

Findings

01

Achieves expected dynamic regret of O(T^{2/3} P_T^{1/3} + √T ln T)

02

Per-iteration complexity is O(T ln T)

03

Learns both discount factor and number of random features adaptively

Abstract

We study the problem of online regression with the unconstrained quadratic loss against a time-varying sequence of functions from a Reproducing Kernel Hilbert Space (RKHS). Recently, Jacobsen and Cutkosky (2024) introduced a discounted Vovk-Azoury-Warmuth (DVAW) forecaster that achieves optimal dynamic regret in the finite-dimensional case. In this work, we lift their approach to the non-parametric domain by synthesizing the DVAW framework with a random feature approximation. We propose a fully adaptive, hierarchical algorithm, which we call H-VAW-D (Hierarchical Vovk-Azoury-Warmuth with Discounting), that learns both the discount factor and the number of random features. We prove that this algorithm, which has a per-iteration computational complexity of $O (T ln T)$ , achieves an expected dynamic regret of $O (T^{2/3} P_{T}^{1/3} + T ln T)$ , where $P_{T}$ is the functional path length…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Bandit Algorithms Research · Stochastic Gradient Optimization Techniques · Sparse and Compressive Sensing Techniques