Contextual Online Decision Making with Infinite-Dimensional Functional Regression
Haichen Hu, Rui Ai, Stephen Bates, David Simchi-Levi

TL;DR
This paper introduces a universal framework for online decision-making that learns entire unknown distributions in infinite-dimensional spaces, addressing challenges in high-dimensional contextual problems with theoretical regret bounds.
Contribution
It proposes an efficient infinite-dimensional functional regression oracle for contextual CDFs, extending bandit algorithms to handle uncountably infinite regression dimensions.
Findings
Regret bounds depend on the eigenvalue decay rate of the design operator.
Polynomial decay of eigenvalues yields specific regret rates.
Numerical method provided for practical eigenvalue computation.
Abstract
Contextual sequential decision-making problems play a crucial role in machine learning, encompassing a wide range of downstream applications such as bandits, sequential hypothesis testing and online risk control. These applications often require different statistical measures, including expectation, variance and quantiles. In this paper, we provide a universal admissible algorithm framework for dealing with all kinds of contextual online decision-making problems that directly learns the whole underlying unknown distribution instead of focusing on individual statistics. This is much more difficult because the dimension of the regression is uncountably infinite, and any existing linear contextual bandits algorithm will result in infinite regret. To overcome this issue, we propose an efficient infinite-dimensional functional regression oracle for contextual cumulative distribution…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsData Stream Mining Techniques
MethodsExponential Decay
