Active Exploration via Autoregressive Generation of Missing Data
Tiffany Tianhui Cai, Hongseok Namkoong, Daniel Russo, Kelly W Zhang

TL;DR
This paper introduces a novel approach to uncertainty quantification and exploration in online decision-making by using autoregressive generative models to predict missing outcomes, enabling more effective exploration and decision strategies.
Contribution
It proposes viewing uncertainty as missing future outcomes, leveraging autoregressive models for prediction and exploration, and demonstrates theoretical and empirical benefits in meta-bandit problems.
Findings
Successful reduction from offline prediction to online decision-making
Effective exploration in a news recommendation task using text features
Theoretical guarantees for uncertainty quantification
Abstract
We pose uncertainty quantification and exploration in online decision-making as a problem of training and generation from an autoregressive sequence model, an area experiencing rapid innovation. Our approach rests on viewing uncertainty as arising from missing future outcomes that would be revealed through appropriate action choices, rather than from unobservable latent parameters of the environment. This reformulation aligns naturally with modern machine learning capabilities: we can i) train generative models through next-outcome prediction rather than fit explicit priors, ii) assess uncertainty through autoregressive generation rather than parameter sampling, and iii) adapt to new information through in-context learning rather than explicit posterior updating. To showcase these ideas, we formulate a challenging meta-bandit problem where effective performance requires leveraging…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpeech and Audio Processing
