Active Exploration via Autoregressive Generation of Missing Data

Tiffany Tianhui Cai; Hongseok Namkoong; Daniel Russo; Kelly W Zhang

arXiv:2405.19466·cs.LG·February 6, 2025

Active Exploration via Autoregressive Generation of Missing Data

Tiffany Tianhui Cai, Hongseok Namkoong, Daniel Russo, Kelly W Zhang

PDF

Open Access

TL;DR

This paper introduces a novel approach to uncertainty quantification and exploration in online decision-making by using autoregressive generative models to predict missing outcomes, enabling more effective exploration and decision strategies.

Contribution

It proposes viewing uncertainty as missing future outcomes, leveraging autoregressive models for prediction and exploration, and demonstrates theoretical and empirical benefits in meta-bandit problems.

Findings

01

Successful reduction from offline prediction to online decision-making

02

Effective exploration in a news recommendation task using text features

03

Theoretical guarantees for uncertainty quantification

Abstract

We pose uncertainty quantification and exploration in online decision-making as a problem of training and generation from an autoregressive sequence model, an area experiencing rapid innovation. Our approach rests on viewing uncertainty as arising from missing future outcomes that would be revealed through appropriate action choices, rather than from unobservable latent parameters of the environment. This reformulation aligns naturally with modern machine learning capabilities: we can i) train generative models through next-outcome prediction rather than fit explicit priors, ii) assess uncertainty through autoregressive generation rather than parameter sampling, and iii) adapt to new information through in-context learning rather than explicit posterior updating. To showcase these ideas, we formulate a challenging meta-bandit problem where effective performance requires leveraging…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSpeech and Audio Processing