A bandit-learning approach to multifidelity approximation
Yiming Xu, Vahid Keshavarzzadeh, Robert M. Kirby, Akil Narayan

TL;DR
This paper introduces a bandit-learning method for multifidelity approximation that adaptively leverages data of different fidelities to improve parameter estimates without prior statistical knowledge.
Contribution
It proposes the AETC algorithm, a novel adaptive approach that optimally balances exploration and exploitation in multifidelity settings under a linear model assumption.
Findings
The AETC algorithm is proven to be trajectory-wise optimal.
The method extends efficiently to vector-valued responses.
Numerical experiments confirm the theoretical advantages.
Abstract
Multifidelity approximation is an important technique in scientific computation and simulation. In this paper, we introduce a bandit-learning approach for leveraging data of varying fidelities to achieve precise estimates of the parameters of interest. Under a linear model assumption, we formulate a multifidelity approximation as a modified stochastic bandit, and analyze the loss for a class of policies that uniformly explore each model before exploiting. Utilizing the estimated conditional mean-squared error, we propose a consistent algorithm, adaptive Explore-Then-Commit (AETC), and establish a corresponding trajectory-wise optimality result. These results are then extended to the case of vector-valued responses, where we demonstrate that the algorithm is efficient without the need to worry about estimating high-dimensional parameters. The main advantage of our approach is that we…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Bandit Algorithms Research · Machine Learning and Data Classification · Data Stream Mining Techniques
