Sequential Decision Making on Unmatched Data using Bayesian Kernel Embeddings
Diego Martinez-Taboada, Dino Sejdinovic

TL;DR
This paper introduces a novel Bayesian kernel embedding-based algorithm for sequential decision making with unmatched data, accounting for uncertainty in feature distribution and unknown functions, outperforming existing methods.
Contribution
The paper presents a new algorithm that models feature distributions as Bayesian embeddings and unknown functions as Gaussian processes for improved decision making.
Findings
Empirically outperforms current state-of-the-art algorithms.
Effectively models uncertainty in feature distribution and function estimation.
Demonstrates robustness on experimental datasets.
Abstract
The problem of sequentially maximizing the expectation of a function seeks to maximize the expected value of a function of interest without having direct control on its features. Instead, the distribution of such features depends on a given context and an action taken by an agent. In contrast to Bayesian optimization, the arguments of the function are not under agent's control, but are indirectly determined by the agent's action based on a given context. If the information of the features is to be included in the maximization problem, the full conditional distribution of such features, rather than its expectation only, needs to be accounted for. Furthermore, the function is itself unknown, only counting with noisy observations of such function, and potentially requiring the use of unmatched data sets. We propose a novel algorithm for the aforementioned problem which takes into…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGaussian Processes and Bayesian Inference · Target Tracking and Data Fusion in Sensor Networks · Advanced Bandit Algorithms Research
