Blind Exploration and Exploitation of Stochastic Experts
Noyan C. Sevuktekin, Andrew C. Singer

TL;DR
This paper introduces blind exploration and exploitation algorithms for identifying reliable stochastic experts in multi-armed bandit problems, using unsupervised measures of competence to enable joint expert consultation without true state feedback.
Contribution
It proposes an empirically realizable competence measure that preserves true competence orderings and facilitates unsupervised expert aggregation and decision-making.
Findings
BEE algorithms perform comparably to supervised methods in experiments.
The proposed measure effectively ranks expert reliability without supervision.
Blind algorithms exhibit manageable regret growth similar to supervised counterparts.
Abstract
We present blind exploration and exploitation (BEE) algorithms for identifying the most reliable stochastic expert based on formulations that employ posterior sampling, upper-confidence bounds, empirical Kullback-Leibler divergence, and minmax methods for the stochastic multi-armed bandit problem. Joint sampling and consultation of experts whose opinions depend on the hidden and random state of the world becomes challenging in the unsupervised, or blind, framework as feedback from the true state is not available. We propose an empirically realizable measure of expert competence that can be inferred instantaneously using only the opinions of other experts. This measure preserves the ordering of true competences and thus enables joint sampling and consultation of stochastic experts based on their opinions on dynamically changing tasks. Statistics derived from the proposed measure is…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Bandit Algorithms Research · Mobile Crowdsensing and Crowdsourcing · Data Stream Mining Techniques
