Functional Bandits

Long Tran-Thanh; Jia Yuan Yu

arXiv:1405.2432·stat.ML·May 13, 2014

Functional Bandits

Long Tran-Thanh, Jia Yuan Yu

PDF

Open Access

TL;DR

This paper introduces the functional bandit problem, focusing on optimizing known functionals of unknown reward distributions, and proposes an efficient method combining estimation and elimination with theoretical guarantees.

Contribution

It presents a novel approach for functional bandits that effectively handles complex functionals, expanding the applicability of bandit algorithms to new domains.

Findings

01

Achieves provably efficient performance guarantees.

02

Effectively handles risk management and information theory functionals.

03

Demonstrates improved arm identification in complex settings.

Abstract

We introduce the functional bandit problem, where the objective is to find an arm that optimises a known functional of the unknown arm-reward distributions. These problems arise in many settings such as maximum entropy methods in natural language processing, and risk-averse decision-making, but current best-arm identification techniques fail in these domains. We propose a new approach, that combines functional estimation and arm elimination, to tackle this problem. This method achieves provably efficient performance guarantees. In addition, we illustrate this method on a number of important functionals in risk management and information theory, and refine our generic theoretical results in those cases.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Bandit Algorithms Research · Machine Learning and Algorithms · Risk and Portfolio Optimization