The Ratio Index for Budgeted Learning, with Applications

Ashish Goel; Sanjeev Khanna; Brad Null

arXiv:0810.0558·cs.DS·April 12, 2016

The Ratio Index for Budgeted Learning, with Applications

Ashish Goel, Sanjeev Khanna, Brad Null

PDF

Open Access

TL;DR

This paper introduces the ratio index, a new index-based approach for budgeted learning and multi-armed bandits, providing constant factor approximations and efficient computation, improving upon existing algorithms and indices.

Contribution

The paper presents the ratio index, a novel index for budgeted learning that guarantees constant factor approximation and can be computed in strongly polynomial time.

Findings

01

The ratio index can be computed efficiently in strongly polynomial time.

02

It is a constant factor approximation to the Gittins index under certain conditions.

03

An index-based policy achieves an O(1)-approximation for finite horizon multi-armed bandits, independent of the horizon.

Abstract

In the budgeted learning problem, we are allowed to experiment on a set of alternatives (given a fixed experimentation budget) with the goal of picking a single alternative with the largest possible expected payoff. Approximation algorithms for this problem were developed by Guha and Munagala by rounding a linear program that couples the various alternatives together. In this paper we present an index for this problem, which we call the ratio index, which also guarantees a constant factor approximation. Index-based policies have the advantage that a single number (i.e. the index) can be computed for each alternative irrespective of all other alternatives, and the alternative with the highest index is experimented upon. This is analogous to the famous Gittins index for the discounted multi-armed bandit problem. The ratio index has several interesting structural properties. First, we…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsOptimization and Search Problems · Machine Learning and Algorithms · Advanced Bandit Algorithms Research