Instance-Sensitive Algorithms for Pure Exploration in Multinomial Logit   Bandit

Nikolai Karpov; Qin Zhang

arXiv:2012.01499·cs.LG·August 17, 2021

Instance-Sensitive Algorithms for Pure Exploration in Multinomial Logit Bandit

Nikolai Karpov, Qin Zhang

PDF

Open Access 1 Video

TL;DR

This paper introduces efficient, instance-sensitive algorithms for pure exploration in the Multinomial Logit Bandit model, addressing a gap in bandit theory with theoretical bounds and practical implications.

Contribution

It provides the first efficient algorithms for pure exploration in MNL-bandits with instance-sensitive complexity guarantees and matching lower bounds.

Findings

01

Algorithms achieve instance-sensitive pull complexities.

02

Upper bounds are complemented by nearly matching lower bounds.

03

Advances understanding of pure exploration in MNL-bandit models.

Abstract

Motivated by real-world applications such as fast fashion retailing and online advertising, the Multinomial Logit Bandit (MNL-bandit) is a popular model in online learning and operations research, and has attracted much attention in the past decade. However, it is a bit surprising that pure exploration, a basic problem in bandit theory, has not been well studied in MNL-bandit so far. In this paper we give efficient algorithms for pure exploration in MNL-bandit. Our algorithms achieve instance-sensitive pull complexities. We also complement the upper bounds by an almost matching lower bound.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Instance-Sensitive Algorithms for Pure Exploration in Multinomial Logit Bandit· underline

Taxonomy

TopicsAdvanced Bandit Algorithms Research · Optimization and Search Problems · Auction Theory and Applications