Incentive-compatible Bandits: Importance Weighting No More

Julian Zimmert; Teodor V. Marinov

arXiv:2405.06480·cs.LG·May 13, 2024

Incentive-compatible Bandits: Importance Weighting No More

Julian Zimmert, Teodor V. Marinov

PDF

Open Access

TL;DR

This paper introduces the first incentive-compatible bandit algorithms with near-optimal regret bounds, improving upon previous work by removing importance weighting and achieving strong guarantees in stochastic and adversarial settings.

Contribution

It presents novel incentive-compatible algorithms with $O( oot{K}{T})$ regret, simplifies existing algorithms via loss-biasing, and achieves best-of-both-worlds guarantees.

Findings

01

Achieved $O( oot{K}{T})$ regret bounds for incentive-compatible bandit algorithms.

02

Demonstrated that simple loss-biasing improves regret bounds of existing algorithms.

03

Developed a bandit algorithm with nearly optimal regret that operates without importance-weighted estimators.

Abstract

We study the problem of incentive-compatible online learning with bandit feedback. In this class of problems, the experts are self-interested agents who might misrepresent their preferences with the goal of being selected most often. The goal is to devise algorithms which are simultaneously incentive-compatible, that is the experts are incentivised to report their true preferences, and have no regret with respect to the preferences of the best fixed expert in hindsight. \citet{freeman2020no} propose an algorithm in the full information setting with optimal $O (T lo g (K))$ regret and $O (T^{2/3} (K lo g (K))^{1/3})$ regret in the bandit setting. In this work we propose the first incentive-compatible algorithms that enjoy $O (K T)$ regret bounds. We further demonstrate how simple loss-biasing allows the algorithm proposed in Freeman et al. 2020 to enjoy $\tilde{O} (K T)$ …

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsFinancial Markets and Investment Strategies · Advanced Bandit Algorithms Research · Financial Literacy, Pension, Retirement Analysis