Incentive-compatible Bandits: Importance Weighting No More
Julian Zimmert, Teodor V. Marinov

TL;DR
This paper introduces the first incentive-compatible bandit algorithms with near-optimal regret bounds, improving upon previous work by removing importance weighting and achieving strong guarantees in stochastic and adversarial settings.
Contribution
It presents novel incentive-compatible algorithms with $O( oot{K}{T})$ regret, simplifies existing algorithms via loss-biasing, and achieves best-of-both-worlds guarantees.
Findings
Achieved $O( oot{K}{T})$ regret bounds for incentive-compatible bandit algorithms.
Demonstrated that simple loss-biasing improves regret bounds of existing algorithms.
Developed a bandit algorithm with nearly optimal regret that operates without importance-weighted estimators.
Abstract
We study the problem of incentive-compatible online learning with bandit feedback. In this class of problems, the experts are self-interested agents who might misrepresent their preferences with the goal of being selected most often. The goal is to devise algorithms which are simultaneously incentive-compatible, that is the experts are incentivised to report their true preferences, and have no regret with respect to the preferences of the best fixed expert in hindsight. \citet{freeman2020no} propose an algorithm in the full information setting with optimal regret and regret in the bandit setting. In this work we propose the first incentive-compatible algorithms that enjoy regret bounds. We further demonstrate how simple loss-biasing allows the algorithm proposed in Freeman et al. 2020 to enjoy …
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFinancial Markets and Investment Strategies · Advanced Bandit Algorithms Research · Financial Literacy, Pension, Retirement Analysis
