Satisficing Regret Minimization in Bandits: Constant Rate and Light-Tailed Distribution

Qing Feng; Tianyi Ma; Ruihao Zhu

arXiv:2406.06802·stat.ML·July 1, 2025·1 cites

Satisficing Regret Minimization in Bandits: Constant Rate and Light-Tailed Distribution

Qing Feng, Tianyi Ma, Ruihao Zhu

PDF

Open Access

TL;DR

This paper introduces algorithms for satisficing regret minimization in bandit problems, achieving constant or light-tailed regret distributions, and demonstrates their effectiveness through theoretical guarantees and empirical validation.

Contribution

The paper proposes the SELECT and SELECT-LITE algorithms that attain constant and light-tailed satisficing regret, respectively, in bandit optimization, extending applicability to heavy-tailed oracle scenarios.

Findings

01

SELECT achieves constant expected satisficing regret in realizable cases.

02

SELECT-LITE attains light-tailed satisficing regret distribution and sub-linear standard regret.

03

Algorithms perform well on synthetic and real-world datasets, including dynamic pricing.

Abstract

Motivated by the concept of satisficing in decision-making, we consider the problem of satisficing regret minimization in bandit optimization. In this setting, the learner aims at selecting satisficing arms (arms with mean reward exceeding a certain threshold value) as frequently as possible. The performance is measured by satisficing regret, which is the cumulative deficit of the chosen arm's mean reward compared to the threshold. We propose SELECT, a general algorithmic template for Satisficing REgret Minimization via SampLing and LowEr Confidence bound Testing, that attains constant expected satisficing regret for a wide variety of bandit optimization problems in the realizable case (i.e., a satisficing arm exists). As a complement, SELECT also enjoys the same (standard) regret guarantee as the oracle in the non-realizable case. To further ensure stability of the algorithm, we…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAI in Service Interactions