Attack-Resistant Uniform Fairness for Linear and Smooth Contextual Bandits
Qingwen Zhang, Wenjia Wang

TL;DR
This paper introduces attack-resistant algorithms for fair contextual bandits, balancing near-optimal regret with fairness guarantees, and addressing vulnerabilities to strategic manipulation.
Contribution
It proposes novel algorithms achieving minimax-optimal regret under fairness constraints and develops robustness against strategic attacks in contextual bandit settings.
Findings
Algorithms maintain fairness and efficiency under attack.
Robust variants achieve optimal regret with attack budgets.
Fairness can be exploited by adversaries with minimal resources.
Abstract
Modern systems, such as digital platforms and service systems, increasingly rely on contextual bandits for online decision-making; however, their deployment can inadvertently create unfair exposure among arms, undermining long-term platform sustainability and supplier trust. This paper studies the contextual bandit problem under a uniform -fairness constraint, and addresses its unique vulnerabilities to strategic manipulation. The fairness constraint ensures that preferential treatment is strictly justified by an arm's actual reward across all contexts and time horizons, using uniformity to prevent statistical loopholes. We develop novel algorithms that achieve (nearly) minimax-optimal regret for both linear and smooth reward functions, while maintaining strong -fairness guarantees, and further characterize the theoretically inherent yet asymptotically…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Bandit Algorithms Research · Age of Information Optimization · Ethics and Social Impacts of AI
