Attack-Resistant Uniform Fairness for Linear and Smooth Contextual Bandits

Qingwen Zhang; Wenjia Wang

arXiv:2602.04125·stat.ML·February 5, 2026

Attack-Resistant Uniform Fairness for Linear and Smooth Contextual Bandits

Qingwen Zhang, Wenjia Wang

PDF

Open Access

TL;DR

This paper introduces attack-resistant algorithms for fair contextual bandits, balancing near-optimal regret with fairness guarantees, and addressing vulnerabilities to strategic manipulation.

Contribution

It proposes novel algorithms achieving minimax-optimal regret under fairness constraints and develops robustness against strategic attacks in contextual bandit settings.

Findings

01

Algorithms maintain fairness and efficiency under attack.

02

Robust variants achieve optimal regret with attack budgets.

03

Fairness can be exploited by adversaries with minimal resources.

Abstract

Modern systems, such as digital platforms and service systems, increasingly rely on contextual bandits for online decision-making; however, their deployment can inadvertently create unfair exposure among arms, undermining long-term platform sustainability and supplier trust. This paper studies the contextual bandit problem under a uniform $(1 - δ)$ -fairness constraint, and addresses its unique vulnerabilities to strategic manipulation. The fairness constraint ensures that preferential treatment is strictly justified by an arm's actual reward across all contexts and time horizons, using uniformity to prevent statistical loopholes. We develop novel algorithms that achieve (nearly) minimax-optimal regret for both linear and smooth reward functions, while maintaining strong $(1 - \tilde{O} (1/ T))$ -fairness guarantees, and further characterize the theoretically inherent yet asymptotically…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Bandit Algorithms Research · Age of Information Optimization · Ethics and Social Impacts of AI