Online Decision Making with Fairness over Time
Rui Chen, Oktay Gunluk, Andrea Lodi, Guanyi Wang

TL;DR
This paper introduces an online decision-making framework that balances reward maximization with complex, long-term fairness constraints, using a primal-dual algorithm that adapts dynamically over time.
Contribution
It proposes a novel primal-dual algorithm for online decisions with general fairness constraints, achieving sublinear regret and constraint violation guarantees.
Findings
Achieves $ ilde{O}( oot{m}T)$ regret in expected reward.
Guarantees $O( oot{m}T)$ violation of fairness constraints.
Extends guarantees to demand patterns like periodicity and perturbation.
Abstract
Online platforms increasingly rely on sequential decision-making algorithms to allocate resources, match users, or control exposure, while facing growing pressure to ensure fairness over time. We study a general online decision-making framework in which a platform repeatedly makes decisions from possibly non-convex and discrete feasible sets, such as indivisible assignments or assortment choices, to maximize accumulated reward. Importantly, these decisions must jointly satisfy a set of general, -dimensional, potentially unbounded but convex global constraints, which model diverse long-term fairness goals beyond simple budget caps. We develop a primal-dual algorithm that interprets fairness constraints as dynamic prices and updates them online based on observed outcomes. The algorithm is simple to implement, requiring only the solution of perturbed local optimization problems at each…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsOptimization and Search Problems · Advanced Bandit Algorithms Research · Auction Theory and Applications
