Optimistic Feasible Search for Closed-Loop Fair Threshold Decision-Making
Wenzhang Du

TL;DR
This paper introduces OFS, a simple grid-based online learning algorithm for fair threshold decision-making in closed-loop systems, effectively balancing reward maximization and fairness constraints.
Contribution
The paper proposes OFS, a novel optimistic feasible search algorithm that efficiently learns fair thresholds in closed-loop settings with feedback constraints.
Findings
OFS outperforms unconstrained and primal-dual baselines in reward and fairness.
OFS is near-oracle optimal among feasible fixed thresholds.
Experiments demonstrate OFS's effectiveness in synthetic and semi-synthetic benchmarks.
Abstract
Closed-loop decision-making systems (e.g., lending, screening, or recidivism risk assessment) often operate under fairness and service constraints while inducing feedback effects: decisions change who appears in the future, yielding non-stationary data and potentially amplifying disparities. We study online learning of a one-dimensional threshold policy from bandit feedback under demographic parity (DP) and, optionally, service-rate constraints. The learner observes only a scalar score each round and selects a threshold; reward and constraint residuals are revealed only for the chosen threshold. We propose Optimistic Feasible Search (OFS), a simple grid-based method that maintains confidence bounds for reward and constraint residuals for each candidate threshold. At each round, OFS selects a threshold that appears feasible under confidence bounds and, among those, maximizes optimistic…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Bandit Algorithms Research · Reinforcement Learning in Robotics · Explainable Artificial Intelligence (XAI)
