Optimistic Feasible Search for Closed-Loop Fair Threshold Decision-Making

Wenzhang Du

arXiv:2512.22313·cs.LG·December 30, 2025

Optimistic Feasible Search for Closed-Loop Fair Threshold Decision-Making

Wenzhang Du

PDF

Open Access

TL;DR

This paper introduces OFS, a simple grid-based online learning algorithm for fair threshold decision-making in closed-loop systems, effectively balancing reward maximization and fairness constraints.

Contribution

The paper proposes OFS, a novel optimistic feasible search algorithm that efficiently learns fair thresholds in closed-loop settings with feedback constraints.

Findings

01

OFS outperforms unconstrained and primal-dual baselines in reward and fairness.

02

OFS is near-oracle optimal among feasible fixed thresholds.

03

Experiments demonstrate OFS's effectiveness in synthetic and semi-synthetic benchmarks.

Abstract

Closed-loop decision-making systems (e.g., lending, screening, or recidivism risk assessment) often operate under fairness and service constraints while inducing feedback effects: decisions change who appears in the future, yielding non-stationary data and potentially amplifying disparities. We study online learning of a one-dimensional threshold policy from bandit feedback under demographic parity (DP) and, optionally, service-rate constraints. The learner observes only a scalar score each round and selects a threshold; reward and constraint residuals are revealed only for the chosen threshold. We propose Optimistic Feasible Search (OFS), a simple grid-based method that maintains confidence bounds for reward and constraint residuals for each candidate threshold. At each round, OFS selects a threshold that appears feasible under confidence bounds and, among those, maximizes optimistic…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Bandit Algorithms Research · Reinforcement Learning in Robotics · Explainable Artificial Intelligence (XAI)