Behavioural feasible set: Value alignment constraints on AI decision support
Taejin Park

TL;DR
This paper introduces the concept of a behavioural feasible set to understand how vendor-imposed value alignment constraints limit the decision-making flexibility of AI systems in organizations, revealing embedded stakeholder priorities.
Contribution
It formalizes the behavioural feasible set and demonstrates how alignment constraints reduce AI decision flexibility, impacting governance and stakeholder trade-offs.
Findings
Alignment compresses the recommendation set.
Alignment reduces system flexibility under contextual pressure.
Commercial models show similar or greater rigidity.
Abstract
When organisations adopt commercial AI systems for decision support, they inherit value judgements embedded by vendors that are neither transparent nor renegotiable. The governance puzzle is not whether AI can support decisions but which recommendations the system can actually produce given how its vendor has configured it. I formalise this as a behavioural feasible set, the range of recommendations reachable under vendor-imposed alignment constraints, and characterise diagnostic thresholds for when organisational requirements exceed the system's flexibility. In scenario-based experiments using binary decision scenarios and multi-stakeholder ranking tasks, I show that alignment materially compresses this set. Comparing pre- and post-alignment variants of an open-weight model isolates the mechanism: alignment makes the system substantially less able to shift its recommendation even under…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsEthics and Social Impacts of AI · Digital Platforms and Economics · Management and Organizational Studies
