From Sycophancy to Sensemaking: Premise Governance for Human-AI Decision Making
Raunak Jain

TL;DR
This paper proposes a premise governance framework for human-AI decision making that emphasizes collaborative negotiation over knowledge, aiming to improve reliability and trustworthiness in complex, uncertain scenarios.
Contribution
It introduces a discrepancy-driven control loop and commitment gating mechanisms to shift from answer generation to collaborative premise management in human-AI partnerships.
Findings
Framework enhances trust through auditable premises.
Discrepancy detection improves alignment in decision-critical contexts.
Proposed evaluation criteria enable falsifiable assessment.
Abstract
As LLMs expand from assistance to decision support, a dangerous pattern emerges: fluent agreement without calibrated judgment. Low-friction assistants can become sycophantic, baking in implicit assumptions and pushing verification costs onto experts, while outcomes arrive too late to serve as reward signals. In deep-uncertainty decisions (where objectives are contested and reversals are costly), scaling fluent agreement amplifies poor commitments faster than it builds expertise. We argue reliable human-AI partnership requires a shift from answer generation to collaborative premise governance over a knowledge substrate, negotiating only what is decision-critical. A discrepancy-driven control loop operates over this substrate: detecting conflicts, localizing misalignment via typed discrepancies (teleological, epistemic, procedural), and triggering bounded negotiation through decision…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsEthics and Social Impacts of AI · Explainable Artificial Intelligence (XAI) · Human-Automation Interaction and Safety
