From Sycophancy to Sensemaking: Premise Governance for Human-AI Decision Making

Raunak Jain

arXiv:2602.02378·cs.CL·March 26, 2026

From Sycophancy to Sensemaking: Premise Governance for Human-AI Decision Making

Raunak Jain

PDF

Open Access

TL;DR

This paper proposes a premise governance framework for human-AI decision making that emphasizes collaborative negotiation over knowledge, aiming to improve reliability and trustworthiness in complex, uncertain scenarios.

Contribution

It introduces a discrepancy-driven control loop and commitment gating mechanisms to shift from answer generation to collaborative premise management in human-AI partnerships.

Findings

01

Framework enhances trust through auditable premises.

02

Discrepancy detection improves alignment in decision-critical contexts.

03

Proposed evaluation criteria enable falsifiable assessment.

Abstract

As LLMs expand from assistance to decision support, a dangerous pattern emerges: fluent agreement without calibrated judgment. Low-friction assistants can become sycophantic, baking in implicit assumptions and pushing verification costs onto experts, while outcomes arrive too late to serve as reward signals. In deep-uncertainty decisions (where objectives are contested and reversals are costly), scaling fluent agreement amplifies poor commitments faster than it builds expertise. We argue reliable human-AI partnership requires a shift from answer generation to collaborative premise governance over a knowledge substrate, negotiating only what is decision-critical. A discrepancy-driven control loop operates over this substrate: detecting conflicts, localizing misalignment via typed discrepancies (teleological, epistemic, procedural), and triggering bounded negotiation through decision…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsEthics and Social Impacts of AI · Explainable Artificial Intelligence (XAI) · Human-Automation Interaction and Safety