AI Integrity: A New Paradigm for Verifiable AI Governance
Seulki Lee

TL;DR
This paper proposes AI Integrity as a verifiable governance paradigm focusing on protecting and auditing the layered authority stack of AI systems to ensure transparency and prevent manipulation.
Contribution
It introduces the AI Integrity concept, defines the Authority Stack model, and develops the PRISM framework for measuring and maintaining AI system integrity.
Findings
Defines the four-layer Authority Stack model for AI governance.
Introduces the PRISM framework with six core metrics for verification.
Distinguishes AI Integrity from existing paradigms by emphasizing procedural transparency.
Abstract
AI systems increasingly shape high-stakes decisions in healthcare, law, defense, and education, yet existing governance paradigms -- AI Ethics, AI Safety, and AI Alignment -- share a common limitation: they evaluate outcomes rather than verifying the reasoning process itself. This paper introduces AI Integrity, a concept defined as a state in which the Authority Stack of an AI system -- its layered hierarchy of values, epistemological standards, source preferences, and data selection criteria -- is protected from corruption, contamination, manipulation, and bias, and maintained in a verifiable manner. We distinguish AI Integrity from the three existing paradigms, define the Authority Stack as a 4-layer cascade model (Normative, Epistemic, Source, and Data Authority) grounded in established academic frameworks -- Schwartz Basic Human Values for normative authority, Walton argumentation…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
