AI Integrity: A New Paradigm for Verifiable AI Governance

Seulki Lee

arXiv:2604.11065·cs.AI·April 14, 2026

AI Integrity: A New Paradigm for Verifiable AI Governance

Seulki Lee

PDF

TL;DR

This paper proposes AI Integrity as a verifiable governance paradigm focusing on protecting and auditing the layered authority stack of AI systems to ensure transparency and prevent manipulation.

Contribution

It introduces the AI Integrity concept, defines the Authority Stack model, and develops the PRISM framework for measuring and maintaining AI system integrity.

Findings

01

Defines the four-layer Authority Stack model for AI governance.

02

Introduces the PRISM framework with six core metrics for verification.

03

Distinguishes AI Integrity from existing paradigms by emphasizing procedural transparency.

Abstract

AI systems increasingly shape high-stakes decisions in healthcare, law, defense, and education, yet existing governance paradigms -- AI Ethics, AI Safety, and AI Alignment -- share a common limitation: they evaluate outcomes rather than verifying the reasoning process itself. This paper introduces AI Integrity, a concept defined as a state in which the Authority Stack of an AI system -- its layered hierarchy of values, epistemological standards, source preferences, and data selection criteria -- is protected from corruption, contamination, manipulation, and bias, and maintained in a verifiable manner. We distinguish AI Integrity from the three existing paradigms, define the Authority Stack as a 4-layer cascade model (Normative, Epistemic, Source, and Data Authority) grounded in established academic frameworks -- Schwartz Basic Human Values for normative authority, Walton argumentation…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.