AIRA: AI-Induced Risk Audit: A Structured Inspection Framework for AI-Generated Code
William M. Parris

TL;DR
This paper introduces AIRA, a structured framework for auditing AI-generated code to detect failure patterns that may conceal true system failures, with empirical validation across multiple datasets.
Contribution
The paper proposes the AIRA inspection framework and demonstrates its effectiveness in identifying failure-untruthful patterns in AI-generated code.
Findings
AI-generated code shows more high-severity failures than human code.
AIRA detects failure patterns across multiple programming languages.
AI-assisted code exhibits a skew toward fail-soft behavior.
Abstract
Practitioners have reported a directional pattern in AI-assisted code generation: AI-generated code tends to fail quietly, preserving the appearance of functionality while degrading or concealing guarantees. This paper introduces the Reward-Shaped Failure Hypothesis - the proposal that this pattern may reflect an artifact of optimization through human feedback rather than a random distribution of bugs. We define failure truthfulness as the property that a system's observable outputs accurately represent its internal success or failure state. We then present AIRA (AI-Induced Risk Audit), a deterministic 15-check inspection framework designed to detect failure-untruthful patterns in code. We report results from three studies: (1) an anonymized enterprise environment audit, (2) a balanced 600-file public corpus pilot, and (3) a strict matched-control replication comparing 955 AI-attributed…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
