The Productivity-Reliability Paradox: Specification-Driven Governance for AI-Augmented Software Development

Sabry E. Farrag

arXiv:2605.01160·cs.SE·May 5, 2026

The Productivity-Reliability Paradox: Specification-Driven Governance for AI-Augmented Software Development

Sabry E. Farrag

PDF

TL;DR

This paper investigates the Productivity-Reliability Paradox in AI-assisted software development, emphasizing the importance of specification discipline over model capability for dependable outcomes.

Contribution

It formally defines the PRP, proposes a taxonomy of AI methodologies, introduces a governance model, and evaluates practical instantiations through a pilot study.

Findings

01

Specification discipline is crucial for dependable AI-assisted development.

02

The PRP is influenced by task abstraction, codebase maturity, and developer experience.

03

Governance models can mitigate the PRP's effects.

Abstract

Since 2022, AI-powered coding assistants have produced contradictory evidence: controlled studies report 20-56% productivity gains on well-scoped tasks, while the most rigorous RCT documents a 19% slowdown for experienced developers, and telemetry across 10,000+ developers shows 98% more pull requests but 91% longer review times with flat delivery metrics. This paper argues these findings constitute the Productivity-Reliability Paradox (PRP): a systematic phenomenon emerging from non-deterministic code generators and insufficient specification discipline. Through a multivocal literature review of 67 sources (2022-2026), this paper: (1) formally defines the PRP with three moderating variables (task abstraction, codebase maturity, developer experience) and two amplifying mechanisms (code review bottleneck, context window constraint); (2) proposes the AI-Augmented Methodology Taxonomy…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.