TRACE: A Metrologically-Grounded Engineering Framework for Trustworthy Agentic AI Systems in Operationally Critical Domains

Serhii Zabolotnii

arXiv:2605.03838·cs.CL·May 6, 2026

TRACE: A Metrologically-Grounded Engineering Framework for Trustworthy Agentic AI Systems in Operationally Critical Domains

Serhii Zabolotnii

PDF

TL;DR

TRACE is a comprehensive engineering framework for trustworthy agentic AI in critical domains, integrating a layered architecture, trust metrics, and model parsimony principles across diverse applications.

Contribution

It introduces a novel, metrologically grounded framework with explicit architecture, trust metrics, and a focus on model parsimony, applicable across various operational domains.

Findings

01

Validated across clinical, industrial, and judicial AI applications.

02

Introduced the Computational Parsimony Ratio (CPR) as a measure of model efficiency.

03

Demonstrated the importance of explicit LLM validation separation in architecture.

Abstract

We introduce TRACE, a cross-domain engineering framework for trustworthy agentic AI in operationally critical domains. TRACE combines a four-layer reference architecture with an explicit classical-ML vs. LLM-validator split (L2a/L2b), a stateful orchestration-and-escalation policy (L3), and bounded human supervision (L4); a metrologically grounded trust-metric suite mapped to GUM/VIM/ISO 17025; and a Model-Parsimony principle quantified by the Computational Parsimony Ratio (CPR). Three instantiations--clinical decision support, industrial multi-domain operations, and a judicial AI assistant--transfer the samearchitecture and metrics across principally different governance contexts. The L2a/L2b separation makes the use of large language models a deliberate design decision rather than an architectural default, with parsimony quantified through CPR. TRACE introduces CPR as a first-class…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.