From Black-Box Confidence to Measurable Trust in Clinical AI: A Framework for Evidence, Supervision, and Staged Autonomy

Serhii Zabolotnii; Viktoriia Holinko; Olha Antonenko

arXiv:2604.26671·cs.CL·April 30, 2026

TL;DR

This paper presents a practical framework for trustworthy clinical AI that emphasizes evidence, supervision, and staged autonomy, integrating deterministic logic with AI assistants and human oversight for reliable medical decision-making.

Contribution

The paper introduces a system architecture that combines a deterministic core, a patient-specific AI assistant, multi-tier model escalation, and a human supervision layer, together with trust metrics grounded in measurement principles for assessing trust operationally.

Findings

01. Trust depends on selective verification of clinically critical findings.

02. Modular prompting enables scaling clinical depth without performance loss.

03. Trustworthiness is achieved through system architecture, not individual models.

Abstract

Trust in clinical artificial intelligence (AI) cannot be reduced to model accuracy, fluency of generation, or overall positive user impression. In medicine, trust must be engineered as a measurable system property grounded in evidence, supervision, and operational boundaries of AI autonomy. This article proposes a practical framework for trustworthy clinical AI built around three principles: evidence, supervision, and staged autonomy. Rather than replacing deterministic clinical logic wholesale with end-to-end black-box models, the proposed approach combines a deterministic core, a patient-specific AI assistant for contextual validation, a multi-tier model escalation mechanism, and a human supervision layer for verification, escalation, and risk control. We demonstrate that trust also depends on selective verification of clinically critical findings, bounded clinical context,…
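
The layered design described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: every name here (Finding, Routing, deterministic_core, route_finding), the out-of-range rule, and the 0.9 confidence threshold are hypothetical and chosen only for illustration. The sketch assumes that a deterministic rule produces a finding, that clinically critical findings always go to a human, and that non-critical findings are passed through progressively stronger model tiers until one answers with enough confidence.

```python
from dataclasses import dataclass
from typing import Callable, Optional, Sequence, Tuple


@dataclass(frozen=True)
class Finding:
    name: str
    critical: bool  # assumption: critical findings are always human-verified


# Hypothetical model tier: returns (agrees_with_core, confidence in [0, 1]).
ModelTier = Callable[[Finding], Tuple[bool, float]]


@dataclass(frozen=True)
class Routing:
    finding: Finding
    route: str                # "auto_accept" or "human_review"
    tier_used: Optional[int]  # index of the tier that settled it, if any


def deterministic_core(name: str, value: float, low: float, high: float,
                       critical_margin: float) -> Optional[Finding]:
    """Toy deterministic rule: flag a value outside [low, high].

    A value at least `critical_margin` outside the range is marked critical.
    The numbers are placeholders, not clinical guidance.
    """
    if low <= value <= high:
        return None
    distance = (low - value) if value < low else (value - high)
    return Finding(name=f"{name}_out_of_range",
                   critical=distance >= critical_margin)


def route_finding(finding: Finding, tiers: Sequence[ModelTier],
                  min_confidence: float = 0.9) -> Routing:
    """Selective verification plus multi-tier escalation (illustrative)."""
    # Critical findings skip the models and go straight to a human.
    if finding.critical:
        return Routing(finding, "human_review", None)
    # Try cheaper tiers first; escalate while confidence is too low.
    for index, tier in enumerate(tiers):
        agrees, confidence = tier(finding)
        if confidence >= min_confidence:
            # A confident disagreement with the core also needs review.
            route = "auto_accept" if agrees else "human_review"
            return Routing(finding, route, index)
    # No tier was confident enough: fall back to human supervision.
    return Routing(finding, "human_review", None)


if __name__ == "__main__":
    # Stand-in tiers with fixed answers; real tiers would call models.
    small_tier = lambda f: (True, 0.6)
    large_tier = lambda f: (True, 0.95)
    found = deterministic_core("lab_value", 11.0, low=0.0, high=10.0,
                               critical_margin=5.0)
    if found is not None:
        print(route_finding(found, [small_tier, large_tier]))
```

In this sketch the human reviewer is the default outcome: a finding is accepted automatically only when it is non-critical and some tier confidently agrees with the deterministic core. That ordering is one way to read "staged autonomy", under the assumptions stated above.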
