Modeling Expert AI Diagnostic Alignment via Immutable Inference Snapshots

Dimitrios P. Panagoulias; Evangelia-Aikaterini Tsichrintzi; Georgios Savvidis; Evridiki Tsoureli-Nikita

arXiv:2602.22973·cs.AI·February 27, 2026

Modeling Expert AI Diagnostic Alignment via Immutable Inference Snapshots

Dimitrios P. Panagoulias, Evangelia-Aikaterini Tsichrintzi, Georgios Savvidis, Evridiki Tsoureli-Nikita

PDF

Open Access

TL;DR

This paper presents a structured framework for analyzing human-in-the-loop validation in clinical AI, using immutable inference snapshots and a multi-level concordance evaluation to better understand diagnostic alignment and correction dynamics.

Contribution

It introduces a diagnostic alignment framework with immutable inference states and a comprehensive concordance evaluation, advancing structured analysis of expert validation in clinical AI.

Findings

01

Exact agreement rate was 71.4% and stable under semantic similarity.

02

Structured cross-category analysis achieved 100% comprehensive concordance.

03

Binary lexical evaluation underestimates clinically meaningful alignment.

Abstract

Human-in-the-loop validation is essential in safety-critical clinical AI, yet the transition between initial model inference and expert correction is rarely analyzed as a structured signal. We introduce a diagnostic alignment framework in which the AI-generated image based report is preserved as an immutable inference state and systematically compared with the physician-validated outcome. The inference pipeline integrates a vision-enabled large language model, BERT- based medical entity extraction, and a Sequential Language Model Inference (SLMI) step to enforce domain-consistent refinement prior to expert review. Evaluation on 21 dermatological cases (21 complete AI physician pairs) em- ployed a four-level concordance framework comprising exact primary match rate (PMR), semantic similarity-adjusted rate (AMR), cross-category alignment, and Comprehensive Concordance Rate (CCR). Exact…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsArtificial Intelligence in Healthcare and Education · Machine Learning in Healthcare · Explainable Artificial Intelligence (XAI)