Evaluating Machine Learning Models against Clinical Protocols for Enhanced Interpretability and Continuity of Care
Christel Sirocchi, Muhammad Suffian, Federico Sabbatini, Alessandro, Bogliolo, Sara Montagna

TL;DR
This paper introduces metrics and methods to compare ML models with clinical protocols, focusing on accuracy and interpretability, validated on diabetes data, showing integrated models can match data-driven performance and improve explanation alignment.
Contribution
It proposes new metrics for assessing ML model accuracy against clinical protocols and a method to compare explanation similarity, enhancing interpretability and trust in clinical ML applications.
Findings
Integrated ML models achieve performance comparable to data-driven models.
Integrated models exhibit higher accuracy than traditional clinical protocols.
Explanation similarity between ML models and clinical rules is improved with integration.
Abstract
In clinical practice, decision-making relies heavily on established protocols, often formalised as rules. Concurrently, Machine Learning (ML) models, trained on clinical data, aspire to integrate into medical decision-making processes. However, despite the growing number of ML applications, their adoption into clinical practice remains limited. Two critical concerns arise, relevant to the notions of consistency and continuity of care: (a) accuracy - the ML model, albeit more accurate, might introduce errors that would not have occurred by applying the protocol; (b) interpretability - ML models operating as black boxes might make predictions based on relationships that contradict established clinical knowledge. In this context, the literature suggests using ML models integrating domain knowledge for improved accuracy and interpretability. However, there is a lack of appropriate metrics…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsArtificial Intelligence in Healthcare and Education · Health Systems, Economic Evaluations, Quality of Life · Clinical practice guidelines implementation
MethodsALIGN
