Evaluating Machine Learning Models against Clinical Protocols for   Enhanced Interpretability and Continuity of Care

Christel Sirocchi; Muhammad Suffian; Federico Sabbatini; Alessandro; Bogliolo; Sara Montagna

arXiv:2411.03105·cs.AI·November 6, 2024

Evaluating Machine Learning Models against Clinical Protocols for Enhanced Interpretability and Continuity of Care

Christel Sirocchi, Muhammad Suffian, Federico Sabbatini, Alessandro, Bogliolo, Sara Montagna

PDF

Open Access 1 Repo

TL;DR

This paper introduces metrics and methods to compare ML models with clinical protocols, focusing on accuracy and interpretability, validated on diabetes data, showing integrated models can match data-driven performance and improve explanation alignment.

Contribution

It proposes new metrics for assessing ML model accuracy against clinical protocols and a method to compare explanation similarity, enhancing interpretability and trust in clinical ML applications.

Findings

01

Integrated ML models achieve performance comparable to data-driven models.

02

Integrated models exhibit higher accuracy than traditional clinical protocols.

03

Explanation similarity between ML models and clinical rules is improved with integration.

Abstract

In clinical practice, decision-making relies heavily on established protocols, often formalised as rules. Concurrently, Machine Learning (ML) models, trained on clinical data, aspire to integrate into medical decision-making processes. However, despite the growing number of ML applications, their adoption into clinical practice remains limited. Two critical concerns arise, relevant to the notions of consistency and continuity of care: (a) accuracy - the ML model, albeit more accurate, might introduce errors that would not have occurred by applying the protocol; (b) interpretability - ML models operating as black boxes might make predictions based on relationships that contradict established clinical knowledge. In this context, the literature suggests using ML models integrating domain knowledge for improved accuracy and interpretability. However, there is a lack of appropriate metrics…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

ChristelSirocchi/XAI-similarity
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsArtificial Intelligence in Healthcare and Education · Health Systems, Economic Evaluations, Quality of Life · Clinical practice guidelines implementation

MethodsALIGN