A feature-stable and explainable machine learning framework for trustworthy decision-making under incomplete clinical data

Justyna Andrys-Olek; Paulina Tworek; Luca Gherardini; Mark W. Ruddock; Mary Jo Kurt; Peter Fitzgerald; Jose Sousa

arXiv:2602.17364·cs.LG·February 20, 2026

A feature-stable and explainable machine learning framework for trustworthy decision-making under incomplete clinical data

Justyna Andrys-Olek, Paulina Tworek, Luca Gherardini, Mark W. Ruddock, Mary Jo Kurt, Peter Fitzgerald, Jose Sousa

PDF

Open Access

TL;DR

This paper introduces CACTUS, an explainable machine learning framework that enhances feature stability and trustworthiness in clinical data analysis, especially under conditions of data incompleteness and heterogeneity.

Contribution

CACTUS uniquely combines feature abstraction, interpretability, and stability analysis to improve robustness and trustworthiness of models on small, incomplete biomedical datasets.

Findings

01

CACTUS achieves high predictive accuracy with stable feature importance under missing data.

02

Feature stability correlates with trustworthiness and complements traditional performance metrics.

03

Demonstrated effectiveness on a real-world bladder cancer dataset.

Abstract

Machine learning models are increasingly applied to biomedical data, yet their adoption in high stakes domains remains limited by poor robustness, limited interpretability, and instability of learned features under realistic data perturbations, such as missingness. In particular, models that achieve high predictive performance may still fail to inspire trust if their key features fluctuate when data completeness changes, undermining reproducibility and downstream decision-making. Here, we present CACTUS (Comprehensive Abstraction and Classification Tool for Uncovering Structures), an explainable machine learning framework explicitly designed to address these challenges in small, heterogeneous, and incomplete clinical datasets. CACTUS integrates feature abstraction, interpretable classification, and systematic feature stability analysis to quantify how consistently informative features…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsExplainable Artificial Intelligence (XAI) · Machine Learning in Healthcare · Artificial Intelligence in Healthcare and Education