A feature-stable and explainable machine learning framework for trustworthy decision-making under incomplete clinical data
Justyna Andrys-Olek, Paulina Tworek, Luca Gherardini, Mark W. Ruddock, Mary Jo Kurt, Peter Fitzgerald, Jose Sousa

TL;DR
This paper introduces CACTUS, an explainable machine learning framework that enhances feature stability and trustworthiness in clinical data analysis, especially under conditions of data incompleteness and heterogeneity.
Contribution
CACTUS uniquely combines feature abstraction, interpretability, and stability analysis to improve robustness and trustworthiness of models on small, incomplete biomedical datasets.
Findings
CACTUS achieves high predictive accuracy with stable feature importance under missing data.
Feature stability correlates with trustworthiness and complements traditional performance metrics.
Demonstrated effectiveness on a real-world bladder cancer dataset.
Abstract
Machine learning models are increasingly applied to biomedical data, yet their adoption in high stakes domains remains limited by poor robustness, limited interpretability, and instability of learned features under realistic data perturbations, such as missingness. In particular, models that achieve high predictive performance may still fail to inspire trust if their key features fluctuate when data completeness changes, undermining reproducibility and downstream decision-making. Here, we present CACTUS (Comprehensive Abstraction and Classification Tool for Uncovering Structures), an explainable machine learning framework explicitly designed to address these challenges in small, heterogeneous, and incomplete clinical datasets. CACTUS integrates feature abstraction, interpretable classification, and systematic feature stability analysis to quantify how consistently informative features…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExplainable Artificial Intelligence (XAI) · Machine Learning in Healthcare · Artificial Intelligence in Healthcare and Education
