Beyond Perfect Scores: Proof-by-Contradiction for Trustworthy Machine Learning
Dushan N. Wadduwage, Dineth Jayakody, Leonidas Zimianitis

TL;DR
This paper proposes a stochastic proof-by-contradiction test to evaluate the trustworthiness of machine learning models in biomedical applications, distinguishing genuine causal learning from spurious correlations.
Contribution
It introduces a novel, interpretable trustworthiness testing method based on label permutation and p-values, applicable across diverse biomedical ML models.
Findings
Effective in identifying models relying on true clinical cues
Distinguishes genuine causal models from overfitting or shortcut learning
Provides interpretable statistical measures for trust assessment
Abstract
Machine learning (ML) models show strong promise for new biomedical prediction tasks, but concerns about trustworthiness have hindered their clinical adoption. In particular, it is often unclear whether a model relies on true clinical cues or on spurious hierarchical correlations in the data. This paper introduces a simple yet broadly applicable trustworthiness test grounded in stochastic proof-by-contradiction. Instead of just showing high test performance, our approach trains and tests on spurious labels carefully permuted based on a potential outcomes framework. A truly trustworthy model should fail under such label permutation; comparable accuracy across real and permuted labels indicates overfitting, shortcut learning, or data leakage. Our approach quantifies this behavior through interpretable Fisher-style p-values, which are well understood by domain experts across medical and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExplainable Artificial Intelligence (XAI) · Artificial Intelligence in Healthcare and Education · Machine Learning in Materials Science
