Beyond Perfect Scores: Proof-by-Contradiction for Trustworthy Machine Learning

Dushan N. Wadduwage; Dineth Jayakody; Leonidas Zimianitis

arXiv:2601.06704·cs.LG·January 13, 2026

Beyond Perfect Scores: Proof-by-Contradiction for Trustworthy Machine Learning

Dushan N. Wadduwage, Dineth Jayakody, Leonidas Zimianitis

PDF

Open Access

TL;DR

This paper proposes a stochastic proof-by-contradiction test to evaluate the trustworthiness of machine learning models in biomedical applications, distinguishing genuine causal learning from spurious correlations.

Contribution

It introduces a novel, interpretable trustworthiness testing method based on label permutation and p-values, applicable across diverse biomedical ML models.

Findings

01

Effective in identifying models relying on true clinical cues

02

Distinguishes genuine causal models from overfitting or shortcut learning

03

Provides interpretable statistical measures for trust assessment

Abstract

Machine learning (ML) models show strong promise for new biomedical prediction tasks, but concerns about trustworthiness have hindered their clinical adoption. In particular, it is often unclear whether a model relies on true clinical cues or on spurious hierarchical correlations in the data. This paper introduces a simple yet broadly applicable trustworthiness test grounded in stochastic proof-by-contradiction. Instead of just showing high test performance, our approach trains and tests on spurious labels carefully permuted based on a potential outcomes framework. A truly trustworthy model should fail under such label permutation; comparable accuracy across real and permuted labels indicates overfitting, shortcut learning, or data leakage. Our approach quantifies this behavior through interpretable Fisher-style p-values, which are well understood by domain experts across medical and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsExplainable Artificial Intelligence (XAI) · Artificial Intelligence in Healthcare and Education · Machine Learning in Materials Science