Unmasking Biases and Reliability Concerns in Convolutional Neural Networks Analysis of Cancer Pathology Images

Michael Okonoda; Eder Martinez; Abhilekha Dalal; Lior Shamir

arXiv:2603.12445·eess.IV·March 16, 2026

Unmasking Biases and Reliability Concerns in Convolutional Neural Networks Analysis of Cancer Pathology Images

Michael Okonoda, Eder Martinez, Abhilekha Dalal, Lior Shamir

PDF

Open Access

TL;DR

This paper reveals that CNNs used in cancer pathology can achieve high accuracy on biased datasets lacking clinical information, raising concerns about the reliability of current evaluation practices.

Contribution

It critically examines the evaluation methods of CNNs in cancer pathology, highlighting biases and potential overestimation of model performance.

Findings

01

CNNs achieved up to 93% accuracy on non-clinical background segments.

02

Some architectures are more sensitive to dataset biases than others.

03

Common evaluation practices may lead to unreliable conclusions in cancer CNN research.

Abstract

Convolutional Neural Networks have shown promising effectiveness in identifying different types of cancer from radiographs. However, the opaque nature of CNNs makes it difficult to fully understand the way they operate, limiting their assessment to empirical evaluation. Here we study the soundness of the standard practices by which CNNs are evaluated for the purpose of cancer pathology. Thirteen highly used cancer benchmark datasets were analyzed, using four common CNN architectures and different types of cancer, such as melanoma, carcinoma, colorectal cancer, and lung cancer. We compared the accuracy of each model with that of datasets made of cropped segments from the background of the original images that do not contain clinically relevant content. Because the rendered datasets contain no clinical information, the null hypothesis is that the CNNs should provide mere chance-based…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAI in cancer detection · COVID-19 diagnosis using AI · Radiomics and Machine Learning in Medical Imaging