# Open-source framework for detecting bias and overfitting for large pathology images

**Authors:** Anders Sildnes, Nikita Shvetsov, Masoud Tafavvoghi, Vi Ngoc-Nha Tran, Kajsa Møllersen, Lill-Tove Rasmussen Busund, Thomas K. Kilvær, Lars Ailo Bongo

PMC · DOI: 10.1371/journal.pone.0341715 · PLOS One · 2026-02-19

## TL;DR

This paper introduces an open-source tool to detect and address bias and overfitting in deep learning models analyzing large pathology images.

## Contribution

The novel contribution is a model-architecture-agnostic framework for debugging deep learning models in histopathology.

## Key findings

- The framework successfully replicates known bias patterns in pre-trained and self-supervised models.
- It is computationally efficient and integrates with the MONAI framework for widespread use.
- The tool is open-source and tested on a widely used histopathology dataset.

## Abstract

Even foundational models trained on large-scale datasets may learn to rely on non-relevant artifacts such as background color or color intensity, leading to overfitting and/or bias. To ensure the robustness of deep learning applications, there is a need for methods to detect and remove the use of these artifacts. Existing debugging methods are often domain- and model-architecture-specific, and may be computationally expensive, hindering widespread use. We propose a model-architecture-agnostic framework to debug deep learning models. To demonstrate the utility of our framework, we test it using a widely used dataset from histopathology, which has been tested in other literature. The dataset features very large images that typically demand large computational resources. We demonstrate that the framework can replicate known bias patterns in a pre-trained foundation model (Phikon-v2) and a self-trained self-supervised model (MoCo v1). Our framework contributes to the development of more reliable, accurate, and generalizable models for WSI analysis, and is available as an open-source tool integrated with the MONAI framework at https://github.com/uit-hdl/feature-inspect.

## Full-text entities

- **Diseases:** breast cancer (MESH:D001943), WSI (MESH:C564543), LP (MESH:D017499), COVID (MESH:D000086382), CL (MESH:D007859), LUSC (MESH:D002294), cancer (MESH:D009369)
- **Chemicals:** GPU (-)
- **Species:** Homo sapiens (human, species) [taxon 9606]
- **Mutations:** L40S
- **Cell lines:** Phikon-v2 — Mus musculus (Mouse), Spontaneously immortalized cell line (CVCL_B9UX)

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12919778/full.md

## Figures

7 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12919778/full.md

## References

68 references — full list in the complete paper: https://tomesphere.com/paper/PMC12919778/full.md

---
Source: https://tomesphere.com/paper/PMC12919778