Delving Aleatoric Uncertainty in Medical Image Segmentation via Vision Foundation Models

Ruiyang Li; Fang Liu; Licheng Jiao; Xinglin Xie; Jiayao Hao; Shuo Li; Xu Liu; Jingyi Yang; Lingling Li; Puhua Chen; Wenping Ma

arXiv:2604.10963·cs.AI·April 14, 2026

Delving Aleatoric Uncertainty in Medical Image Segmentation via Vision Foundation Models

Ruiyang Li, Fang Liu, Licheng Jiao, Xinglin Xie, Jiayao Hao, Shuo Li, Xu Liu, Jingyi Yang, Lingling Li, Puhua Chen, Wenping Ma

PDF

TL;DR

This paper introduces a novel approach leveraging vision foundation models to estimate and utilize aleatoric uncertainty in medical image segmentation, improving robustness and accuracy across multiple datasets.

Contribution

It systematically explores intrinsic data uncertainty using feature diversity and singular value energy, proposing uncertainty-aware filtering and training strategies.

Findings

01

Significant performance improvements on five public datasets.

02

Effective noise filtering and adaptive loss weighting enhance model robustness.

03

Broad applicability across different network architectures.

Abstract

Medical image segmentation supports clinical workflows by precisely delineating anatomical structures and lesions. However, medical image datasets medical image datasets suffer from acquisition noise and annotation ambiguity, causing pervasive data uncertainty that substantially undermines model robustness. Existing research focuses primarily on model architectural improvements and predictive reliability estimation, while systematic exploration of the intrinsic data uncertainty remains insufficient. To address this gap, this work proposes leveraging the universal representation capabilities of visual foundation models to estimate inherent data uncertainty. Specifically, we analyze the feature diversity of the model's decoded representations and quantify their singular value energy to define the semantic perception scale for each class, thereby measuring sample difficulty and aleatoric…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.