Enhancing Weakly Supervised 3D Medical Image Segmentation through Probabilistic-aware Learning

Runmin Jiang; Zhaoxin Fan; Junhao Wu; Lenghan Zhu; Xin Huang; Tianyang Wang; Heng Huang; Min Xu

arXiv:2403.02566·eess.IV·June 23, 2025·1 cites

Enhancing Weakly Supervised 3D Medical Image Segmentation through Probabilistic-aware Learning

Runmin Jiang, Zhaoxin Fan, Junhao Wu, Lenghan Zhu, Xin Huang, Tianyang Wang, Heng Huang, Min Xu

PDF

Open Access 1 Repo 4 Reviews

TL;DR

This paper introduces a probabilistic-aware weakly supervised learning pipeline for 3D medical image segmentation, combining pseudo label generation, probabilistic self-attention, and confidence-based loss to improve accuracy with sparse annotations.

Contribution

The novel pipeline integrates probabilistic pseudo labels, a probabilistic transformer network, and a confidence-aware loss for improved weakly supervised 3D segmentation.

Findings

01

Achieves up to 18.1% improvement in Dice scores over existing methods.

02

Rivals fully supervised segmentation performance.

03

Effective on CT and MRI datasets.

Abstract

3D medical image segmentation is a challenging task with crucial implications for disease diagnosis and treatment planning. Recent advances in deep learning have significantly enhanced fully supervised medical image segmentation. However, this approach heavily relies on labor-intensive and time-consuming fully annotated ground-truth labels, particularly for 3D volumes. To overcome this limitation, we propose a novel probabilistic-aware weakly supervised learning pipeline, specifically designed for 3D medical imaging. Our pipeline integrates three innovative components: a Probability-based Pseudo Label Generation technique for synthesizing dense segmentation masks from sparse annotations, a Probabilistic Multi-head Self-Attention network for robust feature extraction within our Probabilistic Transformer Network, and a Probability-informed Segmentation Loss Function to enhance training…

Peer Reviews

Decision·ICLR 2024 Conference Withdrawn Submission

Reviewer 01Rating 3· reject, not good enoughConfidence 4

Strengths

- The proposed methodology looks reasonable. It makes sense to use probabilistic-aware learning in a weakly supervised settings for 3D medical image segmentation which could reduce the annotation cost. - The paper leverages recent powerful techniques (e.g., UNETR, etc..) and aims to improve with a Gaussian-based multi-head self-attention mechanism. - The paper provides ablation study and comparisons with the existing methods with a known public dataset. - The paper is well-written and easy-to-r

Weaknesses

- The major weakness is the evaluation. I think that the experiments are simulated and may not be reflective of the real world. In the process of sparse label generation, the sampling algorithm produces evenly distributed points as annotations, but this may not be realistic and practical. It is possible that the annotation generation process have a major impact on the overall performance increase. - Another problem with the evaluation is that the results in the paper come from the model's perfor

Reviewer 02Rating 3· reject, not good enoughConfidence 3

Strengths

(1)This work is aimed to solve the interesting problem of weakly-supervised 3D segmentation. (2)The manuscript is well wirtten and clearly organized.

Weaknesses

(1) Only evenly distributed annotations are evaluated, which is unavailabel in practice. The proposed method tackles the sparsely annotated points. As described in Sec 3.2.1., the annotators are instructed to select 3D points **evenly distributed on the surface of the targe organ**. However, in practice, the weak annotations exist in hospital systems might be distributed irregularly. If the proposed method is only evaluated on these evenly distributed annotations, it might lost the generalizatio

Reviewer 03Rating 3· reject, not good enoughConfidence 5

Strengths

1. The approach has demonstrated commendable results when evaluated on the BTCV dataset, showcasing its potential utility. 2. An advantage of the proposed method is its reliance on annotations that are more resource-efficient than exhaustive voxel-level annotations. 3. The paper stands out for its coherent organization and clear presentation of concepts.

Weaknesses

1. The paper's technical contribution seems limited. Notably, the described Probabilistic Multi-head Self-Attention (PMSA) mirrors the design presented by Guo et al. (2022). Moreover, leveraging probability for modulating the Cross Entropy loss isn't a novel endeavor in the domain. 2. A glaring omission is the absence of comparative analyses with established weakly-supervised semantic segmentation techniques, especially those harnessing Class Activation Mapping (CAM). 3. Regarding the labeling t

Reviewer 04Rating 3· reject, not good enoughConfidence 4

Strengths

The paper presents a probabilistic-aware pipeline tailored for 3D medical image segmentation. The writing flow is clear.

Weaknesses

For probability-based pseudo label generation scheme part, this work samples 3d point on the surface of the target organ. Instead of sampling, why not directly obtain dense annotations using the organ surface? The segmentation ground truth is nearly identical to the organ surface, making it unreasonable to use weakly supervised learning when ground truth annotations are available. Sections 3.3 and 3.4 exhibit similarities with VAE at a high level. It is unclear why a probabilistic approach is u

Code & Models

Repositories

runminjiang/pw4medseg
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAI in cancer detection · Medical Image Segmentation Techniques · Image Retrieval and Classification Techniques

MethodsAttention Is All You Need · Linear Layer · Layer Normalization · Byte Pair Encoding · Dropout · Multi-Head Attention · Softmax · Dense Connections · Label Smoothing · Adam