Enhancing Weakly Supervised 3D Medical Image Segmentation through Probabilistic-aware Learning
Runmin Jiang, Zhaoxin Fan, Junhao Wu, Lenghan Zhu, Xin Huang, Tianyang Wang, Heng Huang, Min Xu

TL;DR
This paper introduces a probabilistic-aware weakly supervised learning pipeline for 3D medical image segmentation, combining pseudo label generation, probabilistic self-attention, and confidence-based loss to improve accuracy with sparse annotations.
Contribution
The novel pipeline integrates probabilistic pseudo labels, a probabilistic transformer network, and a confidence-aware loss for improved weakly supervised 3D segmentation.
Findings
Achieves up to 18.1% improvement in Dice scores over existing methods.
Rivals fully supervised segmentation performance.
Effective on CT and MRI datasets.
Abstract
3D medical image segmentation is a challenging task with crucial implications for disease diagnosis and treatment planning. Recent advances in deep learning have significantly enhanced fully supervised medical image segmentation. However, this approach heavily relies on labor-intensive and time-consuming fully annotated ground-truth labels, particularly for 3D volumes. To overcome this limitation, we propose a novel probabilistic-aware weakly supervised learning pipeline, specifically designed for 3D medical imaging. Our pipeline integrates three innovative components: a Probability-based Pseudo Label Generation technique for synthesizing dense segmentation masks from sparse annotations, a Probabilistic Multi-head Self-Attention network for robust feature extraction within our Probabilistic Transformer Network, and a Probability-informed Segmentation Loss Function to enhance training…
Peer Reviews
Decision·ICLR 2024 Conference Withdrawn Submission
- The proposed methodology looks reasonable. It makes sense to use probabilistic-aware learning in a weakly supervised settings for 3D medical image segmentation which could reduce the annotation cost. - The paper leverages recent powerful techniques (e.g., UNETR, etc..) and aims to improve with a Gaussian-based multi-head self-attention mechanism. - The paper provides ablation study and comparisons with the existing methods with a known public dataset. - The paper is well-written and easy-to-r
- The major weakness is the evaluation. I think that the experiments are simulated and may not be reflective of the real world. In the process of sparse label generation, the sampling algorithm produces evenly distributed points as annotations, but this may not be realistic and practical. It is possible that the annotation generation process have a major impact on the overall performance increase. - Another problem with the evaluation is that the results in the paper come from the model's perfor
(1)This work is aimed to solve the interesting problem of weakly-supervised 3D segmentation. (2)The manuscript is well wirtten and clearly organized.
(1) Only evenly distributed annotations are evaluated, which is unavailabel in practice. The proposed method tackles the sparsely annotated points. As described in Sec 3.2.1., the annotators are instructed to select 3D points **evenly distributed on the surface of the targe organ**. However, in practice, the weak annotations exist in hospital systems might be distributed irregularly. If the proposed method is only evaluated on these evenly distributed annotations, it might lost the generalizatio
1. The approach has demonstrated commendable results when evaluated on the BTCV dataset, showcasing its potential utility. 2. An advantage of the proposed method is its reliance on annotations that are more resource-efficient than exhaustive voxel-level annotations. 3. The paper stands out for its coherent organization and clear presentation of concepts.
1. The paper's technical contribution seems limited. Notably, the described Probabilistic Multi-head Self-Attention (PMSA) mirrors the design presented by Guo et al. (2022). Moreover, leveraging probability for modulating the Cross Entropy loss isn't a novel endeavor in the domain. 2. A glaring omission is the absence of comparative analyses with established weakly-supervised semantic segmentation techniques, especially those harnessing Class Activation Mapping (CAM). 3. Regarding the labeling t
The paper presents a probabilistic-aware pipeline tailored for 3D medical image segmentation. The writing flow is clear.
For probability-based pseudo label generation scheme part, this work samples 3d point on the surface of the target organ. Instead of sampling, why not directly obtain dense annotations using the organ surface? The segmentation ground truth is nearly identical to the organ surface, making it unreasonable to use weakly supervised learning when ground truth annotations are available. Sections 3.3 and 3.4 exhibit similarities with VAE at a high level. It is unclear why a probabilistic approach is u
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAI in cancer detection · Medical Image Segmentation Techniques · Image Retrieval and Classification Techniques
MethodsAttention Is All You Need · Linear Layer · Layer Normalization · Byte Pair Encoding · Dropout · Multi-Head Attention · Softmax · Dense Connections · Label Smoothing · Adam
