TL;DR
SegTTA is a training-free test-time augmentation framework that enhances zero-shot medical image segmentation by combining specific augmentations and voting, improving performance across various datasets.
Contribution
It introduces a novel, training-free test-time augmentation method that improves segmentation accuracy without retraining models, using multiple augmentations and weighted voting.
Findings
Consistent performance improvements across three diverse datasets.
Large organs benefit from intensity augmentations, small lesions from noise.
Achieves higher mIoU and aIoU, and reduces HD95 on hepatic vessel dataset.
Abstract
Increasingly advanced data augmentation techniques have greatly aided clinical medical research, increasing data diversity and improving model generalization capabilities. Although most current basic models exhibit strong generalization abilities, image quality varies due to differences in equipment and operators. To address these challenges, we present SegTTA, a framework that improves medical image segmentation without model retraining by combining four augmentations (Gamma correction, Contrast enhancement, Gaussian blur, Gaussian noise) with weighted voting across multiple MedSAM2 checkpoints. Experiments demonstrate consistent improvements across three diverse datasets: healthy uterus segmentation, uterine myoma detection, and multi class hepatic structure segmentation. Ablation studies reveal that large organs benefit from intensity augmentations while small lesions require noise…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
