Keypoint-Augmented Self-Supervised Learning for Medical Image Segmentation with Limited Annotation
Zhangsihao Yang, Mengwei Ren, Kaize Ding, Guido Gerig, Yalin Wang

TL;DR
This paper introduces a keypoint-augmented self-supervised learning framework for medical image segmentation that captures both short- and long-range spatial dependencies, improving performance with limited annotations.
Contribution
It proposes a novel fusion layer incorporating keypoints for enhanced self-attention and a dual-scale pretraining strategy, advancing medical image segmentation accuracy.
Findings
Outperforms CNN and Transformer-based UNets on MRI and CT segmentation.
Achieves state-of-the-art results with limited annotations.
Provides more robust self-attention mechanisms.
Abstract
Pretraining CNN models (i.e., UNet) through self-supervision has become a powerful approach to facilitate medical image segmentation under low annotation regimes. Recent contrastive learning methods encourage similar global representations when the same image undergoes different transformations, or enforce invariance across different image/patch features that are intrinsically correlated. However, CNN-extracted global and local features are limited in capturing long-range spatial dependencies that are essential in biological anatomy. To this end, we present a keypoint-augmented fusion layer that extracts representations preserving both short- and long-range self-attention. In particular, we augment the CNN feature map at multiple scales by incorporating an additional input that learns long-range spatial self-attention among localized keypoint features. Further, we introduce both global…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsRadiomics and Machine Learning in Medical Imaging · AI in cancer detection · Medical Imaging and Analysis
