VoCo: A Simple-yet-Effective Volume Contrastive Learning Framework for 3D Medical Image Analysis
Linshan Wu, Jiaxin Zhuang, Hao Chen

TL;DR
VoCo introduces a simple contrastive learning framework that leverages the consistent spatial relationships in 3D medical images to improve high-level semantic understanding in downstream tasks without annotations.
Contribution
The paper proposes a novel Volume Contrast (VoCo) framework that uses contextual position priors for self-supervised pre-training in 3D medical image analysis, enhancing semantic representations.
Findings
Outperforms existing methods on six downstream tasks
Effectively encodes contextual position priors without annotations
Improves high-level semantic understanding in 3D medical images
Abstract
Self-Supervised Learning (SSL) has demonstrated promising results in 3D medical image analysis. However, the lack of high-level semantics in pre-training still heavily hinders the performance of downstream tasks. We observe that 3D medical images contain relatively consistent contextual position information, i.e., consistent geometric relations between different organs, which leads to a potential way for us to learn consistent semantic representations in pre-training. In this paper, we propose a simple-yet-effective Volume Contrast (VoCo) framework to leverage the contextual position priors for pre-training. Specifically, we first generate a group of base crops from different regions while enforcing feature discrepancy among them, where we employ them as class assignments of different regions. Then, we randomly crop sub-volumes and predict them belonging to which class (located at which…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
- 🤗AnonRes/PrimusM-OpenMind-MAEmodel· 1 dl1 dl
- 🤗AnonRes/ResEncL-OpenMind-MAEmodel· 19 dl· ♡ 119 dl♡ 1
- 🤗AnonRes/ResEncL-OpenMind-S3Dmodel· 11 dl11 dl
- 🤗AnonRes/ResEncL-OpenMind-VFmodel· 6 dl6 dl
- 🤗AnonRes/ResEncL-OpenMind-VoComodel· 6 dl6 dl
- 🤗AnonRes/ResEncL-OpenMind-MGmodel
- 🤗AnonRes/ResEncL-OpenMind-SimCLRmodel· 2 dl2 dl
- 🤗AnonRes/ResEncL-OpenMind-SwinUNETRmodel
- 🤗AnonRes/PrimusM-OpenMind-SimMIMmodel· 4 dl4 dl
- 🤗AnonRes/PrimusM-OpenMind-MGmodel
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMedical Image Segmentation Techniques · AI in cancer detection · Radiomics and Machine Learning in Medical Imaging
MethodsBalanced Selection
