VoCo: A Simple-yet-Effective Volume Contrastive Learning Framework for 3D Medical Image Analysis

Linshan Wu; Jiaxin Zhuang; Hao Chen

arXiv:2402.17300·eess.IV·April 19, 2024·CVPR·1 cites

TL;DR

VoCo introduces a simple contrastive learning framework that leverages the consistent spatial relationships in 3D medical images to improve high-level semantic understanding in downstream tasks without annotations.

Contribution

The paper proposes a novel Volume Contrast (VoCo) framework that uses contextual position priors for self-supervised pre-training in 3D medical image analysis, enhancing semantic representations.

Findings

01. Outperforms existing methods on six downstream tasks

02. Effectively encodes contextual position priors without annotations

03. Improves high-level semantic understanding in 3D medical images

Abstract

Self-Supervised Learning (SSL) has demonstrated promising results in 3D medical image analysis. However, the lack of high-level semantics in pre-training still heavily hinders the performance of downstream tasks. We observe that 3D medical images contain relatively consistent contextual position information, i.e., consistent geometric relations between different organs, which offers a potential way to learn consistent semantic representations in pre-training. In this paper, we propose a simple-yet-effective Volume Contrast (VoCo) framework to leverage the contextual position priors for pre-training. Specifically, we first generate a group of base crops from different regions while enforcing feature discrepancy among them, and we use them as class assignments for the different regions. Then, we randomly crop sub-volumes and predict which class each belongs to (located at which…
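The abstract is cut off before it says how a random crop is assigned to a base-crop class, so the following is only a rough sketch under stated assumptions, not the authors' implementation. It assumes the base crops form a non-overlapping n x n grid over one plane of the volume, and that a random crop's target for each class is the fraction of the crop that overlaps that base crop. The function names (`base_crop_grid`, `position_labels`, `random_crop`) and the overlap-based labeling rule are hypothetical and chosen for illustration.

```python
import random


def base_crop_grid(height, width, n):
    """Split an H x W plane into an n x n grid of non-overlapping base crops.

    Each crop is a (y0, x0, y1, x1) box. The grid layout is an assumption.
    """
    ch, cw = height // n, width // n
    return [
        (i * ch, j * cw, (i + 1) * ch, (j + 1) * cw)
        for i in range(n)
        for j in range(n)
    ]


def overlap_area(a, b):
    """Area of the intersection of two (y0, x0, y1, x1) boxes."""
    dy = min(a[2], b[2]) - max(a[0], b[0])
    dx = min(a[3], b[3]) - max(a[1], b[1])
    return max(dy, 0) * max(dx, 0)


def position_labels(crop, bases):
    """Soft class targets: fraction of the crop covered by each base crop.

    Assumed labeling rule; the truncated abstract does not specify it.
    """
    area = (crop[2] - crop[0]) * (crop[3] - crop[1])
    return [overlap_area(crop, b) / area for b in bases]


def random_crop(height, width, size, rng):
    """Sample a square crop of side `size` that lies fully inside the plane."""
    y0 = rng.randint(0, height - size)
    x0 = rng.randint(0, width - size)
    return (y0, x0, y0 + size, x0 + size)


bases = base_crop_grid(128, 128, 2)  # four 64 x 64 base crops
centered = position_labels((32, 32, 96, 96), bases)  # straddles all four
sampled = position_labels(random_crop(128, 128, 64, random.Random(0)), bases)
```

In this toy setup a crop centered on the grid intersection receives equal weight for all four classes, while a crop that coincides with one base crop receives weight 1 for that class alone. A model trained this way would be asked to predict these weights from the crop's features, which is one plausible reading of "predict which class each belongs to".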

Code & Models

Repositories

luffy03/voco
PyTorch · Official

Taxonomy

Topics: Medical Image Segmentation Techniques · AI in cancer detection · Radiomics and Machine Learning in Medical Imaging

Methods: Balanced Selection