A Point in the Right Direction: Vector Prediction for Spatially-aware   Self-supervised Volumetric Representation Learning

Yejia Zhang; Pengfei Gu; Nishchal Sapkota; Hao Zheng; Peixian Liang,; Danny Z. Chen

arXiv:2211.08533·cs.CV·November 17, 2022

A Point in the Right Direction: Vector Prediction for Spatially-aware Self-supervised Volumetric Representation Learning

Yejia Zhang, Pengfei Gu, Nishchal Sapkota, Hao Zheng, Peixian Liang,, Danny Z. Chen

PDF

Open Access

TL;DR

VectorPOSE introduces two novel self-supervised tasks, Vector Prediction and Boundary-Focused Reconstruction, to enhance spatial understanding in 3D medical imaging, leading to improved segmentation performance with limited labels.

Contribution

The paper proposes VectorPOSE, a new self-supervised method that incorporates spatially-aware pretext tasks for better 3D medical image representation learning.

Findings

01

Outperforms state-of-the-art methods on 3D segmentation tasks

02

Enhances learning in limited annotation scenarios

03

Improves both global and local spatial feature understanding

Abstract

High annotation costs and limited labels for dense 3D medical imaging tasks have recently motivated an assortment of 3D self-supervised pretraining methods that improve transfer learning performance. However, these methods commonly lack spatial awareness despite its centrality in enabling effective 3D image analysis. More specifically, position, scale, and orientation are not only informative but also automatically available when generating image crops for training. Yet, to date, no work has proposed a pretext task that distills all key spatial features. To fulfill this need, we develop a new self-supervised method, VectorPOSE, which promotes better spatial understanding with two novel pretext tasks: Vector Prediction (VP) and Boundary-Focused Reconstruction (BFR). VP focuses on global spatial concepts (i.e., properties of 3D patches) while BFR addresses weaknesses of recent…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications · Cancer-related molecular mechanisms research