Disruptive Autoencoders: Leveraging Low-level features for 3D Medical   Image Pre-training

Jeya Maria Jose Valanarasu; Yucheng Tang; Dong Yang; Ziyue Xu; Can; Zhao; Wenqi Li; Vishal M. Patel; Bennett Landman; Daguang Xu; Yufan He,; Vishwesh Nath

arXiv:2307.16896·cs.CV·August 1, 2023·5 cites

Disruptive Autoencoders: Leveraging Low-level features for 3D Medical Image Pre-training

Jeya Maria Jose Valanarasu, Yucheng Tang, Dong Yang, Ziyue Xu, Can, Zhao, Wenqi Li, Vishal M. Patel, Bennett Landman, Daguang Xu, Yufan He,, Vishwesh Nath

PDF

Open Access 1 Repo

TL;DR

This paper introduces Disruptive Autoencoders, a novel pre-training framework for 3D medical images that leverages local masking, low-level perturbations, and cross-modal contrastive learning to improve representation and achieve state-of-the-art results.

Contribution

It proposes a new pre-training method combining local masking, low-level perturbations, and cross-modal contrastive loss for 3D medical images, addressing modality-specific challenges.

Findings

01

Achieves state-of-the-art performance on multiple downstream tasks.

02

Top of the BTCV multi-organ segmentation challenge leaderboard.

03

Effective learning of local and low-level features in 3D medical images.

Abstract

Harnessing the power of pre-training on large-scale datasets like ImageNet forms a fundamental building block for the progress of representation learning-driven solutions in computer vision. Medical images are inherently different from natural images as they are acquired in the form of many modalities (CT, MR, PET, Ultrasound etc.) and contain granulated information like tissue, lesion, organs etc. These characteristics of medical images require special attention towards learning features representative of local context. In this work, we focus on designing an effective pre-training framework for 3D radiology images. First, we propose a new masking strategy called local masking where the masking is performed across channel embeddings instead of tokens to improve the learning of local feature representations. We combine this with classical low-level perturbations like adding noise and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

mrgiovanni/suprem
pytorch

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRadiomics and Machine Learning in Medical Imaging · Medical Imaging and Analysis · AI in cancer detection

MethodsFocus