A Deep Learning Approach Using Masked Image Modeling for Reconstruction of Undersampled K-spaces
Kyler Larsen, Arghya Pal, Yogesh Rathi

TL;DR
This paper introduces a deep learning method using masked image modeling with vision transformers to accurately reconstruct fully sampled MRI images from undersampled k spaces, significantly reducing scan times.
Contribution
It is the first to apply masked image modeling with Swin transformers for MRI k space reconstruction, achieving high accuracy and structural similarity.
Findings
Reconstructed images had L1 loss <0.01
Structural similarity over 99% with fully sampled images
Validation loss decreased steadily during training
Abstract
Magnetic Resonance Imaging (MRI) scans are time consuming and precarious, since the patients remain still in a confined space for extended periods of time. To reduce scanning time, some experts have experimented with undersampled k spaces, trying to use deep learning to predict the fully sampled result. These studies report that as many as 20 to 30 minutes could be saved off a scan that takes an hour or more. However, none of these studies have explored the possibility of using masked image modeling (MIM) to predict the missing parts of MRI k spaces. This study makes use of 11161 reconstructed MRI and k spaces of knee MRI images from Facebook's fastmri dataset. This tests a modified version of an existing model using baseline shifted window (Swin) and vision transformer architectures that makes use of MIM on undersampled k spaces to predict the full k space and consequently the full MRI…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRadiomics and Machine Learning in Medical Imaging · Medical Imaging and Analysis · Medical Imaging Techniques and Applications
MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · Residual Connection · Softmax · Layer Normalization · Dense Connections · Vision Transformer · Mutual Information Machine/Mask Image Modeling · Gradient Normalization
