LoMAE: Low-level Vision Masked Autoencoders for Low-dose CT Denoising
Dayang Wang, Yongshun Xu, Shuo Han, Zhan Wu, Li Zhou, Bahareh, Morovati, Hengyong Yu

TL;DR
LoMAE is a novel low-level vision masked autoencoder designed for low-dose CT denoising, reducing reliance on paired data and improving robustness across various noise levels.
Contribution
The paper introduces LoMAE, a tailored masked autoencoder for low-dose CT denoising, and an MAE-GradCAM method to understand its learning mechanisms, addressing data scarcity issues.
Findings
Enhances transformer-based denoising performance.
Reduces dependence on clean ground truth data.
Shows robustness across different noise levels.
Abstract
Low-dose computed tomography (LDCT) offers reduced X-ray radiation exposure but at the cost of compromised image quality, characterized by increased noise and artifacts. Recently, transformer models emerged as a promising avenue to enhance LDCT image quality. However, the success of such models relies on a large amount of paired noisy and clean images, which are often scarce in clinical settings. In the fields of computer vision and natural language processing, masked autoencoders (MAE) have been recognized as an effective label-free self-pretraining method for transformers, due to their exceptional feature representation ability. However, the original pretraining and fine-tuning design fails to work in low-level vision tasks like denoising. In response to this challenge, we redesign the classical encoder-decoder learning model and facilitate a simple yet effective low-level vision MAE,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMedical Imaging Techniques and Applications · Radiomics and Machine Learning in Medical Imaging · Advanced X-ray and CT Imaging
MethodsMasked autoencoder
