HUMUS-Net: Hybrid unrolled multi-scale network architecture for accelerated MRI reconstruction
Zalan Fabian, Berk Tinaz, Mahdi Soltanolkotabi

TL;DR
HUMUS-Net is a hybrid deep learning architecture combining convolutions and Transformers in a multi-scale, unrolled network to improve accelerated MRI reconstruction, achieving state-of-the-art results on large datasets.
Contribution
It introduces HUMUS-Net, a novel hybrid architecture that efficiently integrates convolutional and Transformer modules for high-quality MRI reconstruction.
Findings
Achieves state-of-the-art performance on fastMRI dataset.
Demonstrates superior reconstruction quality on multiple MRI datasets.
Validates the effectiveness of multi-scale Transformer integration.
Abstract
In accelerated MRI reconstruction, the anatomy of a patient is recovered from a set of under-sampled and noisy measurements. Deep learning approaches have been proven to be successful in solving this ill-posed inverse problem and are capable of producing very high quality reconstructions. However, current architectures heavily rely on convolutions, that are content-independent and have difficulties modeling long-range dependencies in images. Recently, Transformers, the workhorse of contemporary natural language processing, have emerged as powerful building blocks for a multitude of vision tasks. These models split input images into non-overlapping patches, embed the patches into lower-dimensional tokens and utilize a self-attention mechanism that does not suffer from the aforementioned weaknesses of convolutional architectures. However, Transformers incur extremely high compute and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsAdvanced MRI Techniques and Applications · Advanced Neural Network Applications · Generative Adversarial Networks and Image Synthesis
MethodsAttention Is All You Need · Linear Layer · Byte Pair Encoding · Softmax · Position-Wise Feed-Forward Layer · Multi-Head Attention · Dense Connections · Residual Connection · Dropout · Layer Normalization
