Attentive Symmetric Autoencoder for Brain MRI Segmentation
Junjia Huang, Haofeng Li, Guanbin Li, Xiang Wan

TL;DR
This paper introduces a novel Attentive Symmetric Auto-encoder based on Vision Transformer for 3D brain MRI segmentation, emphasizing informative and symmetric regions to improve feature learning and segmentation accuracy.
Contribution
It proposes a new auto-encoder that uses gradient-based importance and symmetric position encoding to better capture brain structure features in MRI segmentation.
Findings
Outperforms state-of-the-art self-supervised methods
Effective in capturing symmetric brain regions
Improves segmentation accuracy on benchmarks
Abstract
Self-supervised learning methods based on image patch reconstruction have witnessed great success in training auto-encoders, whose pre-trained weights can be transferred to fine-tune other downstream tasks of image understanding. However, existing methods seldom study the various importance of reconstructed patches and the symmetry of anatomical structures, when they are applied to 3D medical images. In this paper we propose a novel Attentive Symmetric Auto-encoder (ASA) based on Vision Transformer (ViT) for 3D brain MRI segmentation tasks. We conjecture that forcing the auto-encoder to recover informative image regions can harvest more discriminative representations, than to recover smooth image patches. Then we adopt a gradient based metric to estimate the importance of each image patch. In the pre-training stage, the proposed auto-encoder pays more attention to reconstruct the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMedical Image Segmentation Techniques · Advanced Neural Network Applications · Brain Tumor Detection and Classification
MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · Position-Wise Feed-Forward Layer · Byte Pair Encoding · Adam · Softmax · Dropout · Vision Transformer · Residual Connection
