Unlocking Fine-Grained Details with Wavelet-based High-Frequency Enhancement in Transformers
Reza Azad, Amirhossein Kazerouni, Alaa Sulaiman, Afshin Bozorgpour,, Ehsan Khodapanah Aghdam, Abin Jose, Dorit Merhof

TL;DR
This paper introduces a wavelet-based high-frequency enhancement method in Transformers to improve fine-grained feature extraction for medical image segmentation, leading to more accurate dense predictions.
Contribution
It re-designs the self-attention mechanism using wavelet decomposition and multi-scale context blocks to better capture boundary and texture details in medical images.
Findings
Improved segmentation accuracy on multi-organ benchmarks.
Effective boundary and texture detail preservation.
Enhanced multi-scale feature modeling.
Abstract
Medical image segmentation is a critical task that plays a vital role in diagnosis, treatment planning, and disease monitoring. Accurate segmentation of anatomical structures and abnormalities from medical images can aid in the early detection and treatment of various diseases. In this paper, we address the local feature deficiency of the Transformer model by carefully re-designing the self-attention map to produce accurate dense prediction in medical images. To this end, we first apply the wavelet transformation to decompose the input feature map into low-frequency (LF) and high-frequency (HF) subbands. The LF segment is associated with coarse-grained features while the HF components preserve fine-grained features such as texture and edge information. Next, we reformulate the self-attention operation using the efficient Transformer to perform both spatial and context attention on top…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAI in cancer detection · Radiomics and Machine Learning in Medical Imaging · Cutaneous Melanoma Detection and Management
MethodsAttention Is All You Need · Linear Layer · Dropout · Byte Pair Encoding · Adam · Position-Wise Feed-Forward Layer · Multi-Head Attention · Absolute Position Encodings · Residual Connection · Label Smoothing
