FreqU-FNet: Frequency-Aware U-Net for Imbalanced Medical Image Segmentation
Ruiqi Xing

TL;DR
FreqU-FNet introduces a frequency domain U-Net architecture with spectral feature extraction and a frequency-aware loss, significantly improving imbalanced medical image segmentation especially for minority classes.
Contribution
It presents a novel frequency domain segmentation framework with multi-scale spectral features and a frequency-aware loss, addressing class imbalance in medical imaging.
Findings
Outperforms CNN and Transformer baselines on multiple benchmarks.
Effectively captures minority class signals through spectral features.
Enhances segmentation accuracy for under-represented classes.
Abstract
Medical image segmentation faces persistent challenges due to severe class imbalance and the frequency-specific distribution of anatomical structures. Most conventional CNN-based methods operate in the spatial domain and struggle to capture minority class signals, often affected by frequency aliasing and limited spectral selectivity. Transformer-based models, while powerful in modeling global dependencies, tend to overlook critical local details necessary for fine-grained segmentation. To overcome these limitations, we propose FreqU-FNet, a novel U-shaped segmentation architecture operating in the frequency domain. Our framework incorporates a Frequency Encoder that leverages Low-Pass Frequency Convolution and Daubechies wavelet-based downsampling to extract multi-scale spectral features. To reconstruct fine spatial details, we introduce a Spatial Learnable Decoder (SLD) equipped with…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsAttention Is All You Need · Linear Layer · Layer Normalization · Byte Pair Encoding · Residual Connection · Dense Connections · Convolution · Softmax · Position-Wise Feed-Forward Layer · Absolute Position Encodings
