S2M-Net: Spectral-Spatial Mixing for Medical Image Segmentation with Morphology-Aware Adaptive Loss
Md. Sanaullah Chowdhury Lameya Sabrin

TL;DR
S2M-Net is a lightweight medical image segmentation model that combines spectral-spatial mixing and morphology-aware adaptive loss to achieve state-of-the-art accuracy with fewer parameters.
Contribution
The paper introduces S2M-Net, a novel architecture with spectral and morphological innovations that outperform existing methods while reducing computational complexity.
Findings
Achieves 96.12% Dice on polyp segmentation
Outperforms prior art by 17.85% on surgical instruments
Uses 3.5-6 times fewer parameters than transformer-based models
Abstract
Medical image segmentation requires balancing local precision for boundary-critical clinical applications, global context for anatomical coherence, and computational efficiency for deployment on limited data and hardware a trilemma that existing architectures fail to resolve. Although convolutional networks provide local precision at cost but limited receptive fields, vision transformers achieve global context through self-attention at prohibitive computational expense, causing overfitting on small clinical datasets. We propose S2M-Net, a 4.7M-parameter architecture that achieves global context through two synergistic innovations: (i) Spectral-Selective Token Mixer (SSTM), which exploits the spectral concentration of medical images via truncated 2D FFT with learnable frequency filtering and content-gated spatial projection,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
