WaveSeg: Enhancing Segmentation Precision via High-Frequency Prior and Mamba-Driven Spectrum Decomposition
Guoan Xu, Yang Xiao, Wenjing Jia, Guangwei Gao, Guo-Jun Qi, and Chia-Wen Lin

TL;DR
WaveSeg introduces a novel decoder that combines wavelet domain priors and Mamba-based attention to improve segmentation accuracy by better preserving boundary details and semantic information.
Contribution
The paper proposes WaveSeg, a new decoder architecture that integrates high-frequency priors, spectrum decomposition attention, and Mamba-driven modeling for enhanced segmentation.
Findings
Outperforms state-of-the-art methods on standard benchmarks.
Effectively preserves boundary details and semantic integrity.
Achieves efficient and precise segmentation results.
Abstract
While recent semantic segmentation networks heavily rely on powerful pretrained encoders, most employ simplistic decoders, leading to suboptimal trade-offs between semantic context and fine-grained detail preservation. To address this, we propose a novel decoder architecture, WaveSeg, which jointly optimizes feature refinement in spatial and wavelet domains. Specifically, high-frequency components are first learned from input images as explicit priors to reinforce boundary details at early stages. A multi-scale fusion mechanism, Dual Domain Operation (DDO), is then applied, and the novel Spectrum Decomposition Attention (SDA) block is proposed, which is developed to leverage Mamba's linear-complexity long-range modeling to enhance high-frequency structural details. Meanwhile, reparameterized convolutions are applied to preserve low-frequency semantic integrity in the wavelet domain.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
