MacFormer: Semantic Segmentation with Fine Object Boundaries
Guoan Xu, Wenfeng Huang, Tao Wu, Ligeng Chen, Wenjing Jia, Guangwei, Gao, Xiatian Zhu, Stuart Perry

TL;DR
MacFormer is a novel semantic segmentation architecture that improves boundary precision by integrating mutual cross-attention and frequency domain enhancement, achieving superior accuracy and efficiency on benchmark datasets.
Contribution
The paper introduces MacFormer, combining learnable agent tokens with a frequency enhancement module to better preserve object boundaries in semantic segmentation.
Findings
Outperforms existing methods on ADE20K and Cityscapes datasets.
Effectively preserves low-level features like edges during decoding.
Compatible with various network architectures and improves accuracy with minimal computational overhead.
Abstract
Semantic segmentation involves assigning a specific category to each pixel in an image. While Vision Transformer-based models have made significant progress, current semantic segmentation methods often struggle with precise predictions in localized areas like object boundaries. To tackle this challenge, we introduce a new semantic segmentation architecture, ``MacFormer'', which features two key components. Firstly, using learnable agent tokens, a Mutual Agent Cross-Attention (MACA) mechanism effectively facilitates the bidirectional integration of features across encoder and decoder layers. This enables better preservation of low-level features, such as elementary edges, during decoding. Secondly, a Frequency Enhancement Module (FEM) in the decoder leverages high-frequency and low-frequency components to boost features in the frequency domain, benefiting object boundaries with minimal…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling
