Cross-Stage Attention Propagation for Efficient Semantic Segmentation

Beoungwoo Kang

arXiv:2604.05431·cs.CV·April 8, 2026

Cross-Stage Attention Propagation for Efficient Semantic Segmentation

Beoungwoo Kang

PDF

TL;DR

The paper introduces Cross-Stage Attention Propagation (CSAP), a novel decoder framework for semantic segmentation that reduces computational cost by propagating attention maps across feature scales, achieving high accuracy with fewer FLOPs.

Contribution

CSAP is a new attention propagation method that computes attention at the deepest scale and efficiently propagates it to shallower stages, improving efficiency and accuracy.

Findings

01

CSAP-Tiny achieves 42.9% mIoU on ADE20K with 5.5 GFLOPs.

02

CSAP surpasses SegNeXt-Tiny by +1.8% on ADE20K while using 16.8% fewer FLOPs.

03

CSAP achieves 80.5% mIoU on Cityscapes with 21.5 GFLOPs.

Abstract

Recent lightweight semantic segmentation methods have made significant progress by combining compact backbones with efficient decoder heads. However, most multi-scale decoders compute attention independently at each feature scale, introducing substantial redundancy since the resulting attention distributions across scales are strongly correlated. We propose Cross-Stage Attention Propagation (CSAP), a decoder framework that computes attention at the deepest feature scale and propagates the resulting attention maps to shallower stages, bypassing query-key computation at those stages entirely. This design preserves multi-scale contextual reasoning while substantially reducing the decoder's computational cost. CSAP-Tiny achieves 42.9% mIoU on ADE20K with only 5.5 GFLOPs, 80.5% on Cityscapes with 21.5 GFLOPs, and 40.9% on COCO-Stuff 164K with 5.5 GFLOPs, surpassing SegNeXt-Tiny by +1.8% on…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.