Feature Selective Transformer for Semantic Image Segmentation
Fangjian Lin, Tianyi Wu, Sitong Wu, Shengwei Tian, Guodong Guo

TL;DR
This paper introduces FeSeFormer, a novel Transformer-based model that adaptively fuses multi-scale features for semantic image segmentation, significantly improving performance on multiple benchmarks.
Contribution
The paper proposes a Feature Selective Transformer with Scale-level Feature Selection and Full-scale Feature Fusion modules for enhanced multi-scale feature modeling.
Findings
Outperforms state-of-the-art on four semantic segmentation benchmarks.
Effectively selects informative features for each scale.
Achieves superior segmentation accuracy across datasets.
Abstract
Recently, it has attracted more and more attentions to fuse multi-scale features for semantic image segmentation. Various works were proposed to employ progressive local or global fusion, but the feature fusions are not rich enough for modeling multi-scale context features. In this work, we focus on fusing multi-scale features from Transformer-based backbones for semantic segmentation, and propose a Feature Selective Transformer (FeSeFormer), which aggregates features from all scales (or levels) for each query feature. Specifically, we first propose a Scale-level Feature Selection (SFS) module, which can choose an informative subset from the whole multi-scale feature set for each scale, where those features that are important for the current scale (or level) are selected and the redundant are discarded. Furthermore, we propose a Full-scale Feature Fusion (FFF) module, which can…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image and Video Retrieval Techniques · Advanced Neural Network Applications · Multimodal Machine Learning Applications
MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · Feature Selection · Adam · Residual Connection · Softmax · Dropout · Position-Wise Feed-Forward Layer · Dense Connections
