Graph-Segmenter: Graph Transformer with Boundary-aware Attention for Semantic Segmentation
Zizhang Wu, Yuanzhu Gan, Tianhao Xu, Fan Wang

TL;DR
Graph-Segmenter introduces a novel graph transformer with boundary-aware attention that models global and local relations for improved semantic segmentation, achieving state-of-the-art results on multiple datasets.
Contribution
It presents a new network combining graph transformer and boundary-aware attention to enhance relation modeling and boundary refinement in semantic segmentation.
Findings
Achieves state-of-the-art performance on Cityscapes, ADE-20k, and PASCAL Context datasets.
Effectively models global window relations and local pixel boundaries.
Demonstrates significant improvement over existing transformer-based segmentation methods.
Abstract
The transformer-based semantic segmentation approaches, which divide the image into different regions by sliding windows and model the relation inside each window, have achieved outstanding success. However, since the relation modeling between windows was not the primary emphasis of previous work, it was not fully utilized. To address this issue, we propose a Graph-Segmenter, including a Graph Transformer and a Boundary-aware Attention module, which is an effective network for simultaneously modeling the more profound relation between windows in a global view and various pixels inside each window as a local one, and for substantial low-cost boundary adjustment. Specifically, we treat every window and pixel inside the window as nodes to construct graphs for both views and devise the Graph Transformer. The introduced boundary-aware attention module optimizes the edge information of the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsMulti-Head Attention · Attention Is All You Need · Adam · Softmax · Label Smoothing · Absolute Position Encodings · Position-Wise Feed-Forward Layer · Layer Normalization · Linear Layer · Residual Connection
