Rethinking BiSeNet For Real-time Semantic Segmentation
Mingyuan Fan, Shenqi Lai, Junshi Huang, Xiaoming Wei, Zhenhua Chai,, Junfeng Luo, Xiaolin Wei

TL;DR
This paper introduces the STDC network, a novel efficient architecture for real-time semantic segmentation that reduces redundancy and improves speed and accuracy on Cityscapes and CamVid datasets.
Contribution
The paper proposes the STDC network with a new feature aggregation method and a Detail Aggregation module, enhancing efficiency and segmentation performance.
Findings
Achieves 71.9% mIoU at 250.4 FPS on Cityscapes
Outperforms recent methods by 45.2% in speed
Maintains high accuracy at higher resolutions
Abstract
BiSeNet has been proved to be a popular two-stream network for real-time segmentation. However, its principle of adding an extra path to encode spatial information is time-consuming, and the backbones borrowed from pretrained tasks, e.g., image classification, may be inefficient for image segmentation due to the deficiency of task-specific design. To handle these problems, we propose a novel and efficient structure named Short-Term Dense Concatenate network (STDC network) by removing structure redundancy. Specifically, we gradually reduce the dimension of feature maps and use the aggregation of them for image representation, which forms the basic module of STDC network. In the decoder, we propose a Detail Aggregation module by integrating the learning of spatial information into low-level layers in single-stream manner. Finally, the low-level features and deep features are fused to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Advanced Image and Video Retrieval Techniques · Domain Adaptation and Few-Shot Learning
Methods*Communicated@Fast*How Do I Communicate to Expedia? · Batch Normalization · 1x1 Convolution · Convolution · Short-Term Dense Concatenate
