ASAP: Accurate semantic segmentation for real time performance
Jaehyun Park, Subin Lee, Eon Kim, Byeongjun Moon, Dabeen Yu, Yeonseung, Yu, Junghwan Kim

TL;DR
This paper introduces FFDN, an efficient feature fusion method for semantic segmentation that preserves global context and reduces computational cost, achieving real-time performance in autonomous driving scenarios.
Contribution
The paper proposes FFDN, a novel feature fusion approach with vertical pooling and different norms, improving efficiency and accuracy in real-time semantic segmentation.
Findings
Achieves 73.1 mIoU on Cityscapes
Runs at 191 FPS in real-time
Comparable to state-of-the-art methods
Abstract
Feature fusion modules from encoder and self-attention module have been adopted in semantic segmentation. However, the computation of these modules is costly and has operational limitations in real-time environments. In addition, segmentation performance is limited in autonomous driving environments with a lot of contextual information perpendicular to the road surface, such as people, buildings, and general objects. In this paper, we propose an efficient feature fusion method, Feature Fusion with Different Norms (FFDN) that utilizes rich global context of multi-level scale and vertical pooling module before self-attention that preserves most contextual information while reducing the complexity of global context encoding in the vertical direction. By doing this, we could handle the properties of representation in global space and reduce additional computational cost. In addition, we…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Video Surveillance and Tracking Methods · Autonomous Vehicle Technology and Safety
MethodsTest
