Fast-SCNN: Fast Semantic Segmentation Network
Rudra P K Poudel, Stephan Liwicki, Roberto Cipolla

TL;DR
Fast-SCNN is a real-time semantic segmentation network optimized for embedded devices, combining multi-resolution features for high accuracy and speed without large-scale pre-training.
Contribution
Introduces a novel 'learning to downsample' module and demonstrates a fast, accurate segmentation model suitable for embedded systems without needing extensive pre-training.
Findings
Achieves 68.0% mIoU at 123.5 fps on Cityscapes.
Operates efficiently on high-resolution images with low memory.
Pre-training on large datasets is unnecessary for good performance.
Abstract
The encoder-decoder framework is state-of-the-art for offline semantic image segmentation. Since the rise in autonomous systems, real-time computation is increasingly desirable. In this paper, we introduce fast segmentation convolutional neural network (Fast-SCNN), an above real-time semantic segmentation model on high resolution image data (1024x2048px) suited to efficient computation on embedded devices with low memory. Building on existing two-branch methods for fast segmentation, we introduce our `learning to downsample' module which computes low-level features for multiple resolution branches simultaneously. Our network combines spatial detail at high resolution with deep features extracted at lower resolution, yielding an accuracy of 68.0% mean intersection over union at 123.5 frames per second on Cityscapes. We also show that large scale pre-training is unnecessary. We thoroughly…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Advanced Image and Video Retrieval Techniques · Domain Adaptation and Few-Shot Learning
