ShuffleSeg: Real-time Semantic Segmentation Network
Mostafa Gamal, Mennatullah Siam, Moemen Abdel-Razek

TL;DR
ShuffleSeg is a computationally efficient real-time semantic segmentation network utilizing grouped convolution and channel shuffling, achieving a good balance of speed and accuracy suitable for mobile and robotics applications.
Contribution
It introduces ShuffleSeg, a novel segmentation architecture optimized for real-time performance with significant FLOPs reduction and competitive accuracy.
Findings
Achieves 2x GFLOPs reduction compared to previous methods.
Maintains 58.3% mean IoU on CityScapes test set.
Runs at 15.7 fps on NVIDIA Jetson TX2.
Abstract
Real-time semantic segmentation is of significant importance for mobile and robotics related applications. We propose a computationally efficient segmentation network which we term as ShuffleSeg. The proposed architecture is based on grouped convolution and channel shuffling in its encoder for improving the performance. An ablation study of different decoding methods is compared including Skip architecture, UNet, and Dilation Frontend. Interesting insights on the speed and accuracy tradeoff is discussed. It is shown that skip architecture in the decoding method provides the best compromise for the goal of real-time performance, while it provides adequate accuracy by utilizing higher resolution feature maps for a more accurate segmentation. ShuffleSeg is evaluated on CityScapes and compared against the state of the art real-time segmentation networks. It achieves 2x GFLOPs reduction,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Advanced Image and Video Retrieval Techniques · Domain Adaptation and Few-Shot Learning
MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings · 1x1 Convolution · Grouped Convolution · Convolution
