CFPNet: Channel-wise Feature Pyramid for Real-Time Semantic Segmentation
Ange Lou, Murray Loew

TL;DR
CFPNet introduces a channel-wise feature pyramid module that effectively balances accuracy, model size, and speed for real-time semantic segmentation, suitable for mobile and autonomous driving applications.
Contribution
The paper proposes the CFP module and CFPNet architecture, achieving high performance with minimal parameters and fast inference for real-time segmentation.
Findings
Achieves 70.1% class-wise mIoU on Cityscapes
Uses only 0.55 million parameters
Runs at 30 FPS on a single GPU
Abstract
Real-time semantic segmentation is playing a more important role in computer vision, due to the growing demand for mobile devices and autonomous driving. Therefore, it is very important to achieve a good trade-off among performance, model size and inference speed. In this paper, we propose a Channel-wise Feature Pyramid (CFP) module to balance those factors. Based on the CFP module, we built CFPNet for real-time semantic segmentation which applied a series of dilated convolution channels to extract effective features. Experiments on Cityscapes and CamVid datasets show that the proposed CFPNet achieves an effective combination of those factors. For the Cityscapes test dataset, CFPNet achieves 70.1% class-wise mIoU with only 0.55 million parameters and 2.5 MB memory. The inference speed can reach 30 FPS on a single RTX 2080Ti GPU with a 1024x2048-pixel image.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Advanced Image and Video Retrieval Techniques · Robotics and Sensor-Based Localization
MethodsConvolution · Dilated Convolution
