CFPNet: Channel-wise Feature Pyramid for Real-Time Semantic Segmentation

Ange Lou; Murray Loew

arXiv:2103.12212·cs.CV·June 4, 2021·5 cites

CFPNet: Channel-wise Feature Pyramid for Real-Time Semantic Segmentation

Ange Lou, Murray Loew

PDF

Open Access 2 Repos

TL;DR

CFPNet introduces a channel-wise feature pyramid module that effectively balances accuracy, model size, and speed for real-time semantic segmentation, suitable for mobile and autonomous driving applications.

Contribution

The paper proposes the CFP module and CFPNet architecture, achieving high performance with minimal parameters and fast inference for real-time segmentation.

Findings

01

Achieves 70.1% class-wise mIoU on Cityscapes

02

Uses only 0.55 million parameters

03

Runs at 30 FPS on a single GPU

Abstract

Real-time semantic segmentation is playing a more important role in computer vision, due to the growing demand for mobile devices and autonomous driving. Therefore, it is very important to achieve a good trade-off among performance, model size and inference speed. In this paper, we propose a Channel-wise Feature Pyramid (CFP) module to balance those factors. Based on the CFP module, we built CFPNet for real-time semantic segmentation which applied a series of dilated convolution channels to extract effective features. Experiments on Cityscapes and CamVid datasets show that the proposed CFPNet achieves an effective combination of those factors. For the Cityscapes test dataset, CFPNet achieves 70.1% class-wise mIoU with only 0.55 million parameters and 2.5 MB memory. The inference speed can reach 30 FPS on a single RTX 2080Ti GPU with a 1024x2048-pixel image.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Advanced Image and Video Retrieval Techniques · Robotics and Sensor-Based Localization

MethodsConvolution · Dilated Convolution