Light-Weight RefineNet for Real-Time Semantic Segmentation

Vladimir Nekrasov; Chunhua Shen; Ian Reid

arXiv:1810.03272·cs.CV·October 9, 2018·101 cites

Light-Weight RefineNet for Real-Time Semantic Segmentation

Vladimir Nekrasov, Chunhua Shen, Ian Reid

PDF

Open Access 2 Repos

TL;DR

This paper presents a compact, real-time capable version of RefineNet for semantic segmentation, reducing model size and computation while maintaining high accuracy, suitable for high-resolution inputs and real-time applications.

Contribution

The authors adapt RefineNet into a more efficient model by identifying and modifying computationally expensive blocks, achieving over twofold reduction in parameters and FLOPs with minimal performance loss.

Findings

01

Model reduction over twofold with almost unchanged performance

02

Speed increase from 20 FPS to 55 FPS on 512x512 inputs

03

Achieved 79.2% mean IoU with only 3.3M parameters

Abstract

We consider an important task of effective and efficient semantic image segmentation. In particular, we adapt a powerful semantic segmentation architecture, called RefineNet, into the more compact one, suitable even for tasks requiring real-time performance on high-resolution inputs. To this end, we identify computationally expensive blocks in the original setup, and propose two modifications aimed to decrease the number of parameters and floating point operations. By doing that, we achieve more than twofold model reduction, while keeping the performance levels almost intact. Our fastest model undergoes a significant speed-up boost from 20 FPS to 55 FPS on a generic GPU card on 512x512 inputs with solid 81.1% mean iou performance on the test set of PASCAL VOC, while our slowest model with 32 FPS (from original 17 FPS) shows 82.7% mean iou on the same dataset. Alternatively, we showcase…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Advanced Image and Video Retrieval Techniques