ICNet for Real-Time Semantic Segmentation on High-Resolution Images

Hengshuang Zhao; Xiaojuan Qi; Xiaoyong Shen; Jianping Shi; Jiaya Jia

arXiv:1704.08545·cs.CV·August 21, 2018·130 cites

ICNet for Real-Time Semantic Segmentation on High-Resolution Images

Hengshuang Zhao, Xiaojuan Qi, Xiaoyong Shen, Jianping Shi, Jiaya Jia

PDF

Open Access 5 Repos

TL;DR

This paper introduces ICNet, a real-time semantic segmentation network that efficiently processes high-resolution images by using multi-resolution branches and cascade feature fusion, enabling fast inference with good accuracy.

Contribution

The paper presents ICNet, a novel multi-resolution cascade network with feature fusion for real-time semantic segmentation of high-resolution images.

Findings

01

Real-time inference achieved on a single GPU.

02

Decent segmentation quality on Cityscapes, CamVid, and COCO-Stuff.

03

Efficient reduction of computation for pixel-wise labeling.

Abstract

We focus on the challenging task of real-time semantic segmentation in this paper. It finds many practical applications and yet is with fundamental difficulty of reducing a large portion of computation for pixel-wise label inference. We propose an image cascade network (ICNet) that incorporates multi-resolution branches under proper label guidance to address this challenge. We provide in-depth analysis of our framework and introduce the cascade feature fusion unit to quickly achieve high-quality segmentation. Our system yields real-time inference on a single GPU card with decent quality results evaluated on challenging datasets like Cityscapes, CamVid and COCO-Stuff.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Advanced Image and Video Retrieval Techniques · Medical Image Segmentation Techniques

MethodsDilated Convolution · 1x1 Convolution · Softmax · Batch Normalization