Local-to-Global Information Communication for Real-Time Semantic   Segmentation Network Search

Guangliang Cheng; Peng Sun; Ting-Bing Xu; Shuchang Lyu; Peiwen Lin

arXiv:2302.08481·cs.CV·February 17, 2023·1 cites

Local-to-Global Information Communication for Real-Time Semantic Segmentation Network Search

Guangliang Cheng, Peng Sun, Ting-Bing Xu, Shuchang Lyu, Peiwen Lin

PDF

Open Access

TL;DR

This paper introduces LGCNet, a neural architecture search framework for real-time semantic segmentation that effectively balances accuracy and speed by employing a new search space and local-to-global information communication modules.

Contribution

The paper proposes a novel search space with cell-independent architecture and two communication modules for local and global information exchange, improving real-time segmentation performance.

Findings

01

Achieves 74.0% mIoU on Cityscapes with 115.2 FPS

02

Outperforms previous methods in accuracy-speed trade-off

03

Demonstrates effectiveness on Cityscapes and CamVid datasets

Abstract

Neural Architecture Search (NAS) has shown great potentials in automatically designing neural network architectures for real-time semantic segmentation. Unlike previous works that utilize a simplified search space with cell-sharing way, we introduce a new search space where a lightweight model can be more effectively searched by replacing the cell-sharing manner with cell-independent one. Based on this, the communication of local to global information is achieved through two well-designed modules. For local information exchange, a graph convolutional network (GCN) guided module is seamlessly integrated as a communication deliver between cells. For global information aggregation, we propose a novel dense-connected fusion module (cell) which aggregates long-range multi-level features in the network automatically. In addition, a latency-oriented constraint is endowed into the search…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Advanced Image and Video Retrieval Techniques · Video Surveillance and Tracking Methods

MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings