M-FasterSeg: An Efficient Semantic Segmentation Network Based on Neural   Architecture Search

Junjun Wu; Huiyu Kuang; Qinghua Lu; Zeqin Lin; Qingwu Shi; Xilin Liu,; Xiaoman Zhu

arXiv:2112.07918·cs.CV·May 31, 2022

M-FasterSeg: An Efficient Semantic Segmentation Network Based on Neural Architecture Search

Junjun Wu, Huiyu Kuang, Qinghua Lu, Zeqin Lin, Qingwu Shi, Xilin Liu,, Xiaoman Zhu

PDF

Open Access

TL;DR

M-FasterSeg introduces an efficient semantic segmentation network that combines neural architecture search and self-attention mechanisms, achieving a balance of high accuracy and real-time speed suitable for practical applications.

Contribution

The paper proposes a novel neural network structure for semantic segmentation that integrates NAS and self-attention, optimizing edge segmentation and robustness in complex scenes.

Findings

01

Accuracy of 69.8% on Cityscapes dataset

02

Segmentation speed of 48 frames per second

03

Improved edge segmentation and robustness

Abstract

Image semantic segmentation technology is one of the key technologies for intelligent systems to understand natural scenes. As one of the important research directions in the field of visual intelligence, this technology has broad application scenarios in the fields of mobile robots, drones, smart driving, and smart security. However, in the actual application of mobile robots, problems such as inaccurate segmentation semantic label prediction and loss of edge information of segmented objects and background may occur. This paper proposes an improved structure of a semantic segmentation network based on a deep learning network that combines self-attention neural network and neural network architecture search methods. First, a neural network search method NAS (Neural Architecture Search) is used to find a semantic segmentation network with multiple resolution branches. In the search…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Technologies in Various Fields · E-commerce and Technology Innovations

MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings