M-FasterSeg: An Efficient Semantic Segmentation Network Based on Neural Architecture Search
Junjun Wu, Huiyu Kuang, Qinghua Lu, Zeqin Lin, Qingwu Shi, Xilin Liu,, Xiaoman Zhu

TL;DR
M-FasterSeg introduces an efficient semantic segmentation network that combines neural architecture search and self-attention mechanisms, achieving a balance of high accuracy and real-time speed suitable for practical applications.
Contribution
The paper proposes a novel neural network structure for semantic segmentation that integrates NAS and self-attention, optimizing edge segmentation and robustness in complex scenes.
Findings
Accuracy of 69.8% on Cityscapes dataset
Segmentation speed of 48 frames per second
Improved edge segmentation and robustness
Abstract
Image semantic segmentation technology is one of the key technologies for intelligent systems to understand natural scenes. As one of the important research directions in the field of visual intelligence, this technology has broad application scenarios in the fields of mobile robots, drones, smart driving, and smart security. However, in the actual application of mobile robots, problems such as inaccurate segmentation semantic label prediction and loss of edge information of segmented objects and background may occur. This paper proposes an improved structure of a semantic segmentation network based on a deep learning network that combines self-attention neural network and neural network architecture search methods. First, a neural network search method NAS (Neural Architecture Search) is used to find a semantic segmentation network with multiple resolution branches. In the search…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Technologies in Various Fields · E-commerce and Technology Innovations
MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings
