On Efficient Real-Time Semantic Segmentation: A Survey

Christopher J. Holder; Muhammad Shafique

arXiv:2206.08605 · cs.CV · October 28, 2024 · 21 citations

TL;DR

This survey reviews efficient real-time semantic segmentation models suitable for autonomous vehicles, analyzing their design, performance, and trade-offs on embedded hardware to facilitate scene understanding with limited resources.

Contribution

It provides a comprehensive taxonomy and evaluation of recent compact semantic segmentation models optimized for real-time deployment on low-memory embedded systems.

Findings

1. Many of the surveyed models achieve real-time inference on embedded hardware.

2. A trade-off is observed between model accuracy and inference latency.

3. Evaluation under a consistent hardware setup highlights the performance differences between models.

Abstract

Semantic segmentation is the problem of assigning a class label to every pixel in an image, and is an important component of an autonomous vehicle vision stack for facilitating scene understanding and object detection. However, many of the top-performing semantic segmentation models are extremely complex and cumbersome, and as such are not suited to deployment onboard autonomous vehicle platforms, where computational resources are limited and low-latency operation is a vital requirement. In this survey, we take a thorough look at the works that aim to address this misalignment with more compact and efficient models capable of deployment on low-memory embedded systems while meeting the constraint of real-time inference. We discuss several of the most prominent works in the field, placing them within a taxonomy based on their major contributions, and finally we evaluate the inference speed…
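
As a minimal illustration of the definition above (not taken from the survey), per-pixel labeling can be sketched as picking the highest-scoring class at each pixel. The function name `segment`, the toy 2x2 score grid, and the class names in the comments are assumptions made for this example only; a real model would produce the scores with a neural network.

```python
from typing import List

def segment(scores: List[List[List[float]]]) -> List[List[int]]:
    """Turn per-pixel class scores into a label map.

    scores[y][x][c] is the (hypothetical) score of class c at pixel (y, x);
    the result holds, for each pixel, the index of its highest-scoring class.
    """
    return [
        [max(range(len(pixel)), key=pixel.__getitem__) for pixel in row]
        for row in scores
    ]

# Toy 2x2 "image" with 3 classes (assumed: 0 = road, 1 = car, 2 = sky).
scores = [
    [[2.0, 0.1, 0.3], [0.2, 1.5, 0.1]],
    [[0.9, 0.8, 0.1], [0.0, 0.2, 3.0]],
]
labels = segment(scores)
print(labels)
```

The output has the same height and width as the input, with one class index per pixel, which is what distinguishes semantic segmentation from whole-image classification.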

Taxonomy

Topics: Advanced Neural Network Applications · Advanced Image and Video Retrieval Techniques · Medical Image Segmentation Techniques

Methods: SPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings