Mobile-Seed: Joint Semantic Segmentation and Boundary Detection for   Mobile Robots

Youqi Liao; Shuhao Kang; Jianping Li; Yang Liu; Yun Liu; Zhen Dong,; Bisheng Yang; Xieyuanli Chen

arXiv:2311.12651·cs.CV·March 12, 2024·1 cites

Mobile-Seed: Joint Semantic Segmentation and Boundary Detection for Mobile Robots

Youqi Liao, Shuhao Kang, Jianping Li, Yang Liu, Yun Liu, Zhen Dong,, Bisheng Yang, Xieyuanli Chen

PDF

Open Access 1 Repo

TL;DR

Mobile-Seed is a lightweight dual-task framework that simultaneously performs semantic segmentation and boundary detection on mobile robots, improving accuracy and speed for real-time robotic applications.

Contribution

The paper introduces Mobile-Seed, a novel lightweight dual-task model with a two-stream encoder, active fusion decoder, and regularization, enhancing boundary detection and semantic segmentation performance.

Findings

01

Achieves 2.2% mIoU improvement over SOTA on Cityscapes

02

Maintains 23.9 FPS inference speed on high-resolution images

03

Demonstrates good generalization on multiple datasets

Abstract

Precise and rapid delineation of sharp boundaries and robust semantics is essential for numerous downstream robotic tasks, such as robot grasping and manipulation, real-time semantic mapping, and online sensor calibration performed on edge computing units. Although boundary detection and semantic segmentation are complementary tasks, most studies focus on lightweight models for semantic segmentation but overlook the critical role of boundary detection. In this work, we introduce Mobile-Seed, a lightweight, dual-task framework tailored for simultaneous semantic segmentation and boundary detection. Our framework features a two-stream encoder, an active fusion decoder (AFD) and a dual-task regularization approach. The encoder is divided into two pathways: one captures category-aware semantic information, while the other discerns boundaries from multi-scale features. The AFD module…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

whu-usi3dv/mobile-seed
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRobot Manipulation and Learning · Advanced Neural Network Applications · Multimodal Machine Learning Applications

MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings · Focus