NSS-VAEs: Generative Scene Decomposition for Visual Navigable Space Construction
Zheng Chen, Lantao Liu

TL;DR
NSS-VAEs is an unsupervised generative model that learns to segment navigable space in scenes, accurately and with uncertainty estimation, without requiring pixel-level annotations, thus aiding robot navigation in unstructured environments.
Contribution
The paper introduces NSS-VAEs, a novel unsupervised variational autoencoder framework for scene decomposition that outperforms supervised methods in navigable space segmentation.
Findings
Achieves over 90% accuracy without labels
Outperforms state-of-the-art supervised methods with few labels
Provides uncertainty estimates for scene segmentation
Abstract
Detecting navigable space is the first and also a critical step for successful robot navigation. In this work, we treat the visual navigable space segmentation as a scene decomposition problem and propose a new network, NSS-VAEs (Navigable Space Segmentation Variational AutoEncoders), a representation-learning-based framework to enable robots to learn the navigable space segmentation in an unsupervised manner. Different from prevalent segmentation techniques which heavily rely on supervised learning strategies and typically demand immense pixel-level annotated images, the proposed framework leverages a generative model - Variational Auto-Encoder (VAE) - to learn a probabilistic polyline representation that compactly outlines the desired navigable space boundary. Uniquely, our method also assesses the prediction uncertainty related to the unstructuredness of the scenes, which is…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobotics and Sensor-Based Localization · Advanced Neural Network Applications · Multimodal Machine Learning Applications
