Temporally Consistent Unsupervised Segmentation for Mobile Robot Perception
Christian Ellis, Maggie Wigness, Craig Lennon, Lance Fiondella

TL;DR
This paper introduces Frontier-Seg, a method for unsupervised, temporally consistent terrain segmentation in mobile robot video streams, eliminating the need for labeled data and improving perception in unstructured environments.
Contribution
Frontier-Seg is the first approach to enforce temporal consistency in unsupervised segmentation for mobile robot perception using foundation model features.
Findings
Effective unsupervised segmentation across diverse datasets
Maintains temporal consistency in terrain boundary detection
Operates without human supervision
Abstract
Rapid progress in terrain-aware autonomous ground navigation has been driven by advances in supervised semantic segmentation. However, these methods rely on costly data collection and labor-intensive ground truth labeling to train deep models. Furthermore, autonomous systems are increasingly deployed in unrehearsed, unstructured environments where no labeled data exists and semantic categories may be ambiguous or domain-specific. Recent zero-shot approaches to unsupervised segmentation have shown promise in such settings but typically operate on individual frames, lacking temporal consistency-a critical property for robust perception in unstructured environments. To address this gap we introduce Frontier-Seg, a method for temporally consistent unsupervised segmentation of terrain from mobile robot video streams. Frontier-Seg clusters superpixel-level features extracted from foundation…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
