WildOS: Open-Vocabulary Object Search in the Wild
Hardik Shah, Erica Tevere, Deegan Atha, Marcel Kaufmann, Shehryar Khattak, Manthan Patel, Marco Hutter, Jonas Frey, Patrick Spieler

TL;DR
WildOS is a novel robotic navigation system that combines semantic visual reasoning with geometric exploration to enable long-range, open-vocabulary object search in complex outdoor environments, improving robustness and efficiency.
Contribution
This work introduces WildOS, integrating foundation-model-based visual reasoning with geometric exploration for semantic navigation and distant target localization in outdoor robotics.
Findings
WildOS outperforms geometric-only baselines in efficiency and autonomy.
The system effectively localizes open-vocabulary targets beyond immediate depth.
Field experiments demonstrate robustness across diverse terrains.
Abstract
Autonomous navigation in complex, unstructured outdoor environments requires robots to operate over long ranges without prior maps and limited depth sensing. In such settings, relying solely on geometric frontiers for exploration is often insufficient. In such settings, the ability to reason semantically about where to go and what is safe to traverse is crucial for robust, efficient exploration. This work presents WildOS, a unified system for long-range, open-vocabulary object search that combines safe geometric exploration with semantic visual reasoning. WildOS builds a sparse navigation graph to maintain spatial memory, while utilizing a foundation-model-based vision module, ExploRFM, to score frontier nodes of the graph. ExploRFM simultaneously predicts traversability, visual frontiers, and object similarity in image space, enabling real-time, onboard semantic navigation tasks. The…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMultimodal Machine Learning Applications · Robotics and Sensor-Based Localization · Advanced Neural Network Applications
