GoalSwarm: Multi-UAV Semantic Coordination for Open-Vocabulary Object Navigation
MoniJesu Wonders James, Amir Atef Habel, Aleksey Fedoseev, and Dzmitry Tsetserokou

TL;DR
GoalSwarm enables decentralized multi-UAV teams to perform open-vocabulary object navigation efficiently by constructing shared semantic maps, integrating foundation models, and coordinating exploration without heavy onboard computation.
Contribution
It introduces a novel decentralized framework that combines lightweight semantic mapping, foundation model integration, and multi-agent coordination for open-vocabulary navigation.
Findings
Achieves robust zero-shot object navigation in unknown environments.
Reduces onboard computational load with 2D semantic mapping.
Demonstrates effective multi-UAV coordination for exploration.
Abstract
Cooperative visual semantic navigation is a foundational capability for aerial robot teams operating in unknown environments. However, achieving robust open-vocabulary object-goal navigation remains challenging due to the computational constraints of deploying heavy perception models onboard and the complexity of decentralized multi-agent coordination. We present GoalSwarm, a fully decentralized multi-UAV framework for zero-shot semantic object-goal navigation. Each UAV collaboratively constructs a shared, lightweight 2D top-down semantic occupancy map by projecting depth observations from aerial vantage points, eliminating the computational burden of full 3D representations while preserving essential geometric and semantic structure. The core contributions of GoalSwarm are threefold: (1) integration of zero-shot foundation model -- SAM3 for open vocabulary detection and pixel-level…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobotics and Sensor-Based Localization · Robotic Path Planning Algorithms · Advanced Neural Network Applications
