Dual Pose-Graph Semantic Localization for Vision-Based Autonomous Drone Racing

David Perez-Saura; Miguel Fernandez-Cortizas; Alvaro J. Gaona; Pascual Campoy

arXiv:2604.15168·cs.RO·April 17, 2026

Dual Pose-Graph Semantic Localization for Vision-Based Autonomous Drone Racing

David Perez-Saura, Miguel Fernandez-Cortizas, Alvaro J. Gaona, Pascual Campoy

PDF

TL;DR

This paper introduces a dual pose-graph system combining odometry and semantic detections to improve real-time localization for autonomous drone racing, outperforming traditional SLAM methods under extreme conditions.

Contribution

The novel dual pose-graph architecture effectively fuses multiple observations to enhance accuracy and robustness in high-speed drone racing scenarios.

Findings

01

Achieved 56% to 74% reduction in ATE compared to standalone VIO.

02

Dual-graph architecture outperforms single-graph baseline by 10-12% in accuracy.

03

System reduces odometry drift by up to 4.2 meters per lap during real-time flight.

Abstract

Autonomous drone racing demands robust real-time localization under extreme conditions: high-speed flight, aggressive maneuvers, and payload-constrained platforms that often rely on a single camera for perception. Existing visual SLAM systems, while effective in general scenarios, struggle with motion blur and feature instability inherent to racing dynamics, and do not exploit the structured nature of racing environments. In this work, we present a dual pose-graph architecture that fuses odometry with semantic detections for robust localization. A temporary graph accumulates multiple gate observations between keyframes and optimizes them into a single refined constraint per landmark, which is then promoted to a persistent main graph. This design preserves the information richness of frequent detections while preventing graph growth from degrading real-time performance. The system is…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.