PixelNav: Towards Model-based Vision-Only Navigation with Topological Graphs

Sergey Bakulin; Timur Akhtyamov; Denis Fatykhov; German Devchich; Gonzalo Ferrer

arXiv:2507.20892·cs.RO·July 29, 2025

PixelNav: Towards Model-based Vision-Only Navigation with Topological Graphs

Sergey Bakulin, Timur Akhtyamov, Denis Fatykhov, German Devchich, Gonzalo Ferrer

PDF

TL;DR

PixelNav introduces a hybrid vision-only navigation system for mobile robots that combines deep learning and classical planning, using topological graphs for interpretability and scalability, validated through extensive real-world experiments.

Contribution

A hierarchical hybrid approach integrating deep learning and model-based planning with topological graphs for interpretable, scalable robot navigation.

Findings

01

Demonstrates efficiency in real-world navigation tasks

02

Provides higher interpretability than end-to-end models

03

Scalable system with effective environment representation

Abstract

This work proposes a novel hybrid approach for vision-only navigation of mobile robots, which combines advances of both deep learning approaches and classical model-based planning algorithms. Today, purely data-driven end-to-end models are dominant solutions to this problem. Despite advantages such as flexibility and adaptability, the requirement of a large amount of training data and limited interpretability are the main bottlenecks for their practical applications. To address these limitations, we propose a hierarchical system that utilizes recent advances in model predictive control, traversability estimation, visual place recognition, and pose estimation, employing topological graphs as a representation of the target environment. Using such a combination, we provide a scalable system with a higher level of interpretability compared to end-to-end approaches. Extensive real-world…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.