Curriculum-Based Reinforcement Learning for Autonomous UAV Navigation in Unknown Curved Tubular Conduit
Zamirddine Mari, J\'er\^ome Pasquet, Julien Seinturier

TL;DR
This paper presents a reinforcement learning method enabling autonomous UAV navigation in unknown curved tubular environments using only local sensors, outperforming traditional algorithms with explicit geometric knowledge.
Contribution
A curriculum-based reinforcement learning framework allowing UAVs to navigate unknown tubes without prior geometric information, using local observations and a novel turning-negotiation mechanism.
Findings
RL policy outperforms deterministic baseline in unknown environments
The approach generalizes to complex curved geometries
Successful transfer from simulation to physical environment
Abstract
Autonomous drone navigation in confined tubular environments remains a major challenge due to the constraining geometry of the conduits, the proximity of the walls, and the perceptual limitations inherent to such scenarios. We propose a reinforcement learning approach enabling a drone to navigate unknown three-dimensional tubes without any prior knowledge of their geometry, relying solely on local observations from LiDAR and a conditional visual detection of the tube center. In contrast, the Pure Pursuit algorithm, used as a deterministic baseline, benefits from explicit access to the centerline, creating an information asymmetry designed to assess the ability of RL to compensate for the absence of a geometric model. The agent is trained through a progressive Curriculum Learning strategy that gradually exposes it to increasingly curved geometries, where the tube center frequently…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobotics and Sensor-Based Localization · Robotic Path Planning Algorithms · Distributed Control Multi-Agent Systems
