Curriculum-Based Reinforcement Learning for Autonomous UAV Navigation in Unknown Curved Tubular Conduit

Zamirddine Mari; J\'er\^ome Pasquet; Julien Seinturier

arXiv:2512.10934·cs.RO·December 12, 2025

Curriculum-Based Reinforcement Learning for Autonomous UAV Navigation in Unknown Curved Tubular Conduit

Zamirddine Mari, J\'er\^ome Pasquet, Julien Seinturier

PDF

Open Access

TL;DR

This paper presents a reinforcement learning method enabling autonomous UAV navigation in unknown curved tubular environments using only local sensors, outperforming traditional algorithms with explicit geometric knowledge.

Contribution

A curriculum-based reinforcement learning framework allowing UAVs to navigate unknown tubes without prior geometric information, using local observations and a novel turning-negotiation mechanism.

Findings

01

RL policy outperforms deterministic baseline in unknown environments

02

The approach generalizes to complex curved geometries

03

Successful transfer from simulation to physical environment

Abstract

Autonomous drone navigation in confined tubular environments remains a major challenge due to the constraining geometry of the conduits, the proximity of the walls, and the perceptual limitations inherent to such scenarios. We propose a reinforcement learning approach enabling a drone to navigate unknown three-dimensional tubes without any prior knowledge of their geometry, relying solely on local observations from LiDAR and a conditional visual detection of the tube center. In contrast, the Pure Pursuit algorithm, used as a deterministic baseline, benefits from explicit access to the centerline, creating an information asymmetry designed to assess the ability of RL to compensate for the absence of a geometric model. The agent is trained through a progressive Curriculum Learning strategy that gradually exposes it to increasingly curved geometries, where the tube center frequently…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRobotics and Sensor-Based Localization · Robotic Path Planning Algorithms · Distributed Control Multi-Agent Systems