Monocular Obstacle Avoidance Based on Inverse PPO for Fixed-wing UAVs
Haochen Chai, Meimei Su, Yang Lyu, Zhunga Liu, Chunhui Zhao, Quan Pan

TL;DR
This paper presents a lightweight deep reinforcement learning system for fixed-wing UAVs that enables obstacle avoidance using only onboard visual sensors, suitable for real-time deployment in unknown low-altitude environments.
Contribution
It introduces a novel DRL-based obstacle avoidance framework with a streamlined depth inference module and adaptive entropy mechanism tailored for fixed-wing UAVs.
Findings
Outperforms existing methods in obstacle avoidance efficiency
Ensures real-time operation on edge computing devices
Demonstrates successful hardware-in-the-loop experiments
Abstract
Fixed-wing Unmanned Aerial Vehicles (UAVs) are one of the most commonly used platforms for the burgeoning Low-altitude Economy (LAE) and Urban Air Mobility (UAM), due to their long endurance and high-speed capabilities. Classical obstacle avoidance systems, which rely on prior maps or sophisticated sensors, face limitations in unknown low-altitude environments and small UAV platforms. In response, this paper proposes a lightweight deep reinforcement learning (DRL) based UAV collision avoidance system that enables a fixed-wing UAV to avoid unknown obstacles at cruise speed over 30m/s, with only onboard visual sensors. The proposed system employs a single-frame image depth inference module with a streamlined network architecture to ensure real-time obstacle detection, optimized for edge computing devices. After that, a reinforcement learning controller with a novel reward function is…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAerospace and Aviation Technology · Aerospace Engineering and Control Systems · Air Traffic Management and Optimization
MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings
