Smooth Trajectory Collision Avoidance through Deep Reinforcement Learning
Sirui Song, Kirk Saunders, Ye Yue, Jundong Liu

TL;DR
This paper introduces a deep reinforcement learning approach for autonomous UAV navigation that emphasizes smooth trajectories and better generalization to unseen environments, using novel reward functions and state designs.
Contribution
It proposes new agent state and reward function designs that improve trajectory smoothness and model generalization in DRL-based UAV navigation.
Findings
Enhanced flight trajectory smoothness with minimized derivatives.
Improved collision avoidance in unseen environments.
Effective DRL-based navigation demonstrated through experiments.
Abstract
Collision avoidance is a crucial task in vision-guided autonomous navigation. Solutions based on deep reinforcement learning (DRL) has become increasingly popular. In this work, we proposed several novel agent state and reward function designs to tackle two critical issues in DRL-based navigation solutions: 1) smoothness of the trained flight trajectories; and 2) model generalization to handle unseen environments. Formulated under a DRL framework, our model relies on margin reward and smoothness constraints to ensure UAVs fly smoothly while greatly reducing the chance of collision. The proposed smoothness reward minimizes a combination of first-order and second-order derivatives of flight trajectories, which can also drive the points to be evenly distributed, leading to stable flight speed. To enhance the agent's capability of handling new unseen environments, two practical setups are…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobotic Path Planning Algorithms · Reinforcement Learning in Robotics · UAV Applications and Optimization
