Reward Function Optimization of a Deep Reinforcement Learning Collision Avoidance System
Cooper Cone, Michael Owen, Luis Alvarez, Marc Brittain

TL;DR
This paper presents a method to optimize the reward function of a deep reinforcement learning-based collision avoidance system for unmanned aircraft, improving safety and operational viability through surrogate optimizer tuning.
Contribution
It introduces a surrogate optimizer-based tuning approach for DRL reward functions, enhancing collision avoidance performance for UAS.
Findings
Improved safety metrics in collision avoidance scenarios.
Enhanced operational viability of UAS collision systems.
Demonstrated effectiveness of surrogate optimizer tuning.
Abstract
The proliferation of unmanned aircraft systems (UAS) has caused airspace regulation authorities to examine the interoperability of these aircraft with collision avoidance systems initially designed for large transport category aircraft. Limitations in the currently mandated TCAS led the Federal Aviation Administration to commission the development of a new solution, the Airborne Collision Avoidance System X (ACAS X), designed to enable a collision avoidance capability for multiple aircraft platforms, including UAS. While prior research explored using deep reinforcement learning algorithms (DRL) for collision avoidance, DRL did not perform as well as existing solutions. This work explores the benefits of using a DRL collision avoidance system whose parameters are tuned using a surrogate optimizer. We show the use of a surrogate optimizer leads to DRL approach that can increase safety and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAir Traffic Management and Optimization · Traffic and Road Safety · Autonomous Vehicle Technology and Safety
