Reward Function Optimization of a Deep Reinforcement Learning Collision   Avoidance System

Cooper Cone; Michael Owen; Luis Alvarez; Marc Brittain

arXiv:2212.00855·cs.AI·December 5, 2022

Reward Function Optimization of a Deep Reinforcement Learning Collision Avoidance System

Cooper Cone, Michael Owen, Luis Alvarez, Marc Brittain

PDF

Open Access

TL;DR

This paper presents a method to optimize the reward function of a deep reinforcement learning-based collision avoidance system for unmanned aircraft, improving safety and operational viability through surrogate optimizer tuning.

Contribution

It introduces a surrogate optimizer-based tuning approach for DRL reward functions, enhancing collision avoidance performance for UAS.

Findings

01

Improved safety metrics in collision avoidance scenarios.

02

Enhanced operational viability of UAS collision systems.

03

Demonstrated effectiveness of surrogate optimizer tuning.

Abstract

The proliferation of unmanned aircraft systems (UAS) has caused airspace regulation authorities to examine the interoperability of these aircraft with collision avoidance systems initially designed for large transport category aircraft. Limitations in the currently mandated TCAS led the Federal Aviation Administration to commission the development of a new solution, the Airborne Collision Avoidance System X (ACAS X), designed to enable a collision avoidance capability for multiple aircraft platforms, including UAS. While prior research explored using deep reinforcement learning algorithms (DRL) for collision avoidance, DRL did not perform as well as existing solutions. This work explores the benefits of using a DRL collision avoidance system whose parameters are tuned using a surrogate optimizer. We show the use of a surrogate optimizer leads to DRL approach that can increase safety and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAir Traffic Management and Optimization · Traffic and Road Safety · Autonomous Vehicle Technology and Safety