# Autonomous Highway Driving using Deep Reinforcement Learning

**Authors:** Subramanya Nageshrao, Eric Tseng, Dimitar Filev

arXiv: 1904.00035 · 2019-04-02

## TL;DR

This paper presents a deep reinforcement learning approach for autonomous highway driving that incorporates safety checks to improve learning efficiency and ensure safe decision-making in diverse traffic scenarios.

## Contribution

It introduces a novel RL method with safety checks that enhances training stability and generalizes to avoid undesirable near-misses and collisions.

## Key findings

- Effective in varying traffic densities
- Improved safety and stability during training
- Enhanced decision-making in complex scenarios

## Abstract

The operational space of an autonomous vehicle (AV) can be diverse and vary significantly. This may lead to a scenario that was not postulated in the design phase. Due to this, formulating a rule based decision maker for selecting maneuvers may not be ideal. Similarly, it may not be effective to design an a-priori cost function and then solve the optimal control problem in real-time. In order to address these issues and to avoid peculiar behaviors when encountering unforeseen scenario, we propose a reinforcement learning (RL) based method, where the ego car, i.e., an autonomous vehicle, learns to make decisions by directly interacting with simulated traffic. The decision maker for AV is implemented as a deep neural network providing an action choice for a given system state. In a critical application such as driving, an RL agent without explicit notion of safety may not converge or it may need extremely large number of samples before finding a reliable policy. To best address the issue, this paper incorporates reinforcement learning with an additional short horizon safety check (SC). In a critical scenario, the safety check will also provide an alternate safe action to the agent provided if it exists. This leads to two novel contributions. First, it generalizes the states that could lead to undesirable "near-misses" or "collisions ". Second, inclusion of safety check can provide a safe and stable training environment. This significantly enhances learning efficiency without inhibiting meaningful exploration to ensure safe and optimal learned behavior. We demonstrate the performance of the developed algorithm in highway driving scenario where the trained AV encounters varying traffic density in a highway setting.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1904.00035/full.md

## Figures

14 figures with captions in the complete paper: https://tomesphere.com/paper/1904.00035/full.md

## References

33 references — full list in the complete paper: https://tomesphere.com/paper/1904.00035/full.md

---
Source: https://tomesphere.com/paper/1904.00035