# Single-step Options for Adversary Driving

**Authors:** Nazmus Sakib, Hengshuai Yao, Hong Zhang, Shangling Jui

arXiv: 1903.08606 · 2019-12-02

## TL;DR

This paper introduces a reinforcement learning approach using single-step options for adversary driving, which simplifies training and improves performance over primitive actions, human testers, and existing planning methods.

## Contribution

The paper proposes a novel single-step options framework that enhances training efficiency and outperforms existing methods in adversary driving scenarios.

## Key findings

- Faster and easier training compared to primitive-action agents
- Outperforms primitive-action RL, humans, and planning methods
- Demonstrates effectiveness of single-step options in adversary driving

## Abstract

In this paper, we use reinforcement learning for safety driving in adversary settings. In our work, the knowledge in state-of-art planning methods is reused by single-step options whose action suggestions are compared in parallel with primitive actions. We show two advantages by doing so. First, training this reinforcement learning agent is easier and faster than training the primitive-action agent. Second, our new agent outperforms the primitive-action reinforcement learning agent, human testers as well as the state-of-art planning methods that our agent queries as skill options.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1903.08606/full.md

## Figures

7 figures with captions in the complete paper: https://tomesphere.com/paper/1903.08606/full.md

## References

40 references — full list in the complete paper: https://tomesphere.com/paper/1903.08606/full.md

---
Source: https://tomesphere.com/paper/1903.08606