A New Approach for Tactical Decision Making in Lane Changing: Sample Efficient Deep Q Learning with a Safety Feedback Reward

M. Ugur Yavas; N. Kemal Ure; Tufan Kumbasar

arXiv:2009.11905 · cs.AI · September 28, 2020 · 1 citation

TL;DR

This paper introduces a safety-enhanced Deep Q Learning approach using Rainbow DQN for lane-changing in automated vehicles, improving safety, efficiency, and interpretability in dynamic environments.

Contribution

It proposes a novel safety feedback reward scheme integrated with Rainbow DQN, enhancing sample efficiency and decision interpretability in lane change tasks.

Findings

1. Significant performance improvement over baseline algorithms.

2. Enhanced sample efficiency with only 200,000 training steps.

3. Better interpretability of agent actions through Q value distributions.

Abstract

Automated lane change is one of the most challenging tasks for highly automated vehicles due to its safety-critical, uncertain, and multi-agent nature. This paper presents a novel deployment of the state-of-the-art Q learning method, namely Rainbow DQN, that uses a new safety-driven rewarding scheme to tackle these issues in a dynamic and uncertain simulation environment. We present various comparative results to show that our novel approach of having reward feedback from the safety layer dramatically increases both the agent's performance and sample efficiency. Furthermore, through the novel deployment of Rainbow DQN, it is shown that more intuition about the agent's actions can be extracted by examining the distributions of the Q values the agent generates. The proposed algorithm shows superior performance to the baseline algorithm in challenging scenarios with only 200000…
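To make the idea of reward feedback from a safety layer concrete, the sketch below shows one way such a scheme could be wired up. It is an illustration under stated assumptions, not the authors' implementation: the gap-based rule, the `Gap` type, the `min_gap_m` threshold, and the `safety_penalty` value are all hypothetical. The only point carried over from the abstract is that the safety layer's decision feeds back into the reward the agent learns from.

```python
from dataclasses import dataclass

# Hypothetical discrete action set for a lane-change agent.
KEEP_LANE, CHANGE_LEFT, CHANGE_RIGHT = 0, 1, 2


@dataclass
class Gap:
    """Free distance (metres) ahead and behind in the target lane (assumed state)."""
    front_m: float
    rear_m: float


def safety_layer(action: int, target_gap: Gap, min_gap_m: float = 10.0):
    """Veto a lane change if the target-lane gap is too small.

    Returns (executed_action, intervened). The gap rule and threshold are
    illustrative assumptions, not values from the paper.
    """
    if action == KEEP_LANE:
        return action, False
    if target_gap.front_m < min_gap_m or target_gap.rear_m < min_gap_m:
        return KEEP_LANE, True  # unsafe request is replaced by lane keeping
    return action, False


def shaped_reward(base_reward: float, intervened: bool,
                  safety_penalty: float = 1.0) -> float:
    """Subtract a penalty whenever the safety layer had to intervene."""
    return base_reward - safety_penalty if intervened else base_reward


# Usage: the agent asks for a left lane change into a 5 m front gap.
executed, intervened = safety_layer(CHANGE_LEFT, Gap(front_m=5.0, rear_m=30.0))
reward = shaped_reward(0.5, intervened)
```

With this kind of shaping, the agent receives a learning signal for unsafe requests even though those requests are never executed in the environment, which is one plausible reason such feedback could help sample efficiency.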

Taxonomy

Topics: Autonomous Vehicle Technology and Safety · Traffic control and management · Reinforcement Learning in Robotics

Methods: Convolution · Q-Learning · Dense Connections · N-step Returns · Noisy Linear Layer · Double Q-learning · Deep Q-Network · Dueling Network · Prioritized Experience Replay · Rainbow DQN
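Two of the listed components, N-step returns and Double Q-learning, combine into a single bootstrapped target. The sketch below is a simplified scalar version for illustration only: Rainbow DQN itself learns a distribution over returns rather than a single Q value, and the function names and example numbers here are assumptions, not taken from the paper.

```python
def n_step_return(rewards, gamma):
    """Discounted sum r_0 + gamma*r_1 + ... over the n collected rewards."""
    g = 0.0
    for r in reversed(rewards):
        g = r + gamma * g
    return g


def double_q_n_step_target(rewards, gamma, q_online_next, q_target_next, done):
    """Scalar n-step Double Q-learning target.

    The online network's values select the next action; the target network's
    values evaluate it. No bootstrap term is added if the episode ended.
    """
    g = n_step_return(rewards, gamma)
    if done:
        return g
    best = max(range(len(q_online_next)), key=q_online_next.__getitem__)
    return g + gamma ** len(rewards) * q_target_next[best]


# Usage with made-up numbers: three rewards, two actions in the next state.
target = double_q_n_step_target(
    rewards=[1.0, 1.0, 1.0],
    gamma=0.5,
    q_online_next=[0.2, 0.9],   # online net prefers action 1
    q_target_next=[4.0, 8.0],   # target net evaluates action 1 at 8.0
    done=False,
)
```

Decoupling action selection from action evaluation is what distinguishes the Double Q-learning target from the standard one, which would take the maximum over `q_target_next` directly.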