An online evolving framework for advancing reinforcement-learning based   automated vehicle control

Teawon Han; Subramanya Nageshrao; Dimitar P. Filev; Umit Ozguner

arXiv:2006.08092·eess.SY·June 17, 2020

An online evolving framework for advancing reinforcement-learning based automated vehicle control

Teawon Han, Subramanya Nageshrao, Dimitar P. Filev, Umit Ozguner

PDF

Open Access

TL;DR

This paper introduces an online evolving framework that enhances reinforcement learning-based vehicle control by detecting and revising suboptimal decisions in real-time, improving safety and reliability in autonomous driving scenarios.

Contribution

The paper presents a novel online evolving framework with modules for dynamic state modeling and action validation, advancing reinforcement learning control methods for autonomous vehicles.

Findings

01

Inappropriate actions are effectively detected and revised.

02

The framework reduces control failures in vehicle following scenarios.

03

Enhanced safety and decision accuracy in autonomous driving.

Abstract

In this paper, an online evolving framework is proposed to detect and revise a controller's imperfect decision-making in advance. The framework consists of three modules: the evolving Finite State Machine (e-FSM), action-reviser, and controller modules. The e-FSM module evolves a stochastic model (e.g., Discrete-Time Markov Chain) from scratch by determining new states and identifying transition probabilities repeatedly. With the latest stochastic model and given criteria, the action-reviser module checks validity of the controller's chosen action by predicting future states. Then, if the chosen action is not appropriate, another action is inspected and selected. In order to show the advantage of the proposed framework, the Deep Deterministic Policy Gradient (DDPG) w/ and w/o the online evolving framework are applied to control an ego-vehicle in the car-following scenario where control…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Autonomous Vehicle Technology and Safety · Traffic control and management

MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings · Dense Connections · Convolution · Weight Decay · Adam · *Communicated@Fast*How Do I Communicate to Expedia? · Batch Normalization · Experience Replay · Deep Deterministic Policy Gradient