Encoding Distributional Soft Actor-Critic for Autonomous Driving in   Multi-lane Scenarios

Jingliang Duan; Yangang Ren; Fawang Zhang; Yang Guan; Dongjie Yu,; Shengbo Eben Li; Bo Cheng; Lin Zhao

arXiv:2109.05540·cs.RO·September 14, 2021·5 cites

Encoding Distributional Soft Actor-Critic for Autonomous Driving in Multi-lane Scenarios

Jingliang Duan, Yangang Ren, Fawang Zhang, Yang Guan, Dongjie Yu,, Shengbo Eben Li, Bo Cheng, Lin Zhao

PDF

Open Access

TL;DR

This paper introduces E-DSAC, a novel reinforcement learning algorithm for autonomous driving that handles variable surrounding vehicles without manual rules, achieving higher performance and safety in multi-lane scenarios.

Contribution

The paper develops an encoding distributional policy iteration framework with permutation invariance, and proposes E-DSAC, a new RL algorithm that improves decision-making in autonomous driving.

Findings

01

E-DSAC policy is about three times more effective than DSAC.

02

E-DSAC achieves efficient, smooth, and safe driving in multi-lane scenarios.

03

The algorithm's effectiveness is validated in real vehicle experiments.

Abstract

In this paper, we propose a new reinforcement learning (RL) algorithm, called encoding distributional soft actor-critic (E-DSAC), for decision-making in autonomous driving. Unlike existing RL-based decision-making methods, E-DSAC is suitable for situations where the number of surrounding vehicles is variable and eliminates the requirement for manually pre-designed sorting rules, resulting in higher policy performance and generality. We first develop an encoding distributional policy iteration (DPI) framework by embedding a permutation invariant module, which employs a feature neural network (NN) to encode the indicators of each vehicle, in the distributional RL framework. The proposed DPI framework is proved to exhibit important properties in terms of convergence and global optimality. Next, based on the developed encoding DPI framework, we propose the E-DSAC algorithm by adding the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Autonomous Vehicle Technology and Safety · Traffic control and management