Learning to Drive Using Sparse Imitation Reinforcement Learning
Yuci Han, Alper Yilmaz

TL;DR
This paper introduces Sparse Imitation Reinforcement Learning (SIRL), a hybrid approach combining expert rules and reinforcement learning to improve autonomous driving efficiency, safety, and generalization in complex urban scenarios within the CARLA simulator.
Contribution
The paper presents a novel hybrid SIRL method that accelerates training, enhances safety, and surpasses traditional RL and expert policies in urban autonomous driving tasks.
Findings
SIRL accelerates training compared to traditional RL.
SIRL achieves safer exploration in complex scenarios.
SIRL generalizes well to unseen environments.
Abstract
In this paper, we propose Sparse Imitation Reinforcement Learning (SIRL), a hybrid end-to-end control policy that combines the sparse expert driving knowledge with reinforcement learning (RL) policy for autonomous driving (AD) task in CARLA simulation environment. The sparse expert is designed based on hand-crafted rules which is suboptimal but provides a risk-averse strategy by enforcing experience for critical scenarios such as pedestrian and vehicle avoidance, and traffic light detection. As it has been demonstrated, training a RL agent from scratch is data-inefficient and time consuming particularly for the urban driving task, due to the complexity of situations stemming from the vast size of state space. Our SIRL strategy provides a solution to solve these problems by fusing the output distribution of the sparse expert policy and the RL policy to generate a composite driving…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAutonomous Vehicle Technology and Safety · Traffic control and management · Traffic Prediction and Management Techniques
MethodsEntropy Regularization · Proximal Policy Optimization · CARLA: An Open Urban Driving Simulator
