A Bayesian Approach to Reinforcement Learning of Vision-Based Vehicular   Control

Zahra Gharaee; Karl Holmquist; Linbo He; Michael Felsberg

arXiv:2104.03807·cs.CV·April 9, 2021

A Bayesian Approach to Reinforcement Learning of Vision-Based Vehicular Control

Zahra Gharaee, Karl Holmquist, Linbo He, Michael Felsberg

PDF

Open Access 1 Repo

TL;DR

This paper introduces a Bayesian reinforcement learning method for autonomous vehicle control using vision data, demonstrating improved performance and efficiency in a simulated urban environment.

Contribution

It presents a novel Bayesian temporal difference learning approach for vision-based vehicle control, with comprehensive evaluation in a realistic simulator.

Findings

01

Training on ground truth segmentation improves performance.

02

The proposed method reduces training time compared to existing approaches.

03

System outperforms competitors on the CARLA benchmark.

Abstract

In this paper, we present a state-of-the-art reinforcement learning method for autonomous driving. Our approach employs temporal difference learning in a Bayesian framework to learn vehicle control signals from sensor data. The agent has access to images from a forward facing camera, which are preprocessed to generate semantic segmentation maps. We trained our system using both ground truth and estimated semantic segmentation input. Based on our observations from a large set of experiments, we conclude that training the system on ground truth input data leads to better performance than training the system on estimated input even if estimated input is used for evaluation. The system is trained and evaluated in a realistic simulated urban environment using the CARLA simulator. The simulator also contains a benchmark that allows for comparing to other systems and methods. The required…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

zahrag/BRLVBVC
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Autonomous Vehicle Technology and Safety · Advanced Bandit Algorithms Research

MethodsEntropy Regularization · Proximal Policy Optimization · CARLA: An Open Urban Driving Simulator