Towards a Reward-Free Reinforcement Learning Framework for Vehicle   Control

Jielong Yang; Daoyuan Huang

arXiv:2502.15262·cs.LG·April 21, 2025

Towards a Reward-Free Reinforcement Learning Framework for Vehicle Control

Jielong Yang, Daoyuan Huang

PDF

TL;DR

This paper introduces a reward-free reinforcement learning framework for vehicle control that learns optimal policies without manually designed rewards, improving efficiency and adaptability in driving tasks.

Contribution

The proposed RFRLF framework directly learns target states for vehicle control using a target state prediction network and a reward-free policy network, eliminating the need for manual reward design.

Findings

01

Demonstrates effectiveness in vehicle driving control

02

Improves learning efficiency in reward-free settings

03

Adapts well to environments without explicit rewards

Abstract

Reinforcement learning plays a crucial role in vehicle control by guiding agents to learn optimal control strategies through designing or learning appropriate reward signals. However, in vehicle control applications, rewards typically need to be manually designed while considering multiple implicit factors, which easily introduces human biases. Although imitation learning methods does not rely on explicit reward signals, they necessitate high-quality expert actions, which are often challenging to acquire. To address these issues, we propose a reward-free reinforcement learning framework (RFRLF). This framework directly learns the target states to optimize agent behavior through a target state prediction network (TSPN) and a reward-free state-guided policy network (RFSGPN), avoiding the dependence on manually designed reward signals. Specifically, the policy network is learned via…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.