Self-correcting Reward Shaping via Language Models for Reinforcement Learning Agents in Games

Ant\'onio Afonso; Iolanda Leite; Alessandro Sestini; Florian Fuchs; Konrad Tollmar; Linus Gissl\'en

arXiv:2506.23626·cs.AI·July 1, 2025

Self-correcting Reward Shaping via Language Models for Reinforcement Learning Agents in Games

Ant\'onio Afonso, Iolanda Leite, Alessandro Sestini, Florian Fuchs, Konrad Tollmar, Linus Gissl\'en

PDF

Open Access

TL;DR

This paper introduces an automated, language model-guided method for iteratively tuning reward functions in reinforcement learning agents for games, reducing manual effort and improving performance.

Contribution

It presents a novel approach where language models self-correct reward weights based on behavioral goals and performance data, enabling automated reward shaping in RL agents.

Findings

01

Significant performance improvements in racing agents, from 9% to 74% success rate.

02

LM-guided tuning achieves 80% success, close to expert-tuned 94%.

03

Automated reward shaping reduces manual effort and adapts to game changes.

Abstract

Reinforcement Learning (RL) in games has gained significant momentum in recent years, enabling the creation of different agent behaviors that can transform a player's gaming experience. However, deploying RL agents in production environments presents two key challenges: (1) designing an effective reward function typically requires an RL expert, and (2) when a game's content or mechanics are modified, previously tuned reward weights may no longer be optimal. Towards the latter challenge, we propose an automated approach for iteratively fine-tuning an RL agent's reward function weights, based on a user-defined language based behavioral goal. A Language Model (LM) proposes updated weights at each iteration based on this target behavior and a summary of performance statistics from prior training rounds. This closed-loop process allows the LM to self-correct and refine its output over time,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Artificial Intelligence in Games · Explainable Artificial Intelligence (XAI)