PCGRLLM: Large Language Model-Driven Reward Design for Procedural   Content Generation Reinforcement Learning

In-Chang Baek; Sung-Hyun Kim; Sam Earle; Zehua Jiang; Noh Jin-Ha,; Julian Togelius; Kyung-Joong Kim

arXiv:2502.10906·cs.AI·February 18, 2025

PCGRLLM: Large Language Model-Driven Reward Design for Procedural Content Generation Reinforcement Learning

In-Chang Baek, Sung-Hyun Kim, Sam Earle, Zehua Jiang, Noh Jin-Ha,, Julian Togelius, Kyung-Joong Kim

PDF

Open Access

TL;DR

This paper introduces PCGRLLM, a novel approach using large language models with feedback and reasoning techniques to automate reward design in game AI, significantly reducing human effort and improving performance.

Contribution

The work presents an extended architecture for reward generation employing LLMs with feedback and reasoning prompts, demonstrating improved performance in content generation tasks.

Findings

01

Achieved 415% performance improvement with one LLM

02

Achieved 40% performance improvement with another LLM

03

Demonstrated generalizability across environments and models

Abstract

Reward design plays a pivotal role in the training of game AIs, requiring substantial domain-specific knowledge and human effort. In recent years, several studies have explored reward generation for training game agents and controlling robots using large language models (LLMs). In the content generation literature, there has been early work on generating reward functions for reinforcement learning agent generators. This work introduces PCGRLLM, an extended architecture based on earlier work, which employs a feedback mechanism and several reasoning-based prompt engineering techniques. We evaluate the proposed method on a story-to-reward generation task in a two-dimensional environment using two state-of-the-art LLMs, demonstrating the generalizability of our approach. Our experiments provide insightful evaluations that demonstrate the capabilities of LLMs essential for content generation…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Software Engineering Research