Human-Aligned Procedural Level Generation Reinforcement Learning via Text-Level-Sketch Shared Representation

In-Chang Baek; Seoyoung Lee; Sung-Hyun Kim; Geumhwan Hwang; KyungJoong Kim

arXiv:2508.09860·cs.AI·August 14, 2025

Human-Aligned Procedural Level Generation Reinforcement Learning via Text-Level-Sketch Shared Representation

In-Chang Baek, Seoyoung Lee, Sung-Hyun Kim, Geumhwan Hwang, KyungJoong Kim

PDF

TL;DR

This paper introduces VIPCGRL, a deep reinforcement learning framework that integrates text, level, and sketches to improve human-aligned procedural content generation, demonstrating enhanced human-likeness and controllability.

Contribution

It proposes a shared embedding space trained via quadruple contrastive learning across modalities, enabling better alignment with human styles and goals in PCGRL.

Findings

01

VIPCGRL outperforms baselines in human-likeness metrics

02

Experimental results validate improved controllability and alignment

03

Human evaluations confirm enhanced human-centered behavior

Abstract

Human-aligned AI is a critical component of co-creativity, as it enables models to accurately interpret human intent and generate controllable outputs that align with design goals in collaborative content creation. This direction is especially relevant in procedural content generation via reinforcement learning (PCGRL), which is intended to serve as a tool for human designers. However, existing systems often fall short of exhibiting human-centered behavior, limiting the practical utility of AI-driven generation tools in real-world design workflows. In this paper, we propose VIPCGRL (Vision-Instruction PCGRL), a novel deep reinforcement learning framework that incorporates three modalities-text, level, and sketches-to extend control modality and enhance human-likeness. We introduce a shared embedding space trained via quadruple contrastive learning across modalities and human-AI styles,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.