General Modular Harness for LLM Agents in Multi-Turn Gaming Environments

Yuxuan Zhang; Haoyang Yu; Lanxiang Hu; Haojian Jin; Hao Zhang

arXiv:2507.11633·cs.AI·July 17, 2025

General Modular Harness for LLM Agents in Multi-Turn Gaming Environments

Yuxuan Zhang, Haoyang Yu, Lanxiang Hu, Haojian Jin, Hao Zhang

PDF

Open Access

TL;DR

This paper presents a modular framework for LLM and VLM agents in multi-turn gaming environments, enabling flexible composition of perception, memory, and reasoning modules to improve performance across diverse games.

Contribution

It introduces a unified modular harness design that allows LLM-based agents to adapt to various gaming environments without domain-specific engineering.

Findings

01

Performance improves over baseline agents.

02

Memory is crucial in long-horizon puzzles.

03

Perception is vital in noisy visual arcade games.

Abstract

We introduce a modular harness design for LLM agents that composes of perception, memory, and reasoning components, enabling a single LLM or VLM backbone to tackle a wide spectrum of multi turn gaming environments without domain-specific engineering. Using classic and modern game suites as low-barrier, high-diversity testbeds, our framework provides a unified workflow for analyzing how each module affects performance across dynamic interactive settings. Extensive experiments demonstrate that the harness lifts gameplay performance consistently over un-harnessed baselines and reveals distinct contribution patterns, for example, memory dominates in long-horizon puzzles while perception is critical in vision noisy arcades. These findings highlight the effectiveness of our modular harness design in advancing general-purpose agent, given the familiarity and ubiquity of games in everyday human…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMulti-Agent Systems and Negotiation · Auction Theory and Applications · Reinforcement Learning in Robotics