FactorSmith: Agentic Simulation Generation via Markov Decision Process Decomposition with Planner-Designer-Critic Refinement
Ali Shamsaddinlou, Morteza NourelahiAlamdari

TL;DR
FactorSmith introduces a novel framework combining factored POMDP decomposition and an agentic planner-designer-critic workflow to generate high-quality, executable game simulations from natural language descriptions, addressing LLM reasoning limitations.
Contribution
It presents a new method that decomposes simulation generation into modular steps with iterative agentic refinement, improving code quality and alignment from textual specifications.
Findings
Enhanced prompt alignment in generated simulations
Reduced runtime errors compared to baselines
Higher code quality demonstrated on benchmark tasks
Abstract
Generating executable simulations from natural language specifications remains a challenging problem due to the limited reasoning capacity of large language models (LLMs) when confronted with large, interconnected codebases. This paper presents FactorSmith, a framework that synthesizes playable game simulations in code from textual descriptions by combining two complementary ideas: factored POMDP decomposition for principled context reduction and a hierarchical planner-designer-critic agentic workflow for iterative quality refinement at every generation step. Drawing on the factored partially observable Markov decision process (POMDP) representation introduced by FactorSim [Sun et al., 2024], the proposed method decomposes a simulation specification into modular steps where each step operates only on a minimal subset of relevant state variables, limiting the context window that any…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Artificial Intelligence in Games · Multi-Agent Systems and Negotiation
