Boardwalk: Towards a Framework for Creating Board Games with LLMs
\'Alvaro Guglielmin Becker, Gabriel Bauer de Oliveira, Lana Bertoldo Rossato, Anderson Rocha Tavares

TL;DR
This paper explores using large language models to automatically generate playable digital versions of board games from natural language rules, aiming to streamline game development.
Contribution
It introduces Boardwalk, a framework for LLM-based board game code generation, and evaluates multiple models' ability to implement diverse games accurately.
Findings
Claude 3.7 Sonnet achieved 55.6% error-free implementations.
API compliance increased error frequency, but error severity depended on the LLM.
The approach demonstrates viability for automated board game coding.
Abstract
Implementing board games in code can be a time-consuming task. However, Large Language Models (LLMs) have been proven effective at generating code for domain-specific tasks with simple contextual information. We aim to investigate whether LLMs can implement digital versions of board games from rules described in natural language. This would be a step towards an LLM-assisted framework for quick board game code generation. We expect to determine the main challenges for LLMs to implement the board games, and how different approaches and models compare to one another. We task three state-of-the-art LLMs (Claude, DeepSeek and ChatGPT) with coding a selection of 12 popular and obscure games in free-form and within Boardwalk, our proposed General Game Playing API. We anonymize the games and components to avoid evoking pre-trained LLM knowledge. The implementations are tested for playability…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
