Dungeon Crawl Stone Soup as an Evaluation Domain for Artificial Intelligence
Dustin Dannenhauer, Michael W. Floyd, Jonathan Decker, David W. Aha

TL;DR
Dungeon Crawl Stone Soup, a complex open-source rogue-like game, serves as a valuable platform for evaluating artificial intelligence systems due to its decision space and ongoing API development.
Contribution
The paper introduces the properties of Dungeon Crawl Stone Soup that make it suitable for AI evaluation and details an API to facilitate research in this domain.
Findings
Complex decision space enables diverse AI testing
API development supports standardized AI research
Potential for human-AI interaction studies
Abstract
Dungeon Crawl Stone Soup is a popular, single-player, free and open-source rogue-like video game with a sufficiently complex decision space that makes it an ideal testbed for research in cognitive systems and, more generally, artificial intelligence. This paper describes the properties of Dungeon Crawl Stone Soup that are conducive to evaluating new approaches of AI systems. We also highlight an ongoing effort to build an API for AI researchers in the spirit of recent game APIs such as MALMO, ELF, and the Starcraft II API. Dungeon Crawl Stone Soup's complexity offers significant opportunities for evaluating AI and cognitive systems, including human user studies. In this paper we provide (1) a description of the state space of Dungeon Crawl Stone Soup, (2) a description of the components for our API, and (3) the potential benefits of evaluating AI agents in the Dungeon Crawl Stone Soup…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsArtificial Intelligence in Games · Reinforcement Learning in Robotics · AI-based Problem Solving and Planning
