Skill Check: Some Considerations on the Evaluation of Gamemastering   Models for Role-playing Games

Santiago G\'ongora; Luis Chiruzzo; Gonzalo M\'endez; Pablo Gerv\'as

arXiv:2309.13702·cs.CL·October 3, 2023·1 cites

Skill Check: Some Considerations on the Evaluation of Gamemastering Models for Role-playing Games

Santiago G\'ongora, Luis Chiruzzo, Gonzalo M\'endez, Pablo Gerv\'as

PDF

Open Access 1 Repo

TL;DR

This paper discusses the challenges in modeling Game Masters for role-playing games using AI, proposes evaluation categories, and tests existing AI models like ChatGPT and Bard as GMs.

Contribution

It introduces a framework for evaluating AI-based Game Master models and provides empirical testing of popular language models in this role.

Findings

01

ChatGPT performs comparably to specialized GMs

02

Evaluation categories reveal strengths and weaknesses of models

03

Proposes future directions for AI-driven role-playing game GMs

Abstract

In role-playing games a Game Master (GM) is the player in charge of the game, who must design the challenges the players face and narrate the outcomes of their actions. In this work we discuss some challenges to model GMs from an Interactive Storytelling and Natural Language Processing perspective. Following those challenges we propose three test categories to evaluate such dialogue systems, and we use them to test ChatGPT, Bard and OpenAssistant as out-of-the-box GMs.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

sgongora27/skill-check-gm-tests
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Artificial Intelligence in Games · Natural Language Processing Techniques