Playing the Werewolf game with artificial intelligence for language understanding
Hisaichi Shibata, Soichiro Miki, Yuta Nakamura

TL;DR
This paper presents Deep Wolf, an AI agent trained to play the social deduction game Werewolf through natural language, demonstrating competitive performance in some roles and highlighting AI's potential in understanding deception and language in social contexts.
Contribution
Developed Deep Wolf, an AI agent using a Transformer-based language model fine-tuned for predicting game outcomes and engaging in natural language deception detection and strategy.
Findings
Deep Wolf performs as well as average humans in villager and betrayer roles.
Deep Wolf is less effective than humans in werewolf and seer roles.
Language models can suspect, lie, and detect lies in social conversations.
Abstract
The Werewolf game is a social deduction game based on free natural language communication, in which players try to deceive others in order to survive. An important feature of this game is that a large portion of the conversations are false information, and the behavior of artificial intelligence (AI) in such a situation has not been widely investigated. The purpose of this study is to develop an AI agent that can play Werewolf through natural language conversations. First, we collected game logs from 15 human players. Next, we fine-tuned a Transformer-based pretrained language model to construct a value network that can predict a posterior probability of winning a game at any given phase of the game and given a candidate for the next action. We then developed an AI agent that can interact with humans and choose the best voting target on the basis of its probability from the value…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling
