An Implementation of Werewolf Agent That does not Truly Trust LLMs

Takehiro Sato; Shintaro Ozaki; Daisaku Yokoyama

arXiv:2409.01575·cs.CL·September 4, 2024

An Implementation of Werewolf Agent That does not Truly Trust LLMs

Takehiro Sato, Shintaro Ozaki, Daisaku Yokoyama

PDF

Open Access

TL;DR

This paper presents a hybrid werewolf game agent combining LLMs and rule-based methods to improve conversational consistency, persona, and logical behavior, with qualitative evaluation showing increased human-likeness.

Contribution

The paper introduces a novel hybrid approach that integrates rule-based algorithms with LLMs to address challenges in werewolf game agents, enhancing consistency and persona.

Findings

01

Agent perceived as more human-like than unmodified LLM

02

Hybrid approach improves logical and situational responses

03

Agent can refute, end conversations, and behave with persona

Abstract

Werewolf is an incomplete information game, which has several challenges when creating a computer agent as a player given the lack of understanding of the situation and individuality of utterance (e.g., computer agents are not capable of characterful utterance or situational lying). We propose a werewolf agent that solves some of those difficulties by combining a Large Language Model (LLM) and a rule-based algorithm. In particular, our agent uses a rule-based algorithm to select an output either from an LLM or a template prepared beforehand based on the results of analyzing conversation history using an LLM. It allows the agent to refute in specific situations, identify when to end the conversation, and behave with persona. This approach mitigated conversational inconsistencies and facilitated logical utterance as a result. We also conducted a qualitative evaluation, which resulted in…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDigital Rights Management and Security