Concept Incongruence: An Exploration of Time and Death in Role Playing

Xiaoyan Bai; Ike Peng; Aditya Singh; Chenhao Tan

arXiv:2505.14905·cs.CL·May 22, 2025

Concept Incongruence: An Exploration of Time and Death in Role Playing

Xiaoyan Bai, Ike Peng, Aditya Singh, Chenhao Tan

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper investigates how large language models handle concept incongruence, especially regarding temporal boundaries in role-playing scenarios, revealing challenges in abstention and accuracy, and proposing metrics and insights for improvement.

Contribution

The paper introduces the concept of incongruence in language models, proposes behavioral metrics for analysis, and identifies key causes affecting model performance in role-play settings.

Findings

01

Models fail to abstain after role death.

02

Accuracy drops under role-play incongruence.

03

Unreliable encoding of 'death' state affects behavior.

Abstract

Consider this prompt "Draw a unicorn with two horns". Should large language models (LLMs) recognize that a unicorn has only one horn by definition and ask users for clarifications, or proceed to generate something anyway? We introduce concept incongruence to capture such phenomena where concept boundaries clash with each other, either in user prompts or in model representations, often leading to under-specified or mis-specified behaviors. In this work, we take the first step towards defining and analyzing model behavior under concept incongruence. Focusing on temporal boundaries in the Role-Play setting, we propose three behavioral metrics--abstention rate, conditional accuracy, and answer rate--to quantify model behavior under incongruence due to the role's death. We show that models fail to abstain after death and suffer from an accuracy drop compared to the Non-Role-Play setting.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

chicagohai/concept-incongruence
noneOfficial

Videos

Concept Incongruence: An Exploration of Time and Death in Role Playing· slideslive

Taxonomy

TopicsTopic Modeling · Explainable Artificial Intelligence (XAI) · Multimodal Machine Learning Applications