LLMs vs. Chinese Anime Enthusiasts: A Comparative Study on Emotionally Supportive Role-Playing
Lanlan Qiu, Xiao Pu, Yeqi Feng, Tianxing He

TL;DR
This study evaluates how well large language models can perform emotionally supportive role-playing with anime characters, comparing their performance to Chinese anime enthusiasts, and introduces a new dataset for this purpose.
Contribution
We created the first ESRP dataset with human and LLM dialogues on anime characters, and systematically evaluated LLMs' performance in emotional support and role-playing.
Findings
LLMs outperform humans in role-playing and emotional support.
Humans have higher response diversity than LLMs.
The dataset and evaluation system facilitate future research in ESRP.
Abstract
Large Language Models (LLMs) have demonstrated impressive capabilities in role-playing conversations and providing emotional support as separate research directions. However, there remains a significant research gap in combining these capabilities to enable emotionally supportive interactions with virtual characters. To address this research gap, we focus on anime characters as a case study because of their well-defined personalities and large fan bases. This choice enables us to effectively evaluate how well LLMs can provide emotional support while maintaining specific character traits. We introduce ChatAnime, the first Emotionally Supportive Role-Playing (ESRP) dataset. We first thoughtfully select 20 top-tier characters from popular anime communities and design 60 emotion-centric real-world scenario questions. Then, we execute a nationwide selection process to identify 40 Chinese…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsImpact of Technology on Adolescents
