RoleMRC: A Fine-Grained Composite Benchmark for Role-Playing and Instruction-Following
Junru Lu, Jiazheng Li, Guodong Shen, Lin Gui, Siyu An, Yulan He, Di, Yin, Xing Sun

TL;DR
RoleMRC is a comprehensive benchmark designed to evaluate and enhance large language models' abilities in fine-grained role-playing and instruction-following across diverse, complex multi-turn scenarios.
Contribution
We introduce RoleMRC, a novel benchmark with extensive role profiles and instructions, enabling detailed assessment and improvement of LLMs' role-playing and instruction-following skills.
Findings
Models fine-tuned on RoleMRC show improved instruction-following.
RoleMRC enhances models' ability to handle complex, multi-turn role-playing tasks.
Cross-evaluation confirms the effectiveness of RoleMRC in boosting LLM capabilities.
Abstract
Role-playing is important for Large Language Models (LLMs) to follow diverse instructions while maintaining role identity and the role's pre-defined ability limits. Existing role-playing datasets mostly contribute to controlling role style and knowledge boundaries, but overlook role-playing in instruction-following scenarios. We introduce a fine-grained role-playing and instruction-following composite benchmark, named RoleMRC, including: (1) Multi-turn dialogues between ideal roles and humans, including free chats or discussions upon given passages; (2) Role-playing machine reading comprehension, involving response, refusal, and attempts according to passage answerability and role ability; (3) More complex scenarios with nested, multi-turn and prioritized instructions. The final RoleMRC features a 10.2k role profile meta-pool, 37.9k well-synthesized role-playing instructions, and 1.4k…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsTeaching and Learning Programming · Gender and Technology in Education
