InterviewSim: A Scalable Framework for Interview-Grounded Personality Simulation
Yu Li, Pranav Narayanan Venkit, Yada Pruksachatkun, Chien-Sheng Wu

TL;DR
This paper introduces InterviewSim, a large-scale interview-grounded evaluation framework for personality simulation using extensive real interview data, enabling better assessment of personality models' authenticity and factual accuracy.
Contribution
It presents a novel large-scale dataset and multi-dimensional evaluation framework for assessing personality simulation grounded in real interview data, surpassing prior proxy-based methods.
Findings
Models grounded in real interview data outperform profile-based approaches.
Retrieval-augmented methods excel at capturing personality style.
Chronological methods better preserve factual knowledge.
Abstract
Simulating real personalities with large language models requires grounding generation in authentic personal data. Existing evaluation approaches rely on demographic surveys, personality questionnaires, or short AI-led interviews as proxies, but lack direct assessment against what individuals actually said. We address this gap with an interview-grounded evaluation framework for personality simulation at a large scale. We extract over 671,000 question-answer pairs from 23,000 verified interview transcripts across 1,000 public personalities, each with an average of 11.5 hours of interview content. We propose a multi-dimensional evaluation framework with four complementary metrics measuring content similarity, factual consistency, personality alignment, and factual knowledge retention. Through systematic comparison, we demonstrate that methods grounded in real interview data substantially…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPersonality Traits and Psychology · Topic Modeling · Computational and Text Analysis Methods
