Individual Turing Test: A Case Study of LLM-based Simulation Using Longitudinal Personal Data
Minghao Guo, Ziyi Ye, Wujiang Xu, Xi Zhu, Wenyue Hua, Dimitris N. Metaxas

TL;DR
This study evaluates LLMs' ability to simulate a specific individual using ten years of personal messaging data, revealing current methods' limitations and trade-offs between different approaches.
Contribution
It introduces the 'Individual Turing Test' for assessing personalized LLM simulation and compares various methods using longitudinal personal data.
Findings
Current LLM-based methods do not pass the Individual Turing Test.
Simulation performance is better on strangers than on the actual individual.
Fine-tuning enhances daily chat style simulation; retrieval and memory methods excel in opinions and preferences.
Abstract
Large Language Models (LLMs) have demonstrated remarkable human-like capabilities, yet their ability to replicate a specific individual remains under-explored. This paper presents a case study to investigate LLM-based individual simulation with a volunteer-contributed archive of private messaging history spanning over ten years. Based on the messaging data, we propose the "Individual Turing Test" to evaluate whether acquaintances of the volunteer can correctly identify which response in a multi-candidate pool most plausibly comes from the volunteer. We investigate prevalent LLM-based individual simulation approaches including: fine-tuning, retrieval-augmented generation (RAG), memory-based approach, and hybrid methods that integrate fine-tuning and RAG or memory. Empirical results show that current LLM-based simulation methods do not pass the Individual Turing Test, but they perform…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAI in Service Interactions · Authorship Attribution and Profiling · Language and cultural evolution
