Can LLMs Infer Conversational Agent Users' Personality Traits from Chat History?
Derya C\"ogendez, Verena Zimmermann, No\'e Zufferey

TL;DR
This study investigates the privacy risks of inferring users' personality traits from ChatGPT interactions, demonstrating that models can predict traits with above-chance accuracy, thus highlighting potential privacy concerns.
Contribution
It provides empirical evidence that LLM-based conversational agents can reveal personality traits from chat logs, quantifying the associated privacy risks.
Findings
Models achieved better-than-random accuracy in trait inference.
Extraversion prediction improved by +44% over baseline.
Different interaction types pose varying levels of privacy risk.
Abstract
Sensitive information, such as knowledge about an individual's personality, can be can be misused to influence behavior (e.g., via personalized messaging). To assess to what extent an individual's personality can be inferred from user interactions with LLM-based conversational agents (CAs), we analyze and quantify related privacy risks of using CAs. We collected actual ChatGPT logs from N=668 participants, containing 62,090 individual chats, and report statistics about the different types of shared data and use cases. We fine-tuned RoBERTa-base text classification models to infer personality traits from CA interactions. The findings show that these models achieve trait inference with accuracy (ternary classification) better than random in multiple cases. For example, for extraversion, accuracy improves by +44% relative to the baseline on interactions for relationships and personal…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
