HumanLLM: Towards Personalized Understanding and Simulation of Human Nature
Yuxuan Lei, Tianfu Wang, Jianxun Lian, Zhengyu Hu, Defu Lian, Xing Xie

TL;DR
HumanLLM is a personalized foundation model trained on a large-scale user behavior dataset, enabling more accurate simulation and understanding of individual human cognition and social behaviors, with implications for social science and business.
Contribution
It introduces HumanLLM, a novel model trained on a curated large-scale user log dataset, for personalized human behavior prediction and social simulation.
Findings
Outperforms base models in predicting user actions and thoughts
More accurately mimics user writing styles and preferences
Shows improved generalization on social intelligence benchmarks
Abstract
Motivated by the remarkable progress of large language models (LLMs) in objective tasks like mathematics and coding, there is growing interest in their potential to simulate human behavior--a capability with profound implications for transforming social science research and customer-centric business insights. However, LLMs often lack a nuanced understanding of human cognition and behavior, limiting their effectiveness in social simulation and personalized applications. We posit that this limitation stems from a fundamental misalignment: standard LLM pretraining on vast, uncontextualized web data does not capture the continuous, situated context of an individual's decisions, thoughts, and behaviors over time. To bridge this gap, we introduce HumanLLM, a foundation model designed for personalized understanding and simulation of individuals. We first construct the Cognitive Genome Dataset,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMental Health via Writing · Sentiment Analysis and Opinion Mining · Topic Modeling
