TL;DR
HumanLLM introduces a framework that models human cognitive patterns to improve the authenticity of large language models' anthropomorphism, using a comprehensive dataset and evaluation methods.
Contribution
It constructs a large dataset of psychological patterns and scenarios, and develops dual-level checklists to evaluate multi-pattern human alignment in LLMs.
Findings
HumanLLM-8B outperforms larger models on multi-pattern dynamics
Strong correlation (r=0.90) with human alignment metrics
Holistic metrics conflate simulation accuracy with social desirability
Abstract
Large Language Models (LLMs) have demonstrated remarkable capabilities in reasoning and generation, serving as the foundation for advanced persona simulation and Role-Playing Language Agents (RPLAs). However, achieving authentic alignment with human cognitive and behavioral patterns remains a critical challenge for these agents. We present HumanLLM, a framework treating psychological patterns as interacting causal forces. We construct 244 patterns from 12,000 academic papers and synthesize 11,359 scenarios where 2-5 patterns reinforce, conflict, or modulate each other, with multi-turn conversations expressing inner thoughts, actions, and dialogue. Our dual-level checklists evaluate both individual pattern fidelity and emergent multi-pattern dynamics, achieving strong human alignment () while revealing that holistic metrics conflate simulation accuracy with social…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
