Human Values Matter: Investigating How Misalignment Shapes Collective Behaviors in LLM Agent Communities

Xiangxu Zhang; Jiamin Wang; Qinlin Zhao; Hanze Guo; Linzhuo Li; Jing Yao; Xiao Zhou; Xiaoyuan Yi; Xing Xie

arXiv:2604.05339·cs.CL·April 8, 2026

Human Values Matter: Investigating How Misalignment Shapes Collective Behaviors in LLM Agent Communities

Xiangxu Zhang, Jiamin Wang, Qinlin Zhao, Hanze Guo, Linzhuo Li, Jing Yao, Xiao Zhou, Xiaoyuan Yi, Xing Xie

PDF

TL;DR

This paper introduces CIVA, a social science-grounded multi-agent environment, to study how misalignment with human values affects collective behaviors and failures in LLM agent communities.

Contribution

It presents a systematic framework and experimental analysis revealing how value misalignment influences community dynamics, failures, and emergent behaviors in LLM multi-agent systems.

Findings

01

Identification of critical values shaping community dynamics

02

Detection of macro-level system failures like collapse

03

Observation of micro-level emergent behaviors such as deception

Abstract

As LLMs become increasingly integrated into human society, evaluating their orientations on human values from social science has drawn growing attention. Nevertheless, it is still unclear why human values matter for LLMs, especially in LLM-based multi-agent systems, where group-level failures may accumulate from individually misaligned actions. We ask whether misalignment with human values alters the collective behavior of LLM agents and what changes it induces? In this work, we introduce CIVA, a controlled multi-agent environment grounded in social science theories, where LLM agents form a community and autonomously communicate, explore, and compete for resources, enabling systematic manipulation of value prevalence and behavioral analysis. Through comprehensive simulation experiments, we reveal three key findings. (1) We identify several structurally critical values that substantially…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.