Investigating social alignment via mirroring in a system of interacting language models
Harvey McGuinness, Tianyu Wang, Carey E. Priebe, Hayden Helm

TL;DR
This paper introduces a computational framework to study how mirroring behaviors influence social alignment in multi-agent language model systems, revealing that communication range and mirroring rate significantly affect system dynamics.
Contribution
It presents a scalable simulation framework for analyzing social alignment via mirroring in interacting language models, bridging computational modeling and social dynamics.
Findings
System behavior is strongly influenced by communication range.
Increased mirroring rates exacerbate alignment effects.
Simulation results relate to human social dynamics.
Abstract
Alignment is a social phenomenon wherein individuals share a common goal or perspective. Mirroring, or mimicking the behaviors and opinions of another individual, is one mechanism by which individuals can become aligned. Large scale investigations of the effect of mirroring on alignment have been limited due to the scalability of traditional experimental designs in sociology. In this paper, we introduce a simple computational framework that enables studying the effect of mirroring behavior on alignment in multi-agent systems. We simulate systems of interacting large language models in this framework and characterize overall system behavior and alignment with quantitative measures of agent dynamics. We find that system behavior is strongly influenced by the range of communication of each agent and that these effects are exacerbated by increased rates of mirroring. We discuss the observed…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsLanguage and cultural evolution
