The Coming Crisis of Multi-Agent Misalignment: AI Alignment Must Be a Dynamic and Social Process
Florian Carichon, Aditi Khandelwal, Marylou Fauchard, Golnoosh Farnadi

TL;DR
This paper argues that AI alignment in multi-agent systems is a complex, dynamic social process requiring new frameworks for evaluation and understanding to prevent misalignment with human values.
Contribution
It highlights the importance of viewing AI alignment as an interactive, social process and calls for new simulation tools and benchmarks for multi-agent environments.
Findings
Social structures influence alignment and value preservation.
Multi-agent interactions can lead to unintended misalignments.
Need for new evaluation frameworks for dynamic, social AI systems.
Abstract
This position paper states that AI Alignment in Multi-Agent Systems (MAS) should be considered a dynamic and interaction-dependent process that heavily depends on the social environment where agents are deployed, either collaborative, cooperative, or competitive. While AI alignment with human values and preferences remains a core challenge, the growing prevalence of MAS in real-world applications introduces a new dynamic that reshapes how agents pursue goals and interact to accomplish various tasks. As agents engage with one another, they must coordinate to accomplish both individual and collective goals. However, this complex social organization may unintentionally misalign some or all of these agents with human values or user preferences. Drawing on social sciences, we analyze how social structure can deter or shatter group and individual values. Based on these analyses, we call on…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMulti-Agent Systems and Negotiation · Ethics and Social Impacts of AI · Social Robot Interaction and HRI
MethodsMixing Adam and SGD
