Multi-level Value Alignment in Agentic AI Systems: Survey and Perspectives

Wei Zeng; Hengshu Zhu; Chuan Qin; Han Wu; Yihang Cheng; Sirui Zhang; Xiaowei Jin; Yinuo Shen; Zhenxing Wang; Feimin Zhong; Hui Xiong

arXiv:2506.09656·cs.AI·August 8, 2025

Multi-level Value Alignment in Agentic AI Systems: Survey and Perspectives

Wei Zeng, Hengshu Zhu, Chuan Qin, Han Wu, Yihang Cheng, Sirui Zhang, Xiaowei Jin, Yinuo Shen, Zhenxing Wang, Feimin Zhong, Hui Xiong

PDF

TL;DR

This paper surveys multi-level value alignment in agentic AI systems, especially LLM-based multi-agent systems, examining principles, scenarios, methods, and coordination to address societal and systemic risks.

Contribution

It introduces a comprehensive multi-level framework for understanding and evaluating value alignment in agentic AI, integrating principles, applications, and methodologies.

Findings

01

Structured value principles across macro, meso, micro levels

02

Categorized application scenarios from general to specific

03

Mapped alignment methods to a tiered evaluation framework

Abstract

The ongoing evolution of AI paradigms has propelled AI research into the agentic AI stage. Consequently, the focus of research has shifted from single agents and simple applications towards multi-agent autonomous decision-making and task collaboration in complex environments. As Large Language Models (LLMs) advance, their applications become more diverse and complex, leading to increasing situational and systemic risks. This has brought significant attention to value alignment for agentic AI systems, which aims to ensure that an agent's goals, preferences, and behaviors align with human values and societal norms. Addressing socio-governance demands through a Multi-level Value framework, this study comprehensively reviews value alignment in LLM-based multi-agent systems as the representative archetype of agentic AI systems. Our survey systematically examines three interconnected…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.