Agent Alignment in Evolving Social Norms

Shimin Li; Tianxiang Sun; Qinyuan Cheng; Xipeng Qiu

arXiv:2401.04620·cs.CL·February 21, 2024·2 cites

Agent Alignment in Evolving Social Norms

Shimin Li, Tianxiang Sun, Qinyuan Cheng, Xipeng Qiu

PDF

Open Access

TL;DR

This paper introduces EvolutionaryAgent, an evolutionary framework for aligning AI agents with evolving social norms, emphasizing adaptation through environmental feedback and survival principles to improve alignment over time.

Contribution

It proposes a novel evolutionary approach to agent alignment that accounts for dynamic social norms and environmental feedback, moving beyond passive alignment methods.

Findings

01

EvolutionaryAgent improves alignment with social norms over time.

02

The approach maintains general task proficiency during adaptation.

03

Effective across multiple LLM-based agents and environments.

Abstract

Agents based on Large Language Models (LLMs) are increasingly permeating various domains of human production and life, highlighting the importance of aligning them with human values. The current alignment of AI systems primarily focuses on passively aligning LLMs through human intervention. However, agents possess characteristics like receiving environmental feedback and self-evolution, rendering the LLM alignment methods inadequate. In response, we propose an evolutionary framework for agent evolution and alignment, named EvolutionaryAgent, which transforms agent alignment into a process of evolution and selection under the principle of survival of the fittest. In an environment where social norms continuously evolve, agents better adapted to the current social norms will have a higher probability of survival and proliferation, while those inadequately aligned dwindle over time.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling

MethodsALIGN