Toward Human-AI Alignment in Large-Scale Multi-Player Games

Sugandha Sharma; Guy Davidson; Khimya Khetarpal; Anssi Kanervisto,; Udit Arora; Katja Hofmann; Ida Momennejad

arXiv:2402.03575·cs.AI·June 21, 2024·1 cites

Toward Human-AI Alignment in Large-Scale Multi-Player Games

Sugandha Sharma, Guy Davidson, Khimya Khetarpal, Anssi Kanervisto,, Udit Arora, Katja Hofmann, Ida Momennejad

PDF

Open Access

TL;DR

This paper introduces an interpretable framework for evaluating human-AI behavioral alignment in complex multiplayer games, analyzing extensive gameplay data and comparing AI and human behaviors in a high-dimensional behavioral space.

Contribution

It proposes a novel behavior manifold framework to interpret and compare human and AI gameplay behaviors at a high level, advancing human-AI alignment research in multiplayer gaming.

Findings

01

Humans show variability in fight-flight and explore-exploit behaviors.

02

AI agents tend to be more uniform and engage mainly in solo play.

03

Significant behavioral differences highlight the need for interpretable AI evaluation.

Abstract

Achieving human-AI alignment in complex multi-agent games is crucial for creating trustworthy AI agents that enhance gameplay. We propose a method to evaluate this alignment using an interpretable task-sets framework, focusing on high-level behavioral tasks instead of low-level policies. Our approach has three components. First, we analyze extensive human gameplay data from Xbox's Bleeding Edge (100K+ games), uncovering behavioral patterns in a complex task space. This task space serves as a basis set for a behavior manifold capturing interpretable axes: fight-flight, explore-exploit, and solo-multi-agent. Second, we train an AI agent to play Bleeding Edge using a Generative Pretrained Causal Transformer and measure its behavior. Third, we project human and AI gameplay to the proposed behavior manifold to compare and contrast. This allows us to interpret differences in policy as…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSimulation Techniques and Applications · Artificial Intelligence in Games · Reinforcement Learning in Robotics

MethodsAttention Is All You Need · Sparse Evolutionary Training · Residual Connection · Layer Normalization · Dense Connections · Position-Wise Feed-Forward Layer · Label Smoothing · Softmax · Absolute Position Encodings · Linear Layer