Human-Like Navigation Behavior: A Statistical Evaluation Framework

Ian Colbert; Mehdi Saeedi

arXiv:2203.05965·stat.ML·March 14, 2022·1 cites

Human-Like Navigation Behavior: A Statistical Evaluation Framework

Ian Colbert, Mehdi Saeedi

PDF

Open Access

TL;DR

This paper introduces a statistical framework using a non-parametric two-sample hypothesis test to evaluate how human-like artificial agents' navigation behaviors are, moving beyond mere task proficiency.

Contribution

It presents a novel hypothesis testing method that quantitatively compares artificial agent behavior to human behavior, providing a more realistic measure of believability.

Findings

01

The $p$-value correlates with human judgment of human-likeness.

02

The method effectively measures behavioral similarity between humans and AI agents.

03

Proficiency alone does not indicate human-like behavior.

Abstract

Recent advancements in deep reinforcement learning have brought forth an impressive display of highly skilled artificial agents capable of complex intelligent behavior. In video games, these artificial agents are increasingly deployed as non-playable characters (NPCs) designed to enhance the experience of human players. However, while it has been shown that the convincing human-like behavior of NPCs leads to increased engagement in video games, the believability of an artificial agent's behavior is most often measured solely by its proficiency at a given task. Recent work has hinted that proficiency alone is not sufficient to discern human-like behavior. Motivated by this, we build a non-parametric two-sample hypothesis test designed to compare the behaviors of artificial agents to those of human players. We show that the resulting $p$ -value not only aligns with anonymous human judgment…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSports Analytics and Performance · Reinforcement Learning in Robotics