A Learning Algorithm That Attains the Human Optimum in a Repeated   Human-Machine Interaction Game

Jason T. Isa; Lillian J. Ratliff; Samuel A. Burden

arXiv:2501.08626·cs.GT·January 16, 2025

A Learning Algorithm That Attains the Human Optimum in a Repeated Human-Machine Interaction Game

Jason T. Isa, Lillian J. Ratliff, Samuel A. Burden

PDF

Open Access

TL;DR

This paper introduces a game-theoretic learning algorithm that enables systems to learn and minimize human-specific cost functions through observation alone, avoiding complex inverse problem solutions.

Contribution

The paper presents a novel game-theoretic algorithm that learns human cost functions solely from observed actions, outperforming traditional inverse problem methods.

Findings

01

Consistent convergence to human-defined cost minima in experiments.

02

Effective in scalar and multidimensional game settings.

03

Validated through extensive human subject testing.

Abstract

When humans interact with learning-based control systems, a common goal is to minimize a cost function known only to the human. For instance, an exoskeleton may adapt its assistance in an effort to minimize the human's metabolic cost-of-transport. Conventional approaches to synthesizing the learning algorithm solve an inverse problem to infer the human's cost. However, these problems can be ill-posed, hard to solve, or sensitive to problem data. Here we show a game-theoretic learning algorithm that works solely by observing human actions to find the cost minimum, avoiding the need to solve an inverse problem. We evaluate the performance of our algorithm in an extensive set of human subjects experiments, demonstrating consistent convergence to the minimum of a prescribed human cost function in scalar and multidimensional instantiations of the game. We conclude by outlining future…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Intelligent Tutoring Systems and Adaptive Learning · Robotic Path Planning Algorithms

MethodsSparse Evolutionary Training