Toward Global Intent Inference for Human Motion by Inverse Reinforcement Learning
Sarmad Mehrdad, Maxime Sabbah, Vincent Bonnet, Ludovic Righetti

TL;DR
This study demonstrates that a single, time-varying cost function can accurately predict human reaching movements across different subjects and postures, suggesting a unified optimality principle in human motion.
Contribution
It introduces MO-IRL, a fast inverse reinforcement learning method that estimates time-varying cost weights, enabling practical modeling of human reaching behavior with a unified cost function.
Findings
Time-varying weights improve trajectory prediction by 27% RMSE reduction.
Joint-acceleration regulation is the dominant cost component.
A single cost function predicts trajectories across subjects and postures.
Abstract
This paper investigates whether a single, unified cost function can explain and predict human reaching movements, in contrast with existing approaches that rely on subject- or posture-specific optimization criteria. Using the Minimal Observation Inverse Reinforcement Learning (MO-IRL) algorithm, together with a seven-dimensional set of candidate cost terms, we efficiently estimate time-varying cost weights for a standard planar reaching task. MO-IRL provides orders-of-magnitude faster convergence than bilevel formulations, while using only a fraction of the available data, enabling the practical exploration of time-varying cost structures. Three levels of generality are evaluated: Subject-Dependent Posture-Dependent, Subject-Dependent Posture-Independent, and Subject-Independent Posture-Independent. Across all cases, time-varying weights substantially improve trajectory reconstruction,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMotor Control and Adaptation · Balance, Gait, and Falls Prevention · Muscle activation and electromyography studies
