Reinforcement Learning Control of a Biomechanical Model of the Upper Extremity
Florian Fischer, Miroslav Bachinski, Markus Klar, Arthur Fleig, J\"org, M\"uller

TL;DR
This study demonstrates that simple motor noise assumptions and movement time minimization can predict complex human reaching movements in a detailed biomechanical upper extremity model using reinforcement learning.
Contribution
It extends previous simplified models by applying reinforcement learning to a full skeletal human arm, reproducing key movement phenomena.
Findings
Reproduces Fitts' Law and the 2/3 Power Law
Validates simple noise and optimization assumptions for complex models
Shows reinforcement learning can control detailed biomechanical systems
Abstract
Among the infinite number of possible movements that can be produced, humans are commonly assumed to choose those that optimize criteria such as minimizing movement time, subject to certain movement constraints like signal-dependent and constant motor noise. While so far these assumptions have only been evaluated for simplified point-mass or planar models, we address the question of whether they can predict reaching movements in a full skeletal model of the human upper extremity. We learn a control policy using a motor babbling approach as implemented in reinforcement learning, using aimed movements of the tip of the right index finger towards randomly placed 3D targets of varying size. We use a state-of-the-art biomechanical model, which includes seven actuated degrees of freedom. To deal with the curse of dimensionality, we use a simplified second-order muscle model, acting at each…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
