Hierarchical Reinforcement Learning as a Model of Human Task Interleaving
Christoph Gebhardt, Antti Oulasvirta, Otmar Hilliges

TL;DR
This paper proposes a hierarchical reinforcement learning model that explains how humans decide when to switch between tasks, accounting for complex environments and outperforming simpler models in predicting human behavior.
Contribution
It introduces a hierarchical RL framework for modeling task interleaving, capturing adaptive switching in complex, uncertain environments, and demonstrating improved predictive accuracy over baseline models.
Findings
The model reproduces empirical effects of task interleaving.
It predicts individual behavior better than a myopic baseline.
Hierarchical RL effectively captures human task-switching strategies.
Abstract
How do people decide how long to continue in a task, when to switch, and to which other task? Understanding the mechanisms that underpin task interleaving is a long-standing goal in the cognitive sciences. Prior work suggests greedy heuristics and a policy maximizing the marginal rate of return. However, it is unclear how such a strategy would allow for adaptation to everyday environments that offer multiple tasks with complex switch costs and delayed rewards. Here we develop a hierarchical model of supervisory control driven by reinforcement learning (RL). The supervisory level learns to switch using task-specific approximate utility estimates, which are computed on the lower level. A hierarchically optimal value function decomposition can be learned from experience, even in conditions with multiple tasks and arbitrary and uncertain reward and cost structures. The model reproduces…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural and Behavioral Psychology Studies · Personal Information Management and User Behavior · Functional Brain Connectivity Studies
