Improving planning and MBRL with temporally-extended actions

Palash Chatterjee; Roni Khardon

arXiv:2505.15754·cs.LG·October 23, 2025

Improving planning and MBRL with temporally-extended actions

Palash Chatterjee, Roni Khardon

PDF

Open Access 1 Video

TL;DR

This paper introduces temporally-extended actions in reinforcement learning to directly control decision timescales, leading to faster planning, improved solutions, and enhanced model-based learning efficiency.

Contribution

It proposes controlling action durations as an optimization variable, integrating multi-armed bandits for automatic duration selection, and demonstrating significant improvements in planning and MBRL.

Findings

01

Faster planning and better solutions in various tasks.

02

Reduced model errors and training time in MBRL.

03

Ability to solve previously unsolvable problems.

Abstract

Continuous time systems are often modeled using discrete time dynamics but this requires a small simulation step to maintain accuracy. In turn, this requires a large planning horizon which leads to computationally demanding planning problems and reduced performance. Previous work in model-free reinforcement learning has partially addressed this issue using action repeats where a policy is learned to determine a discrete action duration. Instead we propose to control the continuous decision timescale directly by using temporally-extended actions and letting the planner treat the duration of the action as an additional optimization variable along with the standard action variables. This additional structure has multiple advantages. It speeds up simulation time of trajectories and, importantly, it allows for deep horizon search in terms of primitive actions while using a shallow search…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Improving planning and MBRL with temporally-extended actions· slideslive

Taxonomy

TopicsAI-based Problem Solving and Planning · Advanced Database Systems and Queries · Logic, Reasoning, and Knowledge