Multi-Timescale, Gradient Descent, Temporal Difference Learning with Linear Options
Peeyush Kumar, Doina Precup

TL;DR
This paper introduces an efficient, convergent algorithm for reinforcement learning that utilizes multi-timescale temporal abstractions and linear function approximation to improve planning and reduce decision time in complex environments.
Contribution
The paper presents a novel algorithm that combines multi-timescale options with linear function approximation, enabling efficient and convergent planning in large state spaces.
Findings
Reduces number of decision epochs needed to solve tasks.
Enables real-time planning with temporally abstract actions.
Demonstrates convergence under linear function approximation.
Abstract
Deliberating on large or continuous state spaces have been long standing challenges in reinforcement learning. Temporal Abstraction have somewhat made this possible, but efficiently planing using temporal abstraction still remains an issue. Moreover using spatial abstractions to learn policies for various situations at once while using temporal abstraction models is an open problem. We propose here an efficient algorithm which is convergent under linear function approximation while planning using temporally abstract actions. We show how this algorithm can be used along with randomly generated option models over multiple time scales to plan agents which need to act real time. Using these randomly generated option models over multiple time scales are shown to reduce number of decision epochs required to solve the given task, hence effectively reducing the time needed for deliberation.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Pose and Action Recognition · Model Reduction and Neural Networks · Advanced Vision and Imaging
