Goal Kernel Planning: Linearly-Solvable Non-Markovian Policies for Logical Tasks with Goal-Conditioned Options
Thomas J. Ringstrom, Mohammadhosein Hasanbeig, Alessandro Abate

TL;DR
This paper introduces LS-GKDP, a framework combining linearly-solvable MDPs and options to efficiently solve complex logical tasks with non-Markovian policies, enabling transfer and reuse across tasks.
Contribution
The paper presents LS-GKDP, a novel compositional approach that extends LMDPs to handle logical, goal-conditioned options with ordering constraints, facilitating efficient hierarchical planning.
Findings
Enables efficient optimization of meta-policies in reduced-dimensional spaces.
Supports zero-shot transfer of solutions across a large space of tasks.
Decomposes complex tasks into goal-condition options with goal kernels.
Abstract
In the domain of hierarchical planning, compositionality, abstraction, and task transfer are crucial for designing algorithms that can efficiently solve a variety of problems with maximal representational reuse. Many real-world problems require non-Markovian policies to handle complex structured tasks with logical conditions, often leading to prohibitively large state representations; this requires efficient methods for breaking these problems down and reusing structure between tasks. To this end, we introduce a compositional framework called Linearly-Solvable Goal Kernel Dynamic Programming (LS-GKDP) to address the complexity of solving non-Markovian Boolean sub-goal tasks with ordering constraints. LS-GKDP combines the Linearly-Solvable Markov Decision Process (LMDP) formalism with the Options Framework of Reinforcement Learning. LMDPs can be efficiently solved as a principal…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Adversarial Robustness in Machine Learning · Optimization and Search Problems
