PlatoLTL: Learning to Generalize Across Symbols in LTL Instructions for Multi-Task RL
Jacques Cloete, Mathias Jackermeier, Ioannis Havoutis, Alessandro Abate

TL;DR
PlatoLTL introduces a method for multi-task reinforcement learning that enables policies to generalize to unseen propositions in LTL instructions by modeling propositions as parameterized atomic predicates.
Contribution
It presents a novel architecture that embeds and composes parameterized propositions, allowing zero-shot generalization across unseen symbols in LTL-guided RL tasks.
Findings
Achieves zero-shot generalization to unseen propositions in various environments.
Models shared structure across related propositions for better generalization.
Demonstrates effectiveness in challenging multi-task RL environments.
Abstract
A central challenge in multi-task reinforcement learning (RL) is to train generalist policies capable of performing tasks not seen during training. To facilitate such generalization, linear temporal logic (LTL) has emerged as a powerful formalism for specifying structured, temporally extended tasks to RL agents. While existing approaches to LTL-guided multi-task RL demonstrate generalization across LTL specifications, they are unable to generalize to unseen vocabularies of propositions (or "symbols"), which describe high-level events in LTL. We present PlatoLTL, a novel approach that enables policies to zero-shot generalize not only compositionally across LTL structures, but also parametrically across propositions. We model propositions as parameterized instances of atomic predicates, allowing policies to learn shared structure across related propositions. We propose a novel…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
