Hierarchical Subtask Discovery With Non-Negative Matrix Factorization
Adam C. Earle, Andrew M. Saxe, Benjamin Rosman

TL;DR
This paper introduces a novel subtask discovery algorithm using non-negative matrix factorization within the MLMDP framework, enabling flexible hierarchical decomposition of tasks in reinforcement learning domains.
Contribution
It presents a new method for automatically discovering hierarchical subtasks based on low-rank approximations, improving flexibility and interpretability in hierarchical reinforcement learning.
Findings
Learns intuitive task decompositions across various domains.
Discovers different hierarchical structures depending on task ensembles.
Supports iterative deepening of hierarchical decompositions.
Abstract
Hierarchical reinforcement learning methods offer a powerful means of planning flexible behavior in complicated domains. However, learning an appropriate hierarchical decomposition of a domain into subtasks remains a substantial challenge. We present a novel algorithm for subtask discovery, based on the recently introduced multitask linearly-solvable Markov decision process (MLMDP) framework. The MLMDP can perform never-before-seen tasks by representing them as a linear combination of a previously learned basis set of tasks. In this setting, the subtask discovery problem can naturally be posed as finding an optimal low-rank approximation of the set of tasks the agent will face in a domain. We use non-negative matrix factorization to discover this minimal basis set of tasks, and show that the technique learns intuitive decompositions in a variety of domains. Our method has several…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Data Classification · Web Data Mining and Analysis · Video Analysis and Summarization
