Unveiling Options with Neural Decomposition

Mahdi Alikhasi; Levi H. S. Lelis

arXiv:2410.11262·cs.LG·October 16, 2024

Unveiling Options with Neural Decomposition

Mahdi Alikhasi, Levi H. S. Lelis

PDF

Open Access

TL;DR

This paper presents a neural decomposition method for reinforcement learning policies, enabling the extraction of reusable options from neural networks to improve generalization and accelerate learning across related tasks.

Contribution

It introduces a novel algorithm that decomposes neural network policies into sub-policies as options, facilitating transfer and reuse in reinforcement learning.

Findings

01

Successfully identifies useful options in grid-world domains

02

Accelerates learning on related tasks

03

Demonstrates effectiveness in complex exploration scenarios

Abstract

In reinforcement learning, agents often learn policies for specific tasks without the ability to generalize this knowledge to related tasks. This paper introduces an algorithm that attempts to address this limitation by decomposing neural networks encoding policies for Markov Decision Processes into reusable sub-policies, which are used to synthesize temporally extended actions, or options. We consider neural networks with piecewise linear activation functions, so that they can be mapped to an equivalent tree that is similar to oblique decision trees. Since each node in such a tree serves as a function of the input of the tree, each sub-tree is a sub-policy of the main policy. We turn each of these sub-policies into options by wrapping it with while-loops of varied number of iterations. Given the large number of options, we propose a selection mechanism based on minimizing the Levin…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsCapital Investment and Risk Analysis