Subgoal Discovery Using a Free Energy Paradigm and State Aggregations

Amirhossein Mesbah; Reshad Hosseini; Seyed Pooya Shariatpanahi; Majid; Nili Ahmadabadi

arXiv:2412.16687·cs.LG·February 11, 2025

Subgoal Discovery Using a Free Energy Paradigm and State Aggregations

Amirhossein Mesbah, Reshad Hosseini, Seyed Pooya Shariatpanahi, Majid, Nili Ahmadabadi

PDF

Open Access

TL;DR

This paper introduces a novel subgoal discovery method in reinforcement learning using a free energy paradigm, which identifies unpredictable states for hierarchical task decomposition without prior knowledge, applicable even in stochastic environments.

Contribution

It proposes a new approach leveraging free energy to discover subgoals based on state unpredictability, enhancing hierarchical RL methods.

Findings

01

Effective subgoal discovery in navigation tasks

02

Robust performance in stochastic environments

03

No prior task knowledge required

Abstract

Reinforcement learning (RL) plays a major role in solving complex sequential decision-making tasks. Hierarchical and goal-conditioned RL are promising methods for dealing with two major problems in RL, namely sample inefficiency and difficulties in reward shaping. These methods tackle the mentioned problems by decomposing a task into simpler subtasks and temporally abstracting a task in the action space. One of the key components for task decomposition of these methods is subgoal discovery. We can use the subgoal states to define hierarchies of actions and also use them in decomposing complex tasks. Under the assumption that subgoal states are more unpredictable, we propose a free energy paradigm to discover them. This is achieved by using free energy to select between two spaces, the main space and an aggregation space. The $m o d e l c han g es$ from neighboring states to a given state…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications