Action-Sufficient Goal Representations
Jinu Hyeon, Woobin Park, Hongjoon Ahn, and Taesup Moon

TL;DR
This paper introduces an information-theoretic framework for goal representations in hierarchical offline goal-conditioned reinforcement learning, emphasizing action sufficiency over value sufficiency to improve control performance.
Contribution
It defines action sufficiency, proves its distinction from value sufficiency, and shows that standard training naturally induces action-sufficient representations for better control.
Findings
Action-sufficient representations outperform value-sufficient ones in control tasks.
Standard log-loss training induces action-sufficient goal representations.
Empirical validation on benchmark demonstrates improved control success.
Abstract
Hierarchical policies in offline goal-conditioned reinforcement learning (GCRL) addresses long-horizon tasks by decomposing control into high-level subgoal planning and low-level action execution. A critical design choice in such architectures is the goal representation-the compressed encoding of goals that serves as the interface between these levels. Existing approaches commonly derive goal representations while learning value functions, implicitly assuming that preserving information sufficient for value estimation is adequate for optimal control. We show that this assumption can fail, even when the value estimation is exact, as such representations may collapse goal states that need to be differentiated for action learning. To address this, we introduce an information-theoretic framework that defines action sufficiency, a condition on goal representations necessary for optimal…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · AI-based Problem Solving and Planning · Robot Manipulation and Learning
