Action-Sufficient State Representation Learning for Control with   Structural Constraints

Biwei Huang; Chaochao Lu; Liu Leqi; Jos\'e Miguel Hern\'andez-Lobato,; Clark Glymour; Bernhard Sch\"olkopf; Kun Zhang

arXiv:2110.05721·cs.LG·June 22, 2022·5 cites

Action-Sufficient State Representation Learning for Control with Structural Constraints

Biwei Huang, Chaochao Lu, Liu Leqi, Jos\'e Miguel Hern\'andez-Lobato,, Clark Glymour, Bernhard Sch\"olkopf, Kun Zhang

PDF

Open Access

TL;DR

This paper introduces Action-Sufficient State Representations (ASRs) for partially observable environments, leveraging structural constraints and variational auto-encoders to improve decision-making efficiency and generalization.

Contribution

It proposes a novel framework for learning minimal, sufficient state representations using structural models and variational auto-encoders, enhancing policy learning in complex environments.

Findings

01

ASRs improve policy learning in CarRacing and VizDoom.

02

Estimated models and ASRs enable learning from imagined outcomes.

03

ASRs enhance sample efficiency in reinforcement learning.

Abstract

Perceived signals in real-world scenarios are usually high-dimensional and noisy, and finding and using their representation that contains essential and sufficient information required by downstream decision-making tasks will help improve computational efficiency and generalization ability in the tasks. In this paper, we focus on partially observable environments and propose to learn a minimal set of state representations that capture sufficient information for decision-making, termed \textit{Action-Sufficient state Representations} (ASRs). We build a generative environment model for the structural relationships among variables in the system and present a principled way to characterize ASRs based on structural constraints and the goal of maximizing cumulative reward in policy learning. We then develop a structured sequential Variational Auto-Encoder to estimate the environment model and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Explainable Artificial Intelligence (XAI) · Data Stream Mining Techniques