Guaranteed Discovery of Control-Endogenous Latent States with Multi-Step Inverse Models
Alex Lamb, Riashat Islam, Yonathan Efroni, Aniket Didolkar, Dipendra, Misra, Dylan Foster, Lekan Molu, Rajan Chari, Akshay Krishnamurthy, John, Langford

TL;DR
This paper introduces AC-State, an algorithm that guarantees the discovery of minimal, control-relevant latent states from high-dimensional sensory data, enabling effective control and exploration without rewards or demonstrations.
Contribution
The paper proposes a theoretically guaranteed multi-step inverse model with an information bottleneck to identify control-endogenous latent states, advancing state discovery in complex environments.
Findings
Successfully localizes a robot arm with distractions
Explores mazes with multiple agents without rewards
Navigates in complex house simulations effectively
Abstract
In many sequential decision-making tasks, the agent is not able to model the full complexity of the world, which consists of multitudes of relevant and irrelevant information. For example, a person walking along a city street who tries to model all aspects of the world would quickly be overwhelmed by a multitude of shops, cars, and people moving in and out of view, each following their own complex and inscrutable dynamics. Is it possible to turn the agent's firehose of sensory information into a minimal latent state that is both necessary and sufficient for an agent to successfully act in the world? We formulate this question concretely, and propose the Agent Control-Endogenous State Discovery algorithm (AC-State), which has theoretical guarantees and is practically demonstrated to discover the minimal control-endogenous latent state which contains all of the information necessary for…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTime Series Analysis and Forecasting · Reinforcement Learning in Robotics · AI-based Problem Solving and Planning
