Guaranteed Discovery of Control-Endogenous Latent States with Multi-Step Inverse Models

Alex Lamb; Riashat Islam; Yonathan Efroni; Aniket Didolkar; Dipendra Misra; Dylan Foster; Lekan Molu; Rajan Chari; Akshay Krishnamurthy; John Langford

arXiv:2207.08229 · cs.LG · December 29, 2022 · 1 cites

TL;DR

This paper introduces AC-State, an algorithm that guarantees the discovery of minimal, control-relevant latent states from high-dimensional sensory data, enabling effective control and exploration without rewards or demonstrations.

Contribution

The paper proposes a theoretically guaranteed multi-step inverse model with an information bottleneck to identify control-endogenous latent states, advancing state discovery in complex environments.
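As a rough illustration of what a multi-step inverse objective measures, the sketch below scores an encoder by how well the first action a_t can be predicted from the latent pair (z_t, z_{t+k}) and the gap k. Everything in it is an assumption made for illustration, not the authors' implementation: the ring-world environment, the names `multistep_inverse_loss`, `positions`, and `noise`, and the count-based predictor that stands in for a learned classifier. The information bottleneck is not implemented here; in spirit it would prefer the smallest latent space among encoders that reach the lowest loss.

```python
import numpy as np
from collections import Counter, defaultdict

def multistep_inverse_loss(latents, actions, max_k):
    """Empirical cross-entropy of predicting a_t from (z_t, z_{t+k}, k).

    Hypothetical helper: a count-based (tabular) predictor stands in for a
    learned classifier. `latents` holds z_0..z_T, one more than `actions`.
    """
    counts = defaultdict(Counter)
    for t in range(len(actions)):
        for k in range(1, max_k + 1):
            if t + k < len(latents):
                counts[(latents[t], latents[t + k], k)][actions[t]] += 1
    total, nll = 0, 0.0
    for action_counts in counts.values():
        n = sum(action_counts.values())
        for c in action_counts.values():
            nll -= c * np.log(c / n)  # negative log-likelihood under empirical frequencies
            total += c
    return nll / total

# Toy environment (assumed): an agent on a ring of 5 cells plus an independent noise bit.
rng = np.random.default_rng(0)
T, n_pos = 2000, 5
actions = rng.integers(0, 2, size=T)               # 0 = step left, 1 = step right
steps = np.where(actions == 1, 1, -1)
positions = np.concatenate([[0], np.cumsum(steps)]) % n_pos  # controllable part
noise = rng.integers(0, 2, size=T + 1)             # distractor, unaffected by actions

# Two candidate encoders: one keeps the agent's position, one keeps only the distractor.
loss_endogenous = multistep_inverse_loss(positions.tolist(), actions.tolist(), max_k=2)
loss_exogenous = multistep_inverse_loss(noise.tolist(), actions.tolist(), max_k=2)
print(loss_endogenous, loss_exogenous)
```

In this toy setting the position-based latent yields a lower loss than the distractor-based one, because only the position carries information about which action was taken.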

Findings

01 Successfully localizes a robot arm with distractions

02 Explores mazes with multiple agents without rewards

03 Navigates in complex house simulations effectively

Abstract

In many sequential decision-making tasks, the agent is not able to model the full complexity of the world, which consists of multitudes of relevant and irrelevant information. For example, a person walking along a city street who tries to model all aspects of the world would quickly be overwhelmed by a multitude of shops, cars, and people moving in and out of view, each following their own complex and inscrutable dynamics. Is it possible to turn the agent's firehose of sensory information into a minimal latent state that is both necessary and sufficient for an agent to successfully act in the world? We formulate this question concretely, and propose the Agent Control-Endogenous State Discovery algorithm (AC-State), which has theoretical guarantees and is practically demonstrated to discover the minimal control-endogenous latent state which contains all of the information necessary for…

Taxonomy

Topics: Time Series Analysis and Forecasting · Reinforcement Learning in Robotics · AI-based Problem Solving and Planning