Generalizing Multi-Step Inverse Models for Representation Learning to   Finite-Memory POMDPs

Lili Wu; Ben Evans; Riashat Islam; Raihan Seraj; Yonathan Efroni; Alex; Lamb

arXiv:2404.14552·cs.LG·April 24, 2024

Generalizing Multi-Step Inverse Models for Representation Learning to Finite-Memory POMDPs

Lili Wu, Ben Evans, Riashat Islam, Raihan Seraj, Yonathan Efroni, Alex, Lamb

PDF

Open Access

TL;DR

This paper extends inverse models to learn agent-centric state representations in high-dimensional, non-Markovian environments, providing theoretical insights and empirical evaluations of different algorithms.

Contribution

It introduces a generalized inverse model approach for state discovery in non-Markovian settings and offers both theoretical analysis and empirical validation.

Findings

01

Inverse models can be adapted for non-Markovian environments.

02

Past actions can improve or hinder state discovery depending on their use.

03

Empirical results demonstrate the effectiveness and limitations of proposed methods.

Abstract

Discovering an informative, or agent-centric, state representation that encodes only the relevant information while discarding the irrelevant is a key challenge towards scaling reinforcement learning algorithms and efficiently applying them to downstream tasks. Prior works studied this problem in high-dimensional Markovian environments, when the current observation may be a complex object but is sufficient to decode the informative state. In this work, we consider the problem of discovering the agent-centric state in the more challenging high-dimensional non-Markovian setting, when the state can be decoded from a sequence of past observations. We establish that generalized inverse models can be adapted for learning agent-centric state representation for this task. Our results include asymptotic theory in the deterministic dynamics setting as well as counter-examples for alternative…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning and Algorithms · Speech Recognition and Synthesis · Topic Modeling