Identifying Latent Actions and Dynamics from Offline Data via Demonstrator Diversity
Felix Schur

TL;DR
This paper demonstrates that diverse demonstrator identities in offline data enable the recovery of latent actions and environment dynamics, even without observed actions, under certain conditions.
Contribution
It introduces a theoretical framework showing how demonstrator diversity ensures identifiability of latent actions and dynamics from offline trajectories.
Findings
Identifiability of latent actions and dynamics is achievable with demonstrator diversity.
A matrix factorization approach is used to recover latent transitions from observable data.
Continuity assumptions allow extending results to continuous observation spaces.
Abstract
Can latent actions and environment dynamics be recovered from offline trajectories when actions are never observed? We study this question in a setting where trajectories are action-free but tagged with demonstrator identity. We assume that each demonstrator follows a distinct policy, while the environment dynamics are shared across demonstrators and identity affects the next observation only through the chosen action. Under these assumptions, the conditional next-observation distribution is a mixture of latent action-conditioned transition kernels with demonstrator-specific mixing weights. We show that this induces, for each state, a column-stochastic nonnegative matrix factorization of the observable conditional distribution. Using sufficiently scattered policy diversity and rank conditions, we prove that the latent transitions and demonstrator policies are…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Algorithms · Generative Adversarial Networks and Image Synthesis · Markov Chains and Monte Carlo Methods
