Action and Perception as Divergence Minimization
Danijar Hafner, Pedro A. Ortega, Jimmy Ba, Thomas Parr, Karl Friston,, Nicolas Heess

TL;DR
This paper introduces the Action Perception Divergence (APD) framework, categorizing objectives for embodied agents from narrow rewards to general information-maximizing goals, unifying many unsupervised learning approaches under a single principle.
Contribution
The paper proposes the APD framework, providing a unified perspective on diverse objectives for agents, linking reinforcement learning, representation learning, and intrinsic motivation.
Findings
APD categorizes objectives from narrow rewards to general information maximization.
Agents using APD principles can explore and adapt without explicit task rewards.
The framework unifies various unsupervised learning objectives under a common principle.
Abstract
To learn directed behaviors in complex environments, intelligent agents need to optimize objective functions. Various objectives are known for designing artificial agents, including task rewards and intrinsic motivation. However, it is unclear how the known objectives relate to each other, which objectives remain yet to be discovered, and which objectives better describe the behavior of humans. We introduce the Action Perception Divergence (APD), an approach for categorizing the space of possible objective functions for embodied agents. We show a spectrum that reaches from narrow to general objectives. While the narrow objectives correspond to domain-specific rewards as typical in reinforcement learning, the general objectives maximize information with the environment through latent variable models of input sequences. Intuitively, these agents use perception to align their beliefs with…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExplainable Artificial Intelligence (XAI) · Reinforcement Learning in Robotics · Embodied and Extended Cognition
