Action and Perception as Divergence Minimization

Danijar Hafner; Pedro A. Ortega; Jimmy Ba; Thomas Parr; Karl Friston; Nicolas Heess

arXiv:2009.01791 · cs.AI · February 15, 2022 · 23 cites

TL;DR

This paper introduces the Action Perception Divergence (APD) framework, categorizing objectives for embodied agents from narrow rewards to general information-maximizing goals, unifying many unsupervised learning approaches under a single principle.

Contribution

The paper proposes the APD framework, providing a unified perspective on diverse objectives for agents, linking reinforcement learning, representation learning, and intrinsic motivation.

Findings

01. APD categorizes objectives from narrow rewards to general information maximization.

02. Agents using APD principles can explore and adapt without explicit task rewards.

03. The framework unifies various unsupervised learning objectives under a common principle.

Abstract

To learn directed behaviors in complex environments, intelligent agents need to optimize objective functions. Various objectives are known for designing artificial agents, including task rewards and intrinsic motivation. However, it is unclear how the known objectives relate to each other, which objectives remain yet to be discovered, and which objectives better describe the behavior of humans. We introduce the Action Perception Divergence (APD), an approach for categorizing the space of possible objective functions for embodied agents. We show a spectrum that reaches from narrow to general objectives. While the narrow objectives correspond to domain-specific rewards as typical in reinforcement learning, the general objectives maximize information with the environment through latent variable models of input sequences. Intuitively, these agents use perception to align their beliefs with…

Code & Models

Repositories

CatarauCorina/citai_reading_list

Taxonomy

Topics: Explainable Artificial Intelligence (XAI) · Reinforcement Learning in Robotics · Embodied and Extended Cognition