Optimal Selective Attention in Reactive Agents

Roy Fox; Naftali Tishby

arXiv:1512.08575·cs.LG·December 31, 2015

TL;DR

This paper introduces a minimum-information principle for selective attention in reactive agents in POMDPs, so that a policy attends to only part of the information available in each observation. It also examines period-doubling bifurcations that arise in the optimization process and their effect on policy stability.

Contribution

The paper proposes a minimum-information framework for attention in reactive policies, reduces general POMDP control to reactive control with complex observations, and reports bifurcation phenomena in the resulting optimization.

Findings

1. Identification of period-doubling bifurcations in optimal attention policies
2. Reduction of POMDP control to reactive control with complex observations
3. Insights into stability and chaos in optimal control policies

Abstract

In POMDPs, information about the hidden state, delivered through observations, is both valuable to the agent, allowing it to base its actions on better-informed internal states, and a "curse", exploding the size and diversity of the internal state space. One attempt to deal with this is to focus on reactive policies, which base their actions only on the most recent observation. However, even reactive policies can be demanding on resources, and agents need to pay selective attention to only some of the information available to them in observations. In this report we present the minimum-information principle for selective attention in reactive agents. We further motivate this approach by reducing the general problem of optimal control in POMDPs to reactive control with complex observations. Lastly, we explore a newly discovered phenomenon of this optimization process - period doubling…
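To make the idea of selective attention in a reactive policy concrete, the following is a minimal sketch of one common way to formalize such a trade-off. It is an assumption made for illustration, not a formulation taken from the text above. It treats a single decision step only: a policy pi(a|o) is chosen to balance expected reward against the mutual information between observation and action, using a Blahut-Arimoto-style alternating update. The names `min_info_policy`, `information`, the trade-off parameter `beta`, and the reward table are all hypothetical.

```python
import math

def min_info_policy(p_obs, reward, beta, iters=200):
    """Single-step sketch (illustrative assumption, not the paper's sequential setting).

    Alternates between pi(a|o) proportional to q(a) * exp(beta * reward[o][a])
    and the action marginal q(a) = sum_o p_obs[o] * pi(a|o).
    """
    n_obs, n_act = len(reward), len(reward[0])
    q = [1.0 / n_act] * n_act                      # marginal over actions
    pi = [[1.0 / n_act] * n_act for _ in range(n_obs)]
    for _ in range(iters):
        for o in range(n_obs):
            m = max(reward[o])                     # subtract max for numerical stability
            w = [q[a] * math.exp(beta * (reward[o][a] - m)) for a in range(n_act)]
            z = sum(w)
            pi[o] = [x / z for x in w]
        q = [sum(p_obs[o] * pi[o][a] for o in range(n_obs)) for a in range(n_act)]
    return pi, q

def information(p_obs, pi, q):
    """Mutual information I(O; A) in nats between observation and action."""
    total = 0.0
    for o, po in enumerate(p_obs):
        for a, pa in enumerate(pi[o]):
            if po > 0 and pa > 0:
                total += po * pa * math.log(pa / q[a])
    return total

# Hypothetical toy problem: two observations, two actions, reward 1 for matching.
p_obs = [0.5, 0.5]
reward = [[1.0, 0.0], [0.0, 1.0]]
for beta in (0.0, 1.0, 20.0):
    pi, q = min_info_policy(p_obs, reward, beta)
    print(beta, round(information(p_obs, pi, q), 4))
```

In this toy setting, `beta = 0` yields a policy that ignores the observation entirely, while large `beta` yields a policy that uses nearly all of the one bit (ln 2 nats) the observation carries. Intermediate values attend to the observation only partially.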

Taxonomy

Topics: Game Theory and Applications · Reinforcement Learning in Robotics · Advanced Bandit Algorithms Research