Optimal Selective Attention in Reactive Agents
Roy Fox, Naftali Tishby

TL;DR
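This paper introduces a minimum-information principle for selective attention in reactive agents acting in POMDPs, under which the agent attends to only part of the information available in each observation. It also reduces general POMDP control to reactive control with complex observations and explores period-doubling bifurcations that arise in the optimization process.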
Contribution
The paper proposes a minimum-information framework for selective attention in reactive policies, reduces general POMDP control to reactive control with complex observations, and reports period-doubling bifurcations in the optimization process.
Findings
Identification of period-doubling bifurcations in optimal attention policies
Reduction of POMDP control to reactive control with complex observations
Insights into stability and chaos in optimal control policies
Abstract
In POMDPs, information about the hidden state, delivered through observations, is both valuable to the agent, allowing it to base its actions on better informed internal states, and a "curse", exploding the size and diversity of the internal state space. One attempt to deal with this is to focus on reactive policies, which base their actions only on the most recent observation. However, even reactive policies can be demanding on resources, and agents need to pay selective attention to only some of the information available to them in observations. In this report we present the minimum-information principle for selective attention in reactive agents. We further motivate this approach by reducing the general problem of optimal control in POMDPs to reactive control with complex observations. Lastly, we explore a newly discovered phenomenon of this optimization process: period doubling…
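To make the trade-off in the abstract concrete, here is a minimal Python sketch of a minimum-information reactive policy. It is a one-step simplification and not the paper's sequential algorithm. It assumes a fixed observation distribution and a known cost table, and it uses a Blahut-Arimoto-style iteration from rate-distortion theory as a stand-in. The function name, the parameter `beta`, and the example numbers are all hypothetical.

```python
import numpy as np


def min_info_reactive_policy(p_obs, cost, beta, iters=200):
    """One-step sketch: trade expected cost against I(observation; action).

    p_obs: shape (n_obs,), assumed fixed distribution over observations.
    cost:  shape (n_obs, n_act), assumed known cost of action a given observation o.
    beta:  hypothetical trade-off weight; larger beta spends more
           information to reduce expected cost.
    Returns (policy, info_bits, expected_cost).
    """
    n_obs, n_act = cost.shape
    q = np.full(n_act, 1.0 / n_act)  # marginal distribution over actions
    for _ in range(iters):
        # Policy update: pi(a|o) proportional to q(a) * exp(-beta * cost(o, a)).
        logits = np.log(np.maximum(q, 1e-300))[None, :] - beta * cost
        logits -= logits.max(axis=1, keepdims=True)  # numerical stability
        policy = np.exp(logits)
        policy /= policy.sum(axis=1, keepdims=True)
        # Marginal update: q(a) = sum_o p(o) * pi(a|o).
        q = p_obs @ policy
    # Mutual information between observation and action, in bits.
    ratio = np.where(policy > 0, policy / np.maximum(q, 1e-300), 1.0)
    info_bits = float(p_obs @ (policy * np.log2(ratio)).sum(axis=1))
    expected_cost = float(p_obs @ (policy * cost).sum(axis=1))
    return policy, info_bits, expected_cost


# Illustrative toy problem: two observations, two actions,
# and cost 0 when the action index matches the observation index.
p_obs = np.array([0.5, 0.5])
cost = 1.0 - np.eye(2)
for beta in (0.0, 20.0):
    _, info_bits, expected_cost = min_info_reactive_policy(p_obs, cost, beta)
    print(f"beta={beta}: info={info_bits:.3f} bits, cost={expected_cost:.3f}")
```

In this toy setting, `beta = 0` leaves the agent ignoring its observation entirely, while a large `beta` lets it attend to nearly the full observation. The sequential POMDP setting in the paper couples these choices across time steps, which this sketch does not attempt to model.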
Taxonomy
Topics: Game Theory and Applications · Reinforcement Learning in Robotics · Advanced Bandit Algorithms Research
