Hard Attention Control By Mutual Information Maximization
Himanshu Sahni, Charles Isbell

TL;DR
This paper introduces a method for controlling a hard attention window in artificial agents by maximizing mutual information, enabling effective partial observation and state prediction in partially observable environments.
Contribution
It presents a novel approach combining mutual information maximization with a dynamic memory architecture for attention control in partially observable tasks.
Findings
Effective full state prediction from partial observations.
Successful control of attention in reinforcement learning tasks.
Internal world model aids in decision making.
Abstract
Biological agents have adopted the principle of attention to limit the rate of incoming information from the environment. One question that arises is if an artificial agent has access to only a limited view of its surroundings, how can it control its attention to effectively solve tasks? We propose an approach for learning how to control a hard attention window by maximizing the mutual information between the environment state and the attention location at each step. The agent employs an internal world model to make predictions about its state and focuses attention towards where the predictions may be wrong. Attention is trained jointly with a dynamic memory architecture that stores partial observations and keeps track of the unobserved state. We demonstrate that our approach is effective in predicting the full state from a sequence of partial observations. We also show that the agent's…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Neural dynamics and brain function · Explainable Artificial Intelligence (XAI)
