Multi-focus Attention Network for Efficient Deep Reinforcement Learning
Jinyoung Choi, Beom-Jin Lee, and Byoung-Tak Zhang

TL;DR
This paper introduces MANet, a multi-focus attention network that mimics human perception by focusing on multiple entities in sensory input, leading to more efficient learning in deep reinforcement learning tasks.
Contribution
The paper proposes a novel multi-focus attention mechanism that segments sensory input into partial states and attends to them simultaneously, improving learning efficiency and performance.
Findings
MANet achieves higher scores with fewer experience samples.
It outperforms Deep Q-network and single attention models.
In multi-agent tasks, MANet learns 20% faster than existing models.
Abstract
Deep reinforcement learning (DRL) has shown incredible performance in learning various tasks to the human level. However, unlike human perception, current DRL models connect the entire low-level sensory input to the state-action values rather than exploiting the relationship between and among entities that constitute the sensory input. Because of this difference, DRL needs vast amount of experience samples to learn. In this paper, we propose a Multi-focus Attention Network (MANet) which mimics human ability to spatially abstract the low-level sensory input into multiple entities and attend to them simultaneously. The proposed method first divides the low-level input into several segments which we refer to as partial states. After this segmentation, parallel attention layers attend to the partial states relevant to solving the task. Our model estimates state-action values using these…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Neural Networks and Reservoir Computing · Adversarial Robustness in Machine Learning
