Flexible Attention-Based Multi-Policy Fusion for Efficient Deep Reinforcement Learning
Zih-Yun Chiu, Yi-Lin Tuan, William Yang Wang, Michael C. Yip

TL;DR
This paper introduces a novel reinforcement learning framework called KGRL that integrates multiple external knowledge policies using a flexible, attention-based actor architecture, enhancing efficiency and adaptability in learning tasks.
Contribution
The paper proposes the Knowledge-Inclusive Attention Network (KIAN), enabling arbitrary policy combination and addressing entropy imbalance for improved RL performance.
Findings
KIAN outperforms existing methods with external knowledge integration.
KGRL achieves human-like learning efficiency and flexibility.
The approach effectively manages entropy imbalance in maximum entropy RL.
Abstract
Reinforcement learning (RL) agents have long sought to approach the efficiency of human learning. Humans are great observers who can learn by aggregating external knowledge from various sources, including observations from others' policies of attempting a task. Prior studies in RL have incorporated external knowledge policies to help agents improve sample efficiency. However, it remains non-trivial to perform arbitrary combinations and replacements of those policies, an essential feature for generalization and transferability. In this work, we present Knowledge-Grounded RL (KGRL), an RL paradigm fusing multiple knowledge policies and aiming for human-like efficiency and flexibility. We propose a new actor architecture for KGRL, Knowledge-Inclusive Attention Network (KIAN), which allows free knowledge rearrangement due to embedding-based attentive action prediction. KIAN also addresses…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsExplainable Artificial Intelligence (XAI) · Reinforcement Learning in Robotics
