Flexible Attention-Based Multi-Policy Fusion for Efficient Deep   Reinforcement Learning

Zih-Yun Chiu; Yi-Lin Tuan; William Yang Wang; Michael C. Yip

arXiv:2210.03729·cs.LG·October 11, 2023

Flexible Attention-Based Multi-Policy Fusion for Efficient Deep Reinforcement Learning

Zih-Yun Chiu, Yi-Lin Tuan, William Yang Wang, Michael C. Yip

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper introduces a novel reinforcement learning framework called KGRL that integrates multiple external knowledge policies using a flexible, attention-based actor architecture, enhancing efficiency and adaptability in learning tasks.

Contribution

The paper proposes the Knowledge-Inclusive Attention Network (KIAN), enabling arbitrary policy combination and addressing entropy imbalance for improved RL performance.

Findings

01

KIAN outperforms existing methods with external knowledge integration.

02

KGRL achieves human-like learning efficiency and flexibility.

03

The approach effectively manages entropy imbalance in maximum entropy RL.

Abstract

Reinforcement learning (RL) agents have long sought to approach the efficiency of human learning. Humans are great observers who can learn by aggregating external knowledge from various sources, including observations from others' policies of attempting a task. Prior studies in RL have incorporated external knowledge policies to help agents improve sample efficiency. However, it remains non-trivial to perform arbitrary combinations and replacements of those policies, an essential feature for generalization and transferability. In this work, we present Knowledge-Grounded RL (KGRL), an RL paradigm fusing multiple knowledge policies and aiming for human-like efficiency and flexibility. We propose a new actor architecture for KGRL, Knowledge-Inclusive Attention Network (KIAN), which allows free knowledge rearrangement due to embedding-based attentive action prediction. KIAN also addresses…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

pascalson/kgrl
pytorchOfficial

Videos

Flexible Attention-Based Multi-Policy Fusion for Efficient Deep Reinforcement Learning· slideslive

Taxonomy

TopicsExplainable Artificial Intelligence (XAI) · Reinforcement Learning in Robotics