Expert-Free Online Transfer Learning in Multi-Agent Reinforcement Learning
Alberto Castagna, Ivana Dusparic

TL;DR
This paper introduces EF-OnTL, a novel transfer learning algorithm for multi-agent reinforcement learning that operates without experts, dynamically selecting source agents based on performance and uncertainty, and demonstrates its effectiveness across various benchmark tasks.
Contribution
The paper proposes EF-OnTL, an expert-free, real-time transfer learning method with a new uncertainty estimation technique, improving multi-agent RL without external knowledge or expert agents.
Findings
EF-OnTL performs comparably to advice-based methods without external input.
EF-OnTL outperforms no-transfer approaches, especially in complex tasks.
The proposed sars-RND improves uncertainty estimation in RL environments.
Abstract
Transfer learning in Reinforcement Learning (RL) has been widely studied to overcome training issues of Deep-RL, i.e., exploration cost, data availability and convergence time, by introducing a way to enhance training phase with external knowledge. Generally, knowledge is transferred from expert-agents to novices. While this fixes the issue for a novice agent, a good understanding of the task on expert agent is required for such transfer to be effective. As an alternative, in this paper we propose Expert-Free Online Transfer Learning (EF-OnTL), an algorithm that enables expert-free real-time dynamic transfer learning in multi-agent system. No dedicated expert exists, and transfer source agent and knowledge to be transferred are dynamically selected at each transfer step based on agents' performance and uncertainty. To improve uncertainty estimation, we also propose State Action Reward…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Domain Adaptation and Few-Shot Learning · Adaptive Dynamic Programming Control
