Blockchain-assisted Demonstration Cloning for Multi-Agent Deep   Reinforcement Learning

Ahmed Alagha; Jamal Bentahar; Hadi Otrok; Shakti Singh; Rabeb Mizouni

arXiv:2501.10938·cs.LG·January 22, 2025

Blockchain-assisted Demonstration Cloning for Multi-Agent Deep Reinforcement Learning

Ahmed Alagha, Jamal Bentahar, Hadi Otrok, Shakti Singh, Rabeb Mizouni

PDF

TL;DR

This paper introduces a Blockchain-assisted framework for Multi-Agent Deep Reinforcement Learning that leverages expert demonstrations and model sharing to improve learning efficiency and robustness against malicious models.

Contribution

It proposes a novel Blockchain-based demonstration cloning framework that enables secure model sharing and expert guidance in MDRL, addressing existing challenges in sample efficiency and security.

Findings

01

Outperforms existing methods in learning speed

02

Demonstrates robustness against malicious models

03

Enables traceable and autonomous model sharing

Abstract

Multi-Agent Deep Reinforcement Learning (MDRL) is a promising research area in which agents learn complex behaviors in cooperative or competitive environments. However, MDRL comes with several challenges that hinder its usability, including sample efficiency, curse of dimensionality, and environment exploration. Recent works proposing Federated Reinforcement Learning (FRL) to tackle these issues suffer from problems related to model restrictions and maliciousness. Other proposals using reward shaping require considerable engineering and could lead to local optima. In this paper, we propose a novel Blockchain-assisted Multi-Expert Demonstration Cloning (MEDC) framework for MDRL. The proposed method utilizes expert demonstrations in guiding the learning of new MDRL agents, by suggesting exploration actions in the environment. A model sharing framework on Blockchain is designed to allow…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings