Multi-Agent Reinforcement Learning for Sample-Efficient Deep Neural Network Mapping

Srivatsan Krishnan; Jason Jabbour; Dan Zhang; Natasha Jaques; Aleksandra Faust; Shayegan Omidshafiei; Vijay Janapa Reddi

arXiv:2507.16249·cs.LG·July 23, 2025

Multi-Agent Reinforcement Learning for Sample-Efficient Deep Neural Network Mapping

Srivatsan Krishnan, Jason Jabbour, Dan Zhang, Natasha Jaques, Aleksandra Faust, Shayegan Omidshafiei, Vijay Janapa Reddi

PDF

Open Access

TL;DR

This paper introduces a decentralized multi-agent reinforcement learning framework for neural network mapping that significantly improves sample efficiency and reduces latency and energy consumption in hardware accelerators.

Contribution

It proposes a novel multi-agent RL approach with agent clustering to enhance exploration and sample efficiency in neural network hardware mapping.

Findings

01

30-300x improvement in sample efficiency

02

up to 32.61x latency reduction

03

up to 16.45x energy-delay product reduction

Abstract

Mapping deep neural networks (DNNs) to hardware is critical for optimizing latency, energy consumption, and resource utilization, making it a cornerstone of high-performance accelerator design. Due to the vast and complex mapping space, reinforcement learning (RL) has emerged as a promising approach-but its effectiveness is often limited by sample inefficiency. We present a decentralized multi-agent reinforcement learning (MARL) framework designed to overcome this challenge. By distributing the search across multiple agents, our framework accelerates exploration. To avoid inefficiencies from training multiple agents in parallel, we introduce an agent clustering algorithm that assigns similar mapping parameters to the same agents based on correlation analysis. This enables a decentralized, parallelized learning process that significantly improves sample efficiency. Experimental results…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Algorithms and Applications · Advanced Sensor and Control Systems · Data Stream Mining Techniques