LearningGroup: A Real-Time Sparse Training on FPGA via Learnable Weight Grouping for Multi-Agent Reinforcement Learning
Je Yang, JaeUk Kim, Joo-Young Kim

TL;DR
This paper introduces LearningGroup, an FPGA-based system that accelerates multi-agent reinforcement learning training using real-time sparse network pruning and a novel weight grouping algorithm, achieving significant speed and efficiency improvements.
Contribution
It presents the first application of network pruning in MARL training with an FPGA accelerator, including a new on-chip sparse data encoding method and a co-designed architecture for high performance.
Findings
Up to 5.72x reduction in cycle time and 6.81x in memory footprint.
Achieves 257.40-3629.48 GFLOPS throughput and 7.10-100.12 GFLOPS/W energy efficiency.
Up to 12.52x speedup over dense data processing.
Abstract
Multi-agent reinforcement learning (MARL) is a powerful technology to construct interactive artificial intelligent systems in various applications such as multi-robot control and self-driving cars. Unlike supervised model or single-agent reinforcement learning, which actively exploits network pruning, it is obscure that how pruning will work in multi-agent reinforcement learning with its cooperative and interactive characteristics. \par In this paper, we present a real-time sparse training acceleration system named LearningGroup, which adopts network pruning on the training of MARL for the first time with an algorithm/architecture co-design approach. We create sparsity using a weight grouping algorithm and propose on-chip sparse data encoding loop (OSEL) that enables fast encoding with efficient implementation. Based on the OSEL's encoding format, LearningGroup performs efficient weight…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVLSI and FPGA Design Techniques · Advanced Memory and Neural Computing · Neural Networks and Reservoir Computing
MethodsPruning
