RNM-TD3: N:M Semi-structured Sparse Reinforcement Learning From Scratch
Isam Vrce, Andreas Kassler, G\"ok\c{c}e Aydos

TL;DR
This paper introduces RNM-TD3, a reinforcement learning method utilizing N:M structured sparsity to compress neural networks, improve performance, and enable hardware acceleration, demonstrating superior results on continuous-control benchmarks.
Contribution
First study applying N:M structured sparsity in reinforcement learning, balancing compression, performance, and hardware efficiency during training.
Findings
Outperforms dense networks at 50-75% sparsity
Achieves up to 14% performance increase at 2:4 sparsity
Remains competitive at 87.5% sparsity, enabling training speedups
Abstract
Sparsity is a well-studied technique for compressing deep neural networks (DNNs) without compromising performance. In deep reinforcement learning (DRL), neural networks with up to 5% of their original weights can still be trained with minimal performance loss compared to their dense counterparts. However, most existing methods rely on unstructured fine-grained sparsity, which limits hardware acceleration opportunities due to irregular computation patterns. Structured coarse-grained sparsity enables hardware acceleration, yet typically degrades performance and increases pruning complexity. In this work, we present, to the best of our knowledge, the first study on N:M structured sparsity in RL, which balances compression, performance, and hardware efficiency. Our framework enforces row-wise N:M sparsity throughout training for all networks in off-policy RL (TD3), maintaining compatibility…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Advanced Neural Network Applications · Adversarial Robustness in Machine Learning
