Application-Specific Component-Aware Structured Pruning of Deep Neural Networks in Control via Soft Coefficient Optimization
Ganesh Sundaram, Jonas Ulmen, Amjad Haider, Daniel G\"orges

TL;DR
This paper presents a novel structured pruning framework for deep neural networks used in control applications, optimizing importance metrics with application-specific constraints and balancing model compression with task performance.
Contribution
It introduces a new importance metric calculation framework and two coefficient optimization approaches tailored for application-specific neural network pruning.
Findings
Effective model size reduction while maintaining control performance.
Gradient descent-based coefficient optimization improves compression-performance trade-off.
Validated on MNIST autoencoder and TDMPC agent with positive results.
Abstract
Deep neural networks (DNNs) offer significant flexibility and robust performance. This makes them ideal for building not only system models but also advanced neural network controllers (NNCs). However, their high complexity and computational needs often limit their use. Various model compression strategies have been developed over the past few decades to address these issues. These strategies are effective for general DNNs but do not directly apply to NNCs. NNCs need both size reduction and the retention of key application-specific performance features. In structured pruning, which removes groups of related elements, standard importance metrics often fail to protect these critical characteristics. In this paper, we introduce a novel framework for calculating importance metrics in pruning groups. This framework not only shrinks the model size but also considers various…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Explainable Artificial Intelligence (XAI) · Stochastic Gradient Optimization Techniques
