Modularizing while Training: A New Paradigm for Modularizing DNN Models
Binhang Qi, Hailong Sun, Hongyu Zhang, Ruobing Zhao, Xiang Gao

TL;DR
This paper introduces a novel training approach called modularizing-while-training (MwT) that creates structurally modular DNNs with minimal accuracy loss and significantly reduced training time, improving upon existing modularization methods.
Contribution
The paper presents a new training paradigm that integrates modularization into the training process, resulting in more efficient and effective modular DNN models compared to prior post-training modularization techniques.
Findings
MwT achieves only 1.13% accuracy loss, less than baseline.
Kernel retention rate is reduced by 74.31% compared to state-of-the-art.
Training and modularizing time is halved to 108 minutes.
Abstract
Deep neural network (DNN) models have become increasingly crucial components in intelligent software systems. However, training a DNN model is typically expensive in terms of both time and money. To address this issue, researchers have recently focused on reusing existing DNN models - borrowing the idea of code reuse in software engineering. However, reusing an entire model could cause extra overhead or inherits the weakness from the undesired functionalities. Hence, existing work proposes to decompose an already trained model into modules, i.e., modularizing-after-training, and enable module reuse. Since trained models are not built for modularization, modularizing-after-training incurs huge overhead and model accuracy loss. In this paper, we propose a novel approach that incorporates modularization into the model training process, i.e., modularizing-while-training (MwT). We train a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSoftware Engineering Research · Machine Learning in Materials Science · Software System Performance and Reliability
