Reusing Convolutional Neural Network Models through Modularization and Composition
Binhang Qi, Hailong Sun, Hongyu Zhang, Xiang Gao

TL;DR
This paper introduces modularization methods for CNNs, enabling the decomposition of trained models into small, reusable class-specific modules that can be reused or combined to improve or build models efficiently.
Contribution
The paper proposes two novel modularization approaches, CNNSplitter and GradSplitter, for decomposing CNNs into class-specific modules, enhancing reusability and model patching capabilities.
Findings
GradSplitter results in less accuracy loss than CNNSplitter.
Modules are significantly smaller, reducing kernel count by 19.88%.
Reusing modules achieves similar accuracy to training from scratch, with only 2.46% average loss.
Abstract
With the widespread success of deep learning technologies, many trained deep neural network (DNN) models are now publicly available. However, directly reusing the public DNN models for new tasks often fails due to mismatching functionality or performance. Inspired by the notion of modularization and composition in software reuse, we investigate the possibility of improving the reusability of DNN models in a more fine-grained manner. Specifically, we propose two modularization approaches named CNNSplitter and GradSplitter, which can decompose a trained convolutional neural network (CNN) model for -class classification into small reusable modules. Each module recognizes one of the classes and contains a part of the convolution kernels of the trained CNN model. Then, the resulting modules can be reused to patch existing CNN models or build new CNN models through composition. The…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Machine Learning and Data Classification · Domain Adaptation and Few-Shot Learning
