DNN Modularization via Activation-Driven Training
Tuan Ngo, Abid Hassan, Saad Shafiq, Nenad Medvidovic

TL;DR
MODA introduces an activation-driven training method that inherently modularizes DNNs, reducing training time and model size while maintaining accuracy and improving class-specific performance.
Contribution
This work presents MODA, a novel activation-based training approach that promotes DNN modularity during training without auxiliary masks or significant accuracy loss.
Findings
MODA reduces training time by 22%.
Modules have up to 24x fewer weights and 37x less overlap.
MODA preserves accuracy and enhances class-specific performance.
Abstract
Deep Neural Networks (DNNs) tend to accrue technical debt and suffer from significant retraining costs when adapting to evolving requirements. Modularizing DNNs offers the promise of improving their reusability. Previous work has proposed techniques to decompose DNN models into modules both during and after training. However, these strategies yield several shortcomings, including significant weight overlaps and accuracy losses across modules, restricted focus on convolutional layers only, and added complexity and training time by introducing auxiliary masks to control modularity. In this work, we propose MODA, an activation-driven modular training approach. MODA promotes inherent modularity within a DNN model by directly regulating the activation outputs of its layers based on three modular objectives: intra-class affinity, inter-class dispersion, and compactness. MODA is evaluated…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobotics and Automated Systems · Wireless Body Area Networks · Context-Aware Activity Recognition Systems
MethodsFocus
