DC and SA: Robust and Efficient Hyperparameter Optimization of Multi-subnetwork Deep Learning Models
Alex H. Treacher, Albert Montillo

TL;DR
This paper introduces two novel hyperparameter optimization strategies tailored for multi-subnetwork deep learning models, significantly improving efficiency and final model performance by exploiting modular architecture.
Contribution
The paper proposes two new approaches that enhance existing hyperparameter optimization algorithms by leveraging subnetwork structures for faster and more effective tuning.
Findings
Optimization efficiency increased up to 23.62x
Final accuracy improved by up to 3.5% in classification
Regression MSE reduced by 4.4 units
Abstract
We present two novel hyperparameter optimization strategies for optimization of deep learning models with a modular architecture constructed of multiple subnetworks. As complex networks with multiple subnetworks become more frequently applied in machine learning, hyperparameter optimization methods are required to efficiently optimize their hyperparameters. Existing hyperparameter searches are general, and can be used to optimize such networks, however, by exploiting the multi-subnetwork architecture, these searches can be sped up substantially. The proposed methods offer faster convergence to a better-performing final model. To demonstrate this, we propose 2 independent approaches to enhance these prior algorithms: 1) a divide-and-conquer approach, in which the best subnetworks of top-performing models are combined, allowing for more rapid sampling of the hyperparameter search space.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Data Classification · Advanced Neural Network Applications · Domain Adaptation and Few-Shot Learning
