DC and SA: Robust and Efficient Hyperparameter Optimization of Multi-subnetwork Deep Learning Models

Alex H. Treacher; Albert Montillo

arXiv:2202.11841·cs.LG·February 25, 2022

TL;DR

This paper introduces two novel hyperparameter optimization strategies tailored to multi-subnetwork deep learning models. By exploiting the modular architecture, they improve both search efficiency and final model performance.

Contribution

The paper proposes two new approaches that enhance existing hyperparameter optimization algorithms by leveraging subnetwork structures for faster and more effective tuning.

Findings

01 Optimization efficiency increased by up to 23.62x

02 Final classification accuracy improved by up to 3.5%

03 Regression MSE reduced by 4.4 units

Abstract

We present two novel hyperparameter optimization strategies for deep learning models with a modular architecture constructed of multiple subnetworks. As complex networks with multiple subnetworks become more frequently applied in machine learning, hyperparameter optimization methods are required to efficiently optimize their hyperparameters. Existing hyperparameter searches are general and can be used to optimize such networks; however, by exploiting the multi-subnetwork architecture, these searches can be sped up substantially. The proposed methods offer faster convergence to a better-performing final model. To demonstrate this, we propose two independent approaches to enhance these prior algorithms: 1) a divide-and-conquer approach, in which the best subnetworks of top-performing models are combined, allowing for more rapid sampling of the hyperparameter search space.…
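
The divide-and-conquer idea in the abstract (combining the best subnetworks of top-performing models) can be illustrated with a minimal sketch. This is not the authors' implementation. It assumes, hypothetically, that every evaluated trial stores an overall loss plus a per-subnetwork loss and hyperparameter config; the function name `recombine_top_subnetworks`, the trial layout, and the `top_k` parameter are all invented for illustration.

```python
from itertools import product


def recombine_top_subnetworks(trials, top_k=2):
    """Propose new configurations by mixing subnetwork configs of the best trials.

    Assumed (hypothetical) trial layout:
        {"loss": float,
         "subnets": {name: {"config": dict, "loss": float}, ...}}
    Config values must be hashable (numbers, strings) for de-duplication.
    """
    # Keep the top_k trials with the lowest overall loss.
    top = sorted(trials, key=lambda t: t["loss"])[:top_k]
    names = sorted(top[0]["subnets"])

    # For each subnetwork slot, rank the top trials by that subnetwork's own loss.
    per_slot = []
    for name in names:
        ranked = sorted(top, key=lambda t: t["subnets"][name]["loss"])
        per_slot.append([t["subnets"][name]["config"] for t in ranked])

    def key_of(configs):
        # Hashable fingerprint of a full multi-subnetwork configuration.
        return tuple(tuple(sorted(c.items())) for c in configs)

    # Skip combinations that were already evaluated.
    seen = {key_of([t["subnets"][n]["config"] for n in names]) for t in trials}

    candidates = []
    for combo in product(*per_slot):
        key = key_of(combo)
        if key not in seen:
            seen.add(key)
            candidates.append(dict(zip(names, combo)))
    return candidates


# Toy search history with two subnetworks; all values are made up.
trials = [
    {"loss": 0.30, "subnets": {"encoder": {"config": {"lr": 0.1}, "loss": 0.2},
                               "head": {"config": {"units": 32}, "loss": 0.5}}},
    {"loss": 0.35, "subnets": {"encoder": {"config": {"lr": 0.01}, "loss": 0.4},
                               "head": {"config": {"units": 64}, "loss": 0.1}}},
    {"loss": 0.90, "subnets": {"encoder": {"config": {"lr": 1.0}, "loss": 0.9},
                               "head": {"config": {"units": 8}, "loss": 0.9}}},
]
candidates = recombine_top_subnetworks(trials, top_k=2)
print(candidates)
```

In this toy history, the first proposed candidate pairs the encoder config with the lowest encoder loss and the head config with the lowest head loss, a combination that neither original trial had tried. A surrounding search algorithm would then train and score such candidates alongside its own samples.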

Taxonomy

Topics: Machine Learning and Data Classification · Advanced Neural Network Applications · Domain Adaptation and Few-Shot Learning