Transfer Learning to Learn with Multitask Neural Model Search
Catherine Wong, Andrea Gesmundo

TL;DR
This paper introduces a multitask neural architecture search framework that transfers learned knowledge across tasks, significantly speeding up architecture search and improving model performance compared to traditional methods.
Contribution
The paper presents the MNMS controller, a novel approach that enables transfer learning across multiple tasks in neural architecture search, reducing search time and enhancing model quality.
Findings
MNMS can perform simultaneous architecture search for multiple tasks.
Pre-trained MNMS controllers accelerate search and improve results on new tasks.
Models discovered by MNMS outperform human-designed architectures.
Abstract
Deep learning models require extensive architecture design exploration and hyperparameter optimization to perform well on a given task. The exploration of the model design space is often made by a human expert, and optimized using a combination of grid search and search heuristics over a large space of possible choices. Neural Architecture Search (NAS) is a Reinforcement Learning approach that has been proposed to automate architecture design. NAS has been successfully applied to generate Neural Networks that rival the best human-designed architectures. However, NAS requires sampling, constructing, and training hundreds to thousands of models to achieve well-performing architectures. This procedure needs to be executed from scratch for each new task. The application of NAS to a wide set of tasks currently lacks a way to transfer generalizable knowledge across tasks. In this paper, we…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Topic Modeling · Reinforcement Learning in Robotics
MethodsSigmoid Activation · Tanh Activation · Softmax · Long Short-Term Memory
