Neuroevolutionary Transfer Learning of Deep Recurrent Neural Networks through Network-Aware Adaptation
AbdElRahman ElSaid, Joshua Karns, Alexander Ororbia II, Daniel Krutz,, Zimeng Lyu, Travis Desell

TL;DR
This paper introduces N-ASTL, a novel transfer learning method for deep recurrent neural networks that adaptively modifies network structure based on topology and weight distribution, enabling transfer across diverse tasks.
Contribution
It presents a network-aware adaptive transfer learning approach that overcomes previous architectural constraints, allowing structural modifications during transfer.
Findings
Improves transfer performance on real-world datasets
Enables transfer with different network topologies
Enhances generalization over non-transferred RNNs
Abstract
Transfer learning entails taking an artificial neural network (ANN) that is trained on a source dataset and adapting it to a new target dataset. While this has been shown to be quite powerful, its use has generally been restricted by architectural constraints. Previously, in order to reuse and adapt an ANN's internal weights and structure, the underlying topology of the ANN being transferred across tasks must remain mostly the same while a new output layer is attached, discarding the old output layer's weights. This work introduces network-aware adaptive structure transfer learning (N-ASTL), an advancement over prior efforts to remove this restriction. N-ASTL utilizes statistical information related to the source network's topology and weight distribution in order to inform how new input and output neurons are to be integrated into the existing structure. Results show improvements over…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Topic Modeling · Machine Learning and ELM
