Learning to Flow from Generative Pretext Tasks for Neural Architecture Encoding
Sunwoo Kim, Hyunjin Hwang, Kijung Shin

TL;DR
This paper introduces FGP, a pre-training method for neural architecture encoders that captures information flow efficiently, significantly improving performance over traditional supervised training methods.
Contribution
FGP is a novel pre-training approach that enables neural architecture encoders to learn information flow without complex structures, enhancing prediction accuracy.
Findings
FGP improves encoder performance by up to 106%.
FGP increases Precision-1% by up to 106%.
FGP reduces processing time compared to flow-based encoders.
Abstract
The performance of a deep learning model on a specific task and dataset depends heavily on its neural architecture, motivating considerable efforts to rapidly and accurately identify architectures suited to the target task and dataset. To achieve this, researchers use machine learning models-typically neural architecture encoders-to predict the performance of a neural architecture. Many state-of-the-art encoders aim to capture information flow within a neural architecture, which reflects how information moves through the forward pass and backpropagation, via a specialized model structure. However, due to their complicated structures, these flow-based encoders are significantly slower to process neural architectures compared to simpler encoders, presenting a notable practical challenge. To address this, we propose FGP, a novel pre-training method for neural architecture encoding that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Adversarial Robustness in Machine Learning · Advanced Neural Network Applications
