Peephole: Predicting Network Performance Before Training

Boyang Deng; Junjie Yan; Dahua Lin

arXiv:1712.03351·cs.LG·December 12, 2017·84 cites

Peephole: Predicting Network Performance Before Training

Boyang Deng, Junjie Yan, Dahua Lin

PDF

Open Access 1 Repo

TL;DR

This paper introduces Peephole, a method that predicts neural network performance from architecture alone using layer encoding and LSTM, enabling reliable performance estimation before training.

Contribution

It presents a novel unified layer encoding approach combined with LSTM to predict network performance pre-training, addressing the challenge of large design space and training costs.

Findings

01

Achieved accurate performance predictions across multiple datasets.

02

Produced consistent architecture rankings.

03

Demonstrated the method's reliability and generalization.

Abstract

The quest for performant networks has been a significant force that drives the advancements of deep learning in recent years. While rewarding, improving network design has never been an easy journey. The large design space combined with the tremendous cost required for network training poses a major obstacle to this endeavor. In this work, we propose a new approach to this problem, namely, predicting the performance of a network before training, based on its architecture. Specifically, we develop a unified way to encode individual layers into vectors and bring them together to form an integrated description via LSTM. Taking advantage of the recurrent network's strong expressive power, this method can reliably predict the performances of various network architectures. Our empirical studies showed that it not only achieved accurate predictions but also produced consistent rankings across…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

pdefraene/cgp_cnn_predictors
pytorch

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAnomaly Detection Techniques and Applications · Adversarial Robustness in Machine Learning · Explainable Artificial Intelligence (XAI)

MethodsSigmoid Activation · Tanh Activation · Long Short-Term Memory