Training-free Neural Architecture Search through Variance of Knowledge   of Deep Network Weights

Ond\v{r}ej T\'ybl; Luk\'a\v{s} Neumann

arXiv:2502.04975·cs.CV·February 10, 2025

Training-free Neural Architecture Search through Variance of Knowledge of Deep Network Weights

Ond\v{r}ej T\'ybl, Luk\'a\v{s} Neumann

PDF

Open Access 1 Repo

TL;DR

This paper introduces a training-free proxy based on Fisher Information for neural architecture search, significantly reducing computational costs while achieving state-of-the-art accuracy estimates without training networks.

Contribution

It proposes a novel Fisher Information-based proxy for NAS that eliminates the need for training, backed by strong theoretical foundations and validated on multiple datasets.

Findings

01

State-of-the-art results on three datasets

02

Effective in two different search spaces

03

Proposed a new, more informative metric for NAS

Abstract

Deep learning has revolutionized computer vision, but it achieved its tremendous success using deep network architectures which are mostly hand-crafted and therefore likely suboptimal. Neural Architecture Search (NAS) aims to bridge this gap by following a well-defined optimization paradigm which systematically looks for the best architecture, given objective criterion such as maximal classification accuracy. The main limitation of NAS is however its astronomical computational cost, as it typically requires training each candidate network architecture from scratch. In this paper, we aim to alleviate this limitation by proposing a novel training-free proxy for image classification accuracy based on Fisher Information. The proposed proxy has a strong theoretical background in statistics and it allows estimating expected image classification accuracy of a given deep network without…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

ondratybl/vkdnw
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications · Anomaly Detection Techniques and Applications