Quasi-orthogonality and intrinsic dimensions as measures of learning and   generalisation

Qinghua Zhou; Alexander N. Gorban; Evgeny M. Mirkes; Jonathan Bac,; Andrei Zinovyev; Ivan Y. Tyukin

arXiv:2203.16687·cs.LG·April 1, 2022

Quasi-orthogonality and intrinsic dimensions as measures of learning and generalisation

Qinghua Zhou, Alexander N. Gorban, Evgeny M. Mirkes, Jonathan Bac,, Andrei Zinovyev, Ivan Y. Tyukin

PDF

Open Access

TL;DR

This paper investigates whether measures like intrinsic dimensionality and quasi-orthogonality of neural network feature spaces, computed at initialization, can predict trained network performance, potentially aiding neural architecture search.

Contribution

It introduces and validates the use of intrinsic dimensionality and quasi-orthogonality as principled, computable measures correlated with neural network success, expanding on prior work by Mellor et al.

Findings

01

Dimensionality and quasi-orthogonality can discriminate network performance.

02

These measures correlate with final accuracy after training.

03

Potential to accelerate neural architecture search processes.

Abstract

Finding best architectures of learning machines, such as deep neural networks, is a well-known technical and theoretical challenge. Recent work by Mellor et al (2021) showed that there may exist correlations between the accuracies of trained networks and the values of some easily computable measures defined on randomly initialised networks which may enable to search tens of thousands of neural architectures without training. Mellor et al used the Hamming distance evaluated over all ReLU neurons as such a measure. Motivated by these findings, in our work, we ask the question of the existence of other and perhaps more principled measures which could be used as determinants of success of a given neural architecture. In particular, we examine, if the dimensionality and quasi-orthogonality of neural networks' feature space could be correlated with the network's performance after training. We…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications · Face and Expression Recognition · Machine Learning and Data Classification