Quasi-orthogonality and intrinsic dimensions as measures of learning and generalisation
Qinghua Zhou, Alexander N. Gorban, Evgeny M. Mirkes, Jonathan Bac,, Andrei Zinovyev, Ivan Y. Tyukin

TL;DR
This paper investigates whether measures like intrinsic dimensionality and quasi-orthogonality of neural network feature spaces, computed at initialization, can predict trained network performance, potentially aiding neural architecture search.
Contribution
It introduces and validates the use of intrinsic dimensionality and quasi-orthogonality as principled, computable measures correlated with neural network success, expanding on prior work by Mellor et al.
Findings
Dimensionality and quasi-orthogonality can discriminate network performance.
These measures correlate with final accuracy after training.
Potential to accelerate neural architecture search processes.
Abstract
Finding best architectures of learning machines, such as deep neural networks, is a well-known technical and theoretical challenge. Recent work by Mellor et al (2021) showed that there may exist correlations between the accuracies of trained networks and the values of some easily computable measures defined on randomly initialised networks which may enable to search tens of thousands of neural architectures without training. Mellor et al used the Hamming distance evaluated over all ReLU neurons as such a measure. Motivated by these findings, in our work, we ask the question of the existence of other and perhaps more principled measures which could be used as determinants of success of a given neural architecture. In particular, we examine, if the dimensionality and quasi-orthogonality of neural networks' feature space could be correlated with the network's performance after training. We…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications · Face and Expression Recognition · Machine Learning and Data Classification
