Equivalence in Deep Neural Networks via Conjugate Matrix Ensembles
Mehmet S\"uzen

TL;DR
This paper introduces a numerical method using Mixed Matrix Ensembles and conjugate circular ensembles to detect equivalence between deep neural network architectures, supported by empirical spectral density analysis.
Contribution
It presents a novel spectral density-based approach for establishing neural network equivalences, applicable to both artificial and biological architectures.
Findings
Spectral density differences vanish with specific decay rates.
Cumulative Circular Spectral Difference (CSD) can identify architecture equivalences.
Method applicable to a wide range of vision architectures and biological networks.
Abstract
A numerical approach is developed for detecting the equivalence of deep learning architectures. The method is based on generating Mixed Matrix Ensembles (MMEs) out of deep neural network weight matrices and {\it conjugate circular ensemble} matching the neural architecture topology. Following this, the empirical evidence supports the {\it phenomenon} that difference between spectral densities of neural architectures and corresponding {\it conjugate circular ensemble} are vanishing with different decay rates at the long positive tail part of the spectrum i.e., cumulative Circular Spectral Difference (CSD). This finding can be used in establishing equivalences among different neural architectures via analysis of fluctuations in CSD. We investigated this phenomenon for a wide range of deep learning vision architectures and with circular ensembles originating from statistical quantum…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsQuantum chaos and dynamical systems · Protein Structure and Dynamics · Fractal and DNA sequence analysis
MethodsSigmoid Activation · Softmax · Tanh Activation · Long Short-Term Memory
