Theory of overparametrization in quantum neural networks
Martin Larocca, Nathan Ju, Diego Garc\'ia-Mart\'in, Patrick J. Coles,, M. Cerezo

TL;DR
This paper provides a rigorous analysis of overparametrization in quantum neural networks, showing how increasing parameters beyond a critical point improves trainability and capacity, with implications for quantum algorithms.
Contribution
It establishes a theoretical framework linking overparametrization to Lie algebra dimension and Fisher information, revealing a phase transition in QNN trainability and capacity.
Findings
Overparametrization reduces local minima in the loss landscape.
A critical parameter number $M_c$ bounds QNN capacity and Fisher information.
Numerical simulations confirm the phase transition in various applications.
Abstract
The prospect of achieving quantum advantage with Quantum Neural Networks (QNNs) is exciting. Understanding how QNN properties (e.g., the number of parameters ) affect the loss landscape is crucial to the design of scalable QNN architectures. Here, we rigorously analyze the overparametrization phenomenon in QNNs with periodic structure. We define overparametrization as the regime where the QNN has more than a critical number of parameters that allows it to explore all relevant directions in state space. Our main results show that the dimension of the Lie algebra obtained from the generators of the QNN is an upper bound for , and for the maximal rank that the quantum Fisher information and Hessian matrices can reach. Underparametrized QNNs have spurious local minima in the loss landscape that start disappearing when . Thus, the overparametrization onset…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsQuantum Computing Algorithms and Architecture · Quantum Information and Cryptography · Neural Networks and Reservoir Computing
