Universal scaling laws in quantum-probabilistic machine learning by tensor network towards interpreting representation and generalization powers
Sheng-Chen Bai, Shi-Ju Ran

TL;DR
This paper uncovers universal scaling laws in quantum-probabilistic machine learning, revealing how the negative log-likelihood scales with features and training samples, and linking these laws to the model's representation and generalization capabilities.
Contribution
It introduces the concept of universal scaling laws in quantum-probabilistic ML, connecting the scaling behavior to the model's representation and generalization powers, and analyzing the role of orthogonality.
Findings
Negative log-likelihood scales linearly with features in untrained models.
Quadratic correction in NLL indicates generalization and representation power.
Over-parameterization is detectable through deviations in scaling coefficients.
Abstract
Interpreting the representation and generalization powers has been a long-standing issue in the field of machine learning (ML) and artificial intelligence. This work contributes to uncovering the emergence of universal scaling laws in quantum-probabilistic ML. We take the generative tensor network (GTN) in the form of a matrix product state as an example and show that with an untrained GTN (such as a random TN state), the negative logarithmic likelihood (NLL) generally increases linearly with the number of features , i.e., . This is a consequence of the so-called ``catastrophe of orthogonality,'' which states that quantum many-body states tend to become exponentially orthogonal to each other as increases. We reveal that while gaining information through training, the linear scaling law is suppressed by a negative quadratic correction, leading to $L…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsQuantum Computing Algorithms and Architecture · Neural Networks and Applications · Quantum Information and Cryptography
MethodsSparse Evolutionary Training
