Universal scaling laws in quantum-probabilistic machine learning by   tensor network towards interpreting representation and generalization powers

Sheng-Chen Bai; Shi-Ju Ran

arXiv:2410.09703·quant-ph·March 26, 2025

Universal scaling laws in quantum-probabilistic machine learning by tensor network towards interpreting representation and generalization powers

Sheng-Chen Bai, Shi-Ju Ran

PDF

Open Access

TL;DR

This paper uncovers universal scaling laws in quantum-probabilistic machine learning, revealing how the negative log-likelihood scales with features and training samples, and linking these laws to the model's representation and generalization capabilities.

Contribution

It introduces the concept of universal scaling laws in quantum-probabilistic ML, connecting the scaling behavior to the model's representation and generalization powers, and analyzing the role of orthogonality.

Findings

01

Negative log-likelihood scales linearly with features in untrained models.

02

Quadratic correction in NLL indicates generalization and representation power.

03

Over-parameterization is detectable through deviations in scaling coefficients.

Abstract

Interpreting the representation and generalization powers has been a long-standing issue in the field of machine learning (ML) and artificial intelligence. This work contributes to uncovering the emergence of universal scaling laws in quantum-probabilistic ML. We take the generative tensor network (GTN) in the form of a matrix product state as an example and show that with an untrained GTN (such as a random TN state), the negative logarithmic likelihood (NLL) $L$ generally increases linearly with the number of features $M$ , i.e., $L ≃ k M + co n s t$ . This is a consequence of the so-called ``catastrophe of orthogonality,'' which states that quantum many-body states tend to become exponentially orthogonal to each other as $M$ increases. We reveal that while gaining information through training, the linear scaling law is suppressed by a negative quadratic correction, leading to $L…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsQuantum Computing Algorithms and Architecture · Neural Networks and Applications · Quantum Information and Cryptography

MethodsSparse Evolutionary Training