Asymptotic Bayesian Generalization Error in Latent Dirichlet Allocation   and Stochastic Matrix Factorization

Naoki Hayashi; Sumio Watanabe

arXiv:1709.04212·math.ST·February 21, 2020

Asymptotic Bayesian Generalization Error in Latent Dirichlet Allocation and Stochastic Matrix Factorization

Naoki Hayashi, Sumio Watanabe

PDF

TL;DR

This paper establishes the asymptotic Bayesian generalization error for Latent Dirichlet Allocation and stochastic matrix factorization, showing they share the same RLCT and have smaller errors than regular models.

Contribution

It proves the equivalence of LDA and SMF in terms of RLCT and generalization error, providing new insights into their asymptotic behavior.

Findings

01

LDA and SMF have the same RLCT and asymptotic generalization error.

02

The RLCT upper bound is smaller than half the parameter dimension.

03

Their Bayesian generalization errors are smaller than those of regular models.

Abstract

Latent Dirichlet allocation (LDA) is useful in document analysis, image processing, and many information systems; however, its generalization performance has been left unknown because it is a singular learning machine to which regular statistical theory can not be applied. Stochastic matrix factorization (SMF) is a restricted matrix factorization in which matrix factors are stochastic; the column of the matrix is in a simplex. SMF is being applied to image recognition and text mining. We can understand SMF as a statistical model by which a stochastic matrix of given data is represented by a product of two stochastic matrices, whose generalization performance has also been left unknown because of non-regularity. In this paper, by using an algebraic and geometric method, we show the analytic equivalence of LDA and SMF, both of which have the same real log canonical threshold (RLCT),…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsLinear Discriminant Analysis