Generalized Grade-of-Membership Estimation for High-dimensional Locally Dependent Data
Ling Chen, Chengzhu Huang, Yuqi Gu

TL;DR
This paper introduces a scalable spectral method for estimating generalized Grade-of-Membership models in high-dimensional, locally dependent categorical data, overcoming limitations of traditional Bayesian approaches.
Contribution
The authors propose a novel flattening and spectral decomposition approach with theoretical error bounds for high-dimensional, locally dependent data in generalized GoM models.
Findings
Method outperforms existing approaches in simulations.
Applicable to diverse data types like surveys and genetics.
Provides finite-sample error guarantees.
Abstract
This work focuses on the mixed membership models for multivariate categorical data widely used for analyzing survey responses and population genetics data. These grade of membership (GoM) models offer rich modeling power but present significant estimation challenges for high-dimensional polytomous data. Popular existing approaches, such as Bayesian MCMC inference, are not scalable and lack theoretical guarantees in high-dimensional settings. To address this, we first observe that data from this model can be reformulated as a three-way (quasi-)tensor, with many subjects responding to many items with varying numbers of categories. We introduce a novel and simple approach that flattens the three-way quasi-tensor into a "fat" matrix, and then perform a singular value decomposition of it to estimate parameters by exploiting the singular subspace geometry. Our fast spectral method can…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStatistical Methods and Inference
