A New Spectral Method for Latent Variable Models

Matteo Ruffini; Marta Casanellas; Ricard Gavald\`a

arXiv:1612.03409·stat.ML·April 5, 2017·1 cites

A New Spectral Method for Latent Variable Models

Matteo Ruffini, Marta Casanellas, Ricard Gavald\`a

PDF

Open Access 1 Repo

TL;DR

This paper introduces a spectral decomposition-based algorithm for unsupervised learning of latent variable models, demonstrating robustness and efficiency in parameter estimation for text mining applications like topic models and LDA.

Contribution

It presents a novel spectral method that improves parameter learning in latent variable models, with practical algorithms for text mining models such as single topic and LDA.

Findings

01

Robustness of the spectral method in theory and practice

02

Effective parameter retrieval for text models

03

Successful application to real-world text data

Abstract

This paper presents an algorithm for the unsupervised learning of latent variable models from unlabeled sets of data. We base our technique on spectral decomposition, providing a technique that proves to be robust both in theory and in practice. We also describe how to use this algorithm to learn the parameters of two well known text mining models: single topic model and Latent Dirichlet Allocation, providing in both cases an efficient technique to retrieve the parameters to feed the algorithm. We compare the results of our algorithm with those of existing algorithms on synthetic data, and we provide examples of applications to real world text corpora for both single topic model and LDA, obtaining meaningful results.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

mruffini/SpectralMethod
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsText and Document Classification Technologies · Natural Language Processing Techniques · Topic Modeling

MethodsLinear Discriminant Analysis