Two Step CCA: A new spectral method for estimating vector models of   words

Paramveer Dhillon (University of Pennsylvania); Jordan Rodu; (University of Pennsylvania); Dean Foster (University of Pennsylvania); Lyle; Ungar (University of Pennsylvania)

arXiv:1206.6403·cs.CL·July 3, 2012·49 cites

Two Step CCA: A new spectral method for estimating vector models of words

Paramveer Dhillon (University of Pennsylvania), Jordan Rodu, (University of Pennsylvania), Dean Foster (University of Pennsylvania), Lyle, Ungar (University of Pennsylvania)

PDF

Open Access

TL;DR

This paper introduces a novel spectral method called Two Step CCA for learning low-dimensional word representations from co-occurrence data, demonstrating theoretical advantages and empirical effectiveness in NLP tasks.

Contribution

The paper proposes a new two-step CCA-based spectral method that improves sample complexity and representation quality over traditional single-step approaches.

Findings

01

Lower sample complexity than single-step methods

02

Effective in POS tagging tasks

03

Improves sentiment classification accuracy

Abstract

Unlabeled data is often used to learn representations which can be used to supplement baseline features in a supervised learner. For example, for text applications where the words lie in a very high dimensional space (the size of the vocabulary), one can learn a low rank "dictionary" by an eigen-decomposition of the word co-occurrence matrix (e.g. using PCA or CCA). In this paper, we present a new spectral method based on CCA to learn an eigenword dictionary. Our improved procedure computes two set of CCAs, the first one between the left and right contexts of the given word and the second one between the projections resulting from this CCA and the word itself. We prove theoretically that this two-step procedure has lower sample complexity than the simple single step procedure and also illustrate the empirical efficacy of our approach and the richness of representations learned by our…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsText and Document Classification Technologies · Natural Language Processing Techniques · Topic Modeling