Computing Word Classes Using Spectral Clustering
Effi Levi, Saggy Herman, Ari Rappoport

TL;DR
This paper explores the application of spectral clustering to word lexicons in NLP, comparing its effectiveness to Brown clustering and other methods for tasks like semantic role labeling and dependency parsing.
Contribution
It introduces spectral clustering as a novel approach for word clustering in NLP and evaluates its performance against established methods.
Findings
Spectral clusters perform similarly to Brown clusters in NLP tasks.
Spectral clustering outperforms other clustering methods.
Spectral and Brown clusters capture complementary information.
Abstract
Clustering a lexicon of words is a well-studied problem in natural language processing (NLP). Word clusters are used to deal with sparse data in statistical language processing, as well as features for solving various NLP tasks (text categorization, question answering, named entity recognition and others). Spectral clustering is a widely used technique in the field of image processing and speech recognition. However, it has scarcely been explored in the context of NLP; specifically, the method used in this (Meila and Shi, 2001) has never been used to cluster a general word lexicon. We apply spectral clustering to a lexicon of words, evaluating the resulting clusters by using them as features for solving two classical NLP tasks: semantic role labeling and dependency parsing. We compare performance with Brown clustering, a widely-used technique for word clustering, as well as with…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Topic Modeling · Text and Document Classification Technologies
MethodsSpectral Clustering
