Correspondence Analysis and PMI-Based Word Embeddings: A Comparative Study
Qianqian Qi, Ayoub Bagheri, David J. Hessen, Peter G. M. van der Heijden

TL;DR
This paper explores the mathematical connection between correspondence analysis and PMI-based word embeddings, proposing variants that improve performance and comparing them with BERT across multiple benchmarks.
Contribution
It establishes a formal link between CA and PMI-based embeddings and introduces ROOT-CA and ROOTROOT-CA variants that enhance embedding quality.
Findings
ROOT-CA and ROOTROOT-CA outperform standard PMI methods
Variants achieve results competitive with BERT
Performance depends on handling extreme matrix values
Abstract
Popular word embedding methods such as GloVe and Word2Vec are related to the factorization of the pointwise mutual information (PMI) matrix. In this paper, we establish a formal connection between correspondence analysis (CA) and PMI-based word embedding methods. CA is a dimensionality reduction method that uses singular value decomposition (SVD), and we show that CA is mathematically close to the weighted factorization of the PMI matrix. We further introduce variants of CA for word-context matrices, namely CA applied after a square-root transformation (ROOT-CA) and after a fourth-root transformation (ROOTROOT-CA). We analyze the performance of these methods and examine how their success or failure is influenced by extreme values in the decomposed matrix. Although our primary focus is on traditionalstatic word embedding methods, we also include a comparison with a transformer-based…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Text Analysis Techniques
MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Attention Is All You Need · Dropout · Dense Connections · Layer Normalization · Adam · Attention Dropout · Linear Layer · Weight Decay · Linear Warmup With Linear Decay
