Real Multi-Sense or Pseudo Multi-Sense: An Approach to Improve Word   Representation

Haoyue Shi; Caihua Li; Junfeng Hu

arXiv:1701.01574·cs.CL·January 9, 2017·1 cites

Real Multi-Sense or Pseudo Multi-Sense: An Approach to Improve Word Representation

Haoyue Shi, Caihua Li, Junfeng Hu

PDF

Open Access

TL;DR

This paper introduces a method to identify and eliminate pseudo multi-sense in multi-sense word embeddings, enhancing their quality by reducing redundant representations of the same meaning.

Contribution

It proposes a novel algorithm to detect pseudo multi-sense and refines existing embeddings, improving their performance on word similarity and analogy tasks.

Findings

01

Refined embeddings outperform original ones on similarity tasks

02

Eliminating pseudo multi-sense improves embedding quality

03

Method reduces linguistic complexity in word representations

Abstract

Previous researches have shown that learning multiple representations for polysemous words can improve the performance of word embeddings on many tasks. However, this leads to another problem. Several vectors of a word may actually point to the same meaning, namely pseudo multi-sense. In this paper, we introduce the concept of pseudo multi-sense, and then propose an algorithm to detect such cases. With the consideration of the detected pseudo multi-sense cases, we try to refine the existing word embeddings to eliminate the influence of pseudo multi-sense. Moreover, we apply our algorithm on previous released multi-sense word embeddings and tested it on artificial word similarity tasks and the analogy task. The result of the experiments shows that diminishing pseudo multi-sense can improve the quality of word representations. Thus, our method is actually an efficient way to reduce…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Advanced Text Analysis Techniques