Blind signal decomposition of various word embeddings based on join and   individual variance explained

Yikai Wang; Weijian Li

arXiv:2011.14496·cs.CL·December 1, 2020

Blind signal decomposition of various word embeddings based on join and individual variance explained

Yikai Wang, Weijian Li

PDF

Open Access

TL;DR

This paper introduces a joint signal decomposition method, JIVE, to analyze and fuse different trained word embeddings, revealing their shared and unique features and improving sentiment analysis performance.

Contribution

The paper applies JIVE to decompose word embeddings into joint and individual components, offering new insights and methods for enhancing NLP tasks.

Findings

01

Decomposition improves sentiment analysis accuracy.

02

Mapping to joint components enhances lower-performing embeddings.

03

Concatenating components yields better model performance.

Abstract

In recent years, natural language processing (NLP) has become one of the most important areas with various applications in human's life. As the most fundamental task, the field of word embedding still requires more attention and research. Currently, existing works about word embedding are focusing on proposing novel embedding algorithms and dimension reduction techniques on well-trained word embeddings. In this paper, we propose to use a novel joint signal separation method - JIVE to jointly decompose various trained word embeddings into joint and individual components. Through this decomposition framework, we can easily investigate the similarity and difference among different word embeddings. We conducted extensive empirical study on word2vec, FastText and GLoVE trained on different corpus and with different dimensions. We compared the performance of different decomposed components…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Sentiment Analysis and Opinion Mining · Advanced Text Analysis Techniques

MethodsfastText · GloVe Embeddings