Post-Processing of Word Representations via Variance Normalization and   Dynamic Embedding

Bin Wang; Fenxiao Chen; Angela Wang; C.-C. Jay Kuo

arXiv:1808.06305·cs.CL·February 18, 2020·1 cites

Post-Processing of Word Representations via Variance Normalization and Dynamic Embedding

Bin Wang, Fenxiao Chen, Angela Wang, C.-C. Jay Kuo

PDF

Open Access 1 Repo

TL;DR

This paper introduces two novel post-processing techniques, PVN and PDE, that enhance word embeddings by normalizing variance and capturing sequence order, leading to improved NLP performance.

Contribution

The paper proposes two new post-processing methods, PVN and PDE, which improve word embeddings by normalizing variance and modeling sequence order, respectively.

Findings

01

PVN improves embedding quality by variance normalization.

02

PDE captures sequence order information in embeddings.

03

Combined PVN and PDE outperform baseline embeddings.

Abstract

Although embedded vector representations of words offer impressive performance on many natural language processing (NLP) applications, the information of ordered input sequences is lost to some extent if only context-based samples are used in the training. For further performance improvement, two new post-processing techniques, called post-processing via variance normalization (PVN) and post-processing via dynamic embedding (PDE), are proposed in this work. The PVN method normalizes the variance of principal components of word vectors while the PDE method learns orthogonal latent variables from ordered input sequences. The PVN and the PDE methods can be integrated to achieve better performance. We apply these post-processing techniques to two popular word embedding methods (i.e., word2vec and GloVe) to yield their post-processed representations. Extensive experiments are conducted to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

BinWang28/PVN-Post-Processing-of-word-representation-via-variance-normalization
none

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Speech Recognition and Synthesis