An Improved Historical Embedding without Alignment

Xiaofei Xu; Ke Deng; Fei Hu; Li Li

arXiv:1910.08692·cs.CL·October 22, 2019

An Improved Historical Embedding without Alignment

Xiaofei Xu, Ke Deng, Fei Hu, Li Li

PDF

Open Access

TL;DR

This paper introduces a scalable, alignment-free method for encoding historical word meanings into a single dense vector space, improving the detection of semantic change over time.

Contribution

The paper presents a novel approach that eliminates the need for costly embedding alignment across time periods, enhancing efficiency and accuracy in modeling semantic evolution.

Findings

01

Outperformed three popular methods in identifying semantic change

02

Effective visualization of semantic evolution of words

03

Scalable and computationally efficient approach

Abstract

Many words have evolved in meaning as a result of cultural and social change. Understanding such changes is crucial for modelling language and cultural evolution. Low-dimensional embedding methods have shown promise in detecting words' meaning change by encoding them into dense vectors. However, when exploring semantic change of words over time, these methods require the alignment of word embeddings across different time periods. This process is computationally expensive, prohibitively time consuming and suffering from contextual variability. In this paper, we propose a new and scalable method for encoding words from different time periods into one dense vector space. This can greatly improve performance when it comes to identifying words that have changed in meaning over time. We evaluated our method on dataset from Google Books N-gram. Our method outperformed three other popular…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsLanguage and cultural evolution · Topic Modeling · Authorship Attribution and Profiling