On the Linearity of Semantic Change: Investigating Meaning Variation via   Dynamic Graph Models

Steffen Eger; Alexander Mehler

arXiv:1704.02497·cs.CL·April 11, 2017·1 cites

On the Linearity of Semantic Change: Investigating Meaning Variation via Dynamic Graph Models

Steffen Eger, Alexander Mehler

PDF

Open Access

TL;DR

This paper introduces two graph-based models to analyze semantic change over time, revealing that meaning variation follows linear patterns in embedding space and self-similarity decay across three languages.

Contribution

It proposes novel linear models for semantic change and demonstrates their validity across multiple languages and datasets.

Findings

01

Semantic change is linear in embedding space.

02

Self-similarity of words decays linearly over time.

03

Models apply successfully to multilingual corpora.

Abstract

We consider two graph models of semantic change. The first is a time-series model that relates embedding vectors from one time period to embedding vectors of previous time periods. In the second, we construct one graph for each word: nodes in this graph correspond to time points and edge weights to the similarity of the word's meaning across two time points. We apply our two models to corpora across three different languages. We find that semantic change is linear in two senses. Firstly, today's embedding vectors (= meaning) of words can be derived as linear combinations of embedding vectors of their neighbors in previous time periods. Secondly, self-similarity of words decays linearly in time. We consider both findings as new laws/hypotheses of semantic change.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsLanguage and cultural evolution · Topic Modeling · Natural Language Processing Techniques