TL;DR
This paper introduces a method linking diachronic word embeddings to documents to identify those with progressive semantic changes, showing that such documents tend to receive more citations, indicating higher influence.
Contribution
The paper presents a novel approach to quantify semantic progressiveness in word usage and aggregate this to score documents, linking language change to influence.
Findings
Semantically progressive documents receive more citations.
The method effectively identifies influential documents based on language change.
Link between semantic change and document influence is empirically demonstrated.
Abstract
Diachronic word embeddings -- vector representations of words over time -- offer remarkable insights into the evolution of language and provide a tool for quantifying sociocultural change from text documents. Prior work has used such embeddings to identify shifts in the meaning of individual words. However, simply knowing that a word has changed in meaning is insufficient to identify the instances of word usage that convey the historical or the newer meaning. In this paper, we link diachronic word embeddings to documents, by situating those documents as leaders or laggards with respect to ongoing semantic changes. Specifically, we propose a novel method to quantify the degree of semantic progressiveness in each word usage, and then show how these usages can be aggregated to obtain scores for each document. We analyze two large collections of documents, representing legal opinions and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
