Substitution-based Semantic Change Detection using Contextual Embeddings

Dallas Card

arXiv:2309.02403·cs.CL·September 7, 2023

Substitution-based Semantic Change Detection using Contextual Embeddings

Dallas Card

PDF

1 Repo

TL;DR

This paper introduces a simplified, interpretable, and efficient method for detecting semantic change using contextual embeddings and substitute words, outperforming previous approaches on standard datasets.

Contribution

The paper presents a novel substitution-based approach leveraging contextual embeddings for semantic change detection, improving efficiency, interpretability, and performance.

Findings

01

Outperforms existing methods on key datasets

02

More scalable and interpretable than prior approaches

03

Enables nuanced analysis of semantic change

Abstract

Measuring semantic change has thus far remained a task where methods using contextual embeddings have struggled to improve upon simpler techniques relying only on static word vectors. Moreover, many of the previously proposed approaches suffer from downsides related to scalability and ease of interpretation. We present a simplified approach to measuring semantic change using contextual embeddings, relying only on the most probable substitutes for masked terms. Not only is this approach directly interpretable, it is also far more efficient in terms of storage, achieves superior average performance across the most frequently cited datasets for this task, and allows for more nuanced investigation of change than is possible with static word vectors.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

dallascard/SBSCD
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.