IMS at SemEval-2020 Task 1: How low can you go? Dimensionality in   Lexical Semantic Change Detection

Jens Kaiser; Dominik Schlechtweg; Sean Papay; Sabine Schulte im Walde

arXiv:2008.03164·cs.CL·August 10, 2020

IMS at SemEval-2020 Task 1: How low can you go? Dimensionality in Lexical Semantic Change Detection

Jens Kaiser, Dominik Schlechtweg, Sean Papay, Sabine Schulte im Walde

PDF

TL;DR

This paper investigates how the choice of vector dimensionality impacts the performance of lexical semantic change detection models, demonstrating that optimizing dimensionality can significantly improve results.

Contribution

It shows that optimizing vector dimensionality in Vector Initialization alignment can outperform existing models in semantic change detection.

Findings

01

Optimizing dimensionality improves model performance.

02

Higher dimensionality can introduce frequency-induced noise.

03

Vector space alignment benefits from careful dimensionality selection.

Abstract

We present the results of our system for SemEval-2020 Task 1 that exploits a commonly used lexical semantic change detection model based on Skip-Gram with Negative Sampling. Our system focuses on Vector Initialization (VI) alignment, compares VI to the currently top-ranking models for Subtask 2 and demonstrates that these can be outperformed if we optimize VI dimensionality. We demonstrate that differences in performance can largely be attributed to model-specific sources of noise, and we reveal a strong relationship between dimensionality and frequency-induced noise in VI alignment. Our results suggest that lexical semantic change models integrating vector space alignment should pay more attention to the role of the dimensionality parameter.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.