IMS at SemEval-2020 Task 1: How low can you go? Dimensionality in Lexical Semantic Change Detection
Jens Kaiser, Dominik Schlechtweg, Sean Papay, Sabine Schulte im Walde

TL;DR
This paper investigates how the choice of vector dimensionality impacts the performance of lexical semantic change detection models, demonstrating that optimizing dimensionality can significantly improve results.
Contribution
It shows that optimizing vector dimensionality in Vector Initialization alignment can outperform existing models in semantic change detection.
Findings
Optimizing dimensionality improves model performance.
Higher dimensionality can introduce frequency-induced noise.
Vector space alignment benefits from careful dimensionality selection.
Abstract
We present the results of our system for SemEval-2020 Task 1 that exploits a commonly used lexical semantic change detection model based on Skip-Gram with Negative Sampling. Our system focuses on Vector Initialization (VI) alignment, compares VI to the currently top-ranking models for Subtask 2 and demonstrates that these can be outperformed if we optimize VI dimensionality. We demonstrate that differences in performance can largely be attributed to model-specific sources of noise, and we reveal a strong relationship between dimensionality and frequency-induced noise in VI alignment. Our results suggest that lexical semantic change models integrating vector space alignment should pay more attention to the role of the dimensionality parameter.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
