Understanding the Downstream Instability of Word Embeddings
Megan Leszczynski, Avner May, Jian Zhang, Sen Wu, Christopher R., Aberger, Christopher R\'e

TL;DR
This paper investigates the instability of word embeddings in NLP, revealing a stability-memory tradeoff, introducing a new eigenspace instability measure, and demonstrating its effectiveness in predicting downstream model disagreement.
Contribution
It introduces a novel eigenspace instability measure for word embeddings and analyzes the stability-memory tradeoff, providing practical tools for minimizing downstream prediction instability.
Findings
Increasing embedding memory reduces prediction disagreement by up to 37%.
The eigenspace instability measure effectively predicts downstream model disagreement.
Stability-memory tradeoffs are consistent across different embedding types.
Abstract
Many industrial machine learning (ML) systems require frequent retraining to keep up-to-date with constantly changing data. This retraining exacerbates a large challenge facing ML systems today: model training is unstable, i.e., small changes in training data can cause significant changes in the model's predictions. In this paper, we work on developing a deeper understanding of this instability, with a focus on how a core building block of modern natural language processing (NLP) pipelines---pre-trained word embeddings---affects the instability of downstream NLP models. We first empirically reveal a tradeoff between stability and memory: increasing the embedding memory 2x can reduce the disagreement in predictions due to small changes in training data by 5% to 37% (relative). To theoretically explain this tradeoff, we introduce a new measure of embedding instability---the eigenspace…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Software Engineering Research
