Understanding the Downstream Instability of Word Embeddings

Megan Leszczynski; Avner May; Jian Zhang; Sen Wu; Christopher R.; Aberger; Christopher R\'e

arXiv:2003.04983·cs.CL·March 12, 2020·6 cites

Understanding the Downstream Instability of Word Embeddings

Megan Leszczynski, Avner May, Jian Zhang, Sen Wu, Christopher R., Aberger, Christopher R\'e

PDF

Open Access 1 Repo

TL;DR

This paper investigates the instability of word embeddings in NLP, revealing a stability-memory tradeoff, introducing a new eigenspace instability measure, and demonstrating its effectiveness in predicting downstream model disagreement.

Contribution

It introduces a novel eigenspace instability measure for word embeddings and analyzes the stability-memory tradeoff, providing practical tools for minimizing downstream prediction instability.

Findings

01

Increasing embedding memory reduces prediction disagreement by up to 37%.

02

The eigenspace instability measure effectively predicts downstream model disagreement.

03

Stability-memory tradeoffs are consistent across different embedding types.

Abstract

Many industrial machine learning (ML) systems require frequent retraining to keep up-to-date with constantly changing data. This retraining exacerbates a large challenge facing ML systems today: model training is unstable, i.e., small changes in training data can cause significant changes in the model's predictions. In this paper, we work on developing a deeper understanding of this instability, with a focus on how a core building block of modern natural language processing (NLP) pipelines---pre-trained word embeddings---affects the instability of downstream NLP models. We first empirically reveal a tradeoff between stability and memory: increasing the embedding memory 2x can reduce the disagreement in predictions due to small changes in training data by 5% to 37% (relative). To theoretically explain this tradeoff, we introduce a new measure of embedding instability---the eigenspace…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

HazyResearch/anchor-stability
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Software Engineering Research