How to evaluate word embeddings? On importance of data efficiency and simple supervised tasks
Stanis{\l}aw Jastrzebski, Damian Le\'sniak, Wojciech Marian Czarnecki

TL;DR
This paper advocates for evaluating word embeddings based on data efficiency and simple supervised tasks to better reflect their usefulness in transfer learning and to understand their information encoding properties.
Contribution
It proposes a new evaluation approach focusing on data efficiency and supervised tasks, providing a more comprehensive analysis of word embeddings' performance.
Findings
Word similarity and analogy information are non-linearly encoded in embeddings.
Current cosine-based unsupervised evaluations may not fully capture embedding quality.
Evaluation with varying data sizes offers new insights into embedding characteristics.
Abstract
Maybe the single most important goal of representation learning is making subsequent learning faster. Surprisingly, this fact is not well reflected in the way embeddings are evaluated. In addition, recent practice in word embeddings points towards importance of learning specialized representations. We argue that focus of word representation evaluation should reflect those trends and shift towards evaluating what useful information is easily accessible. Specifically, we propose that evaluation should focus on data efficiency and simple supervised tasks, where the amount of available data is varied and scores of a supervised model are reported for each subset (as commonly done in transfer learning). In order to illustrate significance of such analysis, a comprehensive evaluation of selected word embeddings is presented. Proposed approach yields a more complete picture and brings new…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Advanced Text Analysis Techniques
