WiC: the Word-in-Context Dataset for Evaluating Context-Sensitive Meaning Representations
Mohammad Taher Pilehvar, Jose Camacho-Collados

TL;DR
The paper introduces WiC, a large-scale dataset designed to evaluate the ability of models to understand words' meanings in different contexts, addressing limitations of existing benchmarks.
Contribution
It presents WiC, a new dataset for evaluating context-sensitive word representations, filling a gap left by the shortcomings of previous benchmarks.
Findings
Existing models surpass the Stanford Contextual Word Similarity performance ceiling.
WiC provides a more suitable benchmark for dynamic semantics evaluation.
WiC is curated by experts and publicly available.
Abstract
By design, word embeddings are unable to model the dynamic nature of words' semantics, i.e., the property of words to correspond to potentially different meanings. To address this limitation, dozens of specialized meaning representation techniques such as sense or contextualized embeddings have been proposed. However, despite the popularity of research on this topic, very few evaluation benchmarks exist that specifically focus on the dynamic semantics of words. In this paper we show that existing models have surpassed the performance ceiling of the standard evaluation dataset for the purpose, i.e., Stanford Contextual Word Similarity, and highlight its shortcomings. To address the lack of a suitable benchmark, we put forward a large-scale Word in Context dataset, called WiC, based on annotations curated by experts, for generic evaluation of context-sensitive representations. WiC is…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
- 🤗google-t5/t5-smallmodel· 1.9M dl· ♡ 5381.9M dl♡ 538
- 🤗google-t5/t5-largemodel· 451k dl· ♡ 253451k dl♡ 253
- 🤗google-t5/t5-11bmodel· 22k dl· ♡ 6922k dl♡ 69
- 🤗google-t5/t5-3bmodel· 428k dl· ♡ 52428k dl♡ 52
- 🤗google-t5/t5-basemodel· 1.8M dl· ♡ 7701.8M dl♡ 770
- 🤗Kamrani/t5-largemodel· 6 dl6 dl
- 🤗qiaoyi/Comment_Summarization4DesignTutormodel· 11 dl11 dl
- 🤗ybelkada/t5-11b-shardedmodel· 11 dl· ♡ 211 dl♡ 2
- 🤗michellehbn/brrrrmodel· ♡ 1♡ 1
- 🤗BrainStormersHakton/question-gen-T5-basemodel· 3 dl3 dl
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Misinformation and Its Impacts · Sentiment Analysis and Opinion Mining
