SimLex-999: Evaluating Semantic Models with (Genuine) Similarity Estimation
Felix Hill, Roi Reichart, Anna Korhonen

TL;DR
SimLex-999 is a new benchmark resource that specifically measures semantic similarity, encouraging the development of models that better capture genuine similarity rather than mere association.
Contribution
It introduces a gold standard dataset focused on true semantic similarity, with diverse concept types and detailed ratings, enabling more precise evaluation of semantic models.
Findings
State-of-the-art models perform below inter-annotator agreement on SimLex-999.
SimLex-999 enables detailed analysis of model performance across different concept types.
The resource encourages development of models with a broader range of semantic understanding.
Abstract
We present SimLex-999, a gold standard resource for evaluating distributional semantic models that improves on existing resources in several important ways. First, in contrast to gold standards such as WordSim-353 and MEN, it explicitly quantifies similarity rather than association or relatedness, so that pairs of entities that are associated but not actually similar [Freud, psychology] have a low rating. We show that, via this focus on similarity, SimLex-999 incentivizes the development of models with a different, and arguably wider range of applications than those which reflect conceptual association. Second, SimLex-999 contains a range of concrete and abstract adjective, noun and verb pairs, together with an independent rating of concreteness and (free) association strength for each pair. This diversity enables fine-grained analyses of the performance of models on concepts of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Biomedical Text Mining and Ontologies
