Evaluation of taxonomic and neural embedding methods for calculating semantic similarity
Dongqiang Yang, Yanqin Yin

TL;DR
This paper compares taxonomic and neural embedding methods for calculating semantic similarity, revealing their strengths, limitations, and potential for combined approaches in lexical semantics.
Contribution
It provides a comprehensive comparison of taxonomic and neural methods, introduces weighting factors for taxonomic similarity, and explores their interplay with human judgments.
Findings
Taxonomic similarity relies on shortest path length without fine-tuning.
Edge-counting measures word similarity literally and metaphorically.
Combining neural embeddings with concept relations shows promising transfer learning potential.
Abstract
Modelling semantic similarity plays a fundamental role in lexical semantic applications. A natural way of calculating semantic similarity is to access handcrafted semantic networks, but similarity prediction can also be anticipated in a distributional vector space. Similarity calculation continues to be a challenging task, even with the latest breakthroughs in deep neural language models. We first examined popular methodologies in measuring taxonomic similarity, including edge-counting that solely employs semantic relations in a taxonomy, as well as the complex methods that estimate concept specificity. We further extrapolated three weighting factors in modelling taxonomic similarity. To study the distinct mechanisms between taxonomic and distributional similarity measures, we ran head-to-head comparisons of each measure with human similarity judgements from the perspectives of word…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
