A Parametric Similarity Method: Comparative Experiments based on Semantically Annotated Large Datasets
Antonio De Nicola, Anna Formica, Michele Missikoff, Elaheh Pourabbas,, Francesco Taglino

TL;DR
This paper introduces SemSimp, a parametric semantic similarity method leveraging ontologies and information content, validated through experiments on large datasets showing superior performance over existing methods.
Contribution
The paper proposes SemSimp, a novel, configurable semantic similarity approach that outperforms existing methods in large-scale, real-world dataset evaluations.
Findings
SemSimp outperforms other similarity methods in ACM dataset experiments.
SemSimp achieves higher correlation with human judgments.
The method is effective across different domains, including physics.
Abstract
We present the parametric method SemSimp aimed at measuring semantic similarity of digital resources. SemSimp is based on the notion of information content, and it leverages a reference ontology and taxonomic reasoning, encompassing different approaches for weighting the concepts of the ontology. In particular, weights can be computed by considering either the available digital resources or the structure of the reference ontology of a given domain. SemSimp is assessed against six representative semantic similarity methods for comparing sets of concepts proposed in the literature, by carrying out an experimentation that includes both a statistical analysis and an expert judgement evaluation. To the purpose of achieving a reliable assessment, we used a real-world large dataset based on the Digital Library of the Association for Computing Machinery (ACM), and a reference ontology derived…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsLib · Ontology
