Just an Update on PMING Distance for Web-based Semantic Similarity in Artificial Intelligence and Data Mining
Valentina Franzoni

TL;DR
This paper updates the PMING Distance, a semantic similarity measure based on search engine data, providing a formal algebraic definition that improves its accuracy and reflects web resource changes.
Contribution
It introduces a novel formal algebraic definition of PMING Distance, correcting previous versions and enhancing its ability to measure semantic similarity dynamically.
Findings
Provides a corrected algebraic formulation of PMING Distance
Demonstrates the measure's dynamic reflection of web resource changes
Highlights PMING as a locally normalized combination of PMI and NGD
Abstract
One of the main problems that emerges in the classic approach to semantics is the difficulty in acquisition and maintenance of ontologies and semantic annotations. On the other hand, the Internet explosion and the massive diffusion of mobile smart devices lead to the creation of a worldwide system, which information is daily checked and fueled by the contribution of millions of users who interacts in a collaborative way. Search engines, continually exploring the Web, are a natural source of information on which to base a modern approach to semantic annotation. A promising idea is that it is possible to generalize the semantic similarity, under the assumption that semantically similar terms behave similarly, and define collaborative proximity measures based on the indexing information returned by search engines. The PMING Distance is a proximity measure used in data mining and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
