Semantic Similarity in a Taxonomy: An Information-Based Measure and its   Application to Problems of Ambiguity in Natural Language

P. Resnik

arXiv:1105.5444·cs.AI·May 30, 2011

Semantic Similarity in a Taxonomy: An Information-Based Measure and its Application to Problems of Ambiguity in Natural Language

P. Resnik

PDF

TL;DR

This paper introduces an information-based semantic similarity measure within taxonomies, outperforming traditional methods, and applies it to resolve ambiguity in natural language through algorithms validated by experiments.

Contribution

It proposes a novel shared information content measure for semantic similarity in taxonomies and demonstrates its effectiveness in ambiguity resolution tasks.

Findings

01

The information-based measure outperforms edge-counting methods.

02

Algorithms leveraging taxonomic similarity effectively resolve ambiguity.

03

Experimental results confirm improved performance in natural language tasks.

Abstract

This article presents a measure of semantic similarity in an IS-A taxonomy based on the notion of shared information content. Experimental evaluation against a benchmark set of human similarity judgments demonstrates that the measure performs better than the traditional edge-counting approach. The article presents algorithms that take advantage of taxonomic similarity in resolving syntactic and semantic ambiguity, along with experimental results demonstrating their effectiveness.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.