Knowledge Sources for Word Sense Disambiguation

Eneko Agirre; David Martinez

arXiv:cs/0109030 · cs.CL · May 23, 2007

Open Access

TL;DR

This paper analyzes the relationship between knowledge types and information sources used in Word Sense Disambiguation, comparing various algorithms to guide future system development and knowledge acquisition strategies.

Contribution

The paper systematizes the connection between desired knowledge types and actual information sources in WSD, and compares the performance of a wide range of algorithms on a common test setting.

Findings

01. Comparison of algorithms on a unified test set

02. Insights into the effectiveness of different knowledge sources

03. Guidance for shifting from information-based to knowledge-based systems

Abstract

Two kinds of systems have been defined during the long history of WSD: principled systems that define which knowledge types are useful for WSD, and robust systems that use the information sources at hand, such as dictionaries, light-weight ontologies, or hand-tagged corpora. This paper tries to systematize the relation between desired knowledge types and actual information sources. We also compare the results for a wide range of algorithms that have been evaluated on a common test setting in our research group. We hope that this analysis will help drive the shift from systems based on information sources to systems based on knowledge sources. This study might also shed some light on semi-automatic acquisition of desired knowledge types from existing resources.

Taxonomy

Topics: Natural Language Processing Techniques · Topic Modeling · Speech and dialogue systems