Naive Bayes and Exemplar-Based approaches to Word Sense Disambiguation Revisited
Gerard Escudero, Lluis Marquez, German Rigau

TL;DR
This paper compares Naive Bayes and Exemplar-based methods for Word Sense Disambiguation, clarifies previous misconceptions, proposes an efficiency improvement, and finds Exemplar-based approaches generally outperform Bayesian ones.
Contribution
It provides a detailed comparison of Naive Bayes and Exemplar-based methods, introduces an efficiency enhancement, and clarifies conflicting literature findings.
Findings
Exemplar-based approach generally outperforms Naive Bayes in WSD.
Using only positive information improves efficiency without accuracy loss.
The proposed method scales well with large attribute sets.
Abstract
This paper describes an experimental comparison between two standard supervised learning methods, namely Naive Bayes and Exemplar-based classification, on the Word Sense Disambiguation (WSD) problem. The aim of the work is twofold. Firstly, it attempts to contribute to clarify some confusing information about the comparison between both methods appearing in the related literature. In doing so, several directions have been explored, including: testing several modifications of the basic learning algorithms and varying the feature space. Secondly, an improvement of both algorithms is proposed, in order to deal with large attribute sets. This modification, which basically consists in using only the positive information appearing in the examples, allows to improve greatly the efficiency of the methods, with no loss in accuracy. The experiments have been performed on the largest sense-tagged…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Topic Modeling · Algorithms and Data Compression
