A Bayesian hybrid method for context-sensitive spelling correction
Andrew R. Golding (Mitsubishi Electric Research Laboratories)

TL;DR
This paper introduces a Bayesian hybrid approach for context-sensitive spelling correction that combines multiple evidence sources, outperforming previous decision list methods by utilizing all available evidence for better accuracy.
Contribution
The paper presents a novel Bayesian hybrid method that improves spelling correction by integrating all evidence sources, advancing beyond existing decision list techniques.
Findings
Bayesian hybrid method outperforms decision list approaches
Utilizing all evidence improves correction accuracy
Demonstrated performance gains on spelling correction tasks
Abstract
Two classes of methods have been shown to be useful for resolving lexical ambiguity. The first relies on the presence of particular words within some distance of the ambiguous target word; the second uses the pattern of words and part-of-speech tags around the target word. These methods have complementary coverage: the former captures the lexical ``atmosphere'' (discourse topic, tense, etc.), while the latter captures local syntax. Yarowsky has exploited this complementarity by combining the two methods using decision lists. The idea is to pool the evidence provided by the component methods, and to then solve a target problem by applying the single strongest piece of evidence, whatever type it happens to be. This paper takes Yarowsky's work as a starting point, applying decision lists to the problem of context-sensitive spelling correction. Decision lists are found, by and large, to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Topic Modeling · Speech and dialogue systems
