A Bayesian hybrid method for context-sensitive spelling correction

Andrew R. Golding (Mitsubishi Electric Research Laboratories)

arXiv:cmp-lg/9606001·cmp-lg·February 3, 2008·67 cites

A Bayesian hybrid method for context-sensitive spelling correction

Andrew R. Golding (Mitsubishi Electric Research Laboratories)

PDF

Open Access

TL;DR

This paper introduces a Bayesian hybrid approach for context-sensitive spelling correction that combines multiple evidence sources, outperforming previous decision list methods by utilizing all available evidence for better accuracy.

Contribution

The paper presents a novel Bayesian hybrid method that improves spelling correction by integrating all evidence sources, advancing beyond existing decision list techniques.

Findings

01

Bayesian hybrid method outperforms decision list approaches

02

Utilizing all evidence improves correction accuracy

03

Demonstrated performance gains on spelling correction tasks

Abstract

Two classes of methods have been shown to be useful for resolving lexical ambiguity. The first relies on the presence of particular words within some distance of the ambiguous target word; the second uses the pattern of words and part-of-speech tags around the target word. These methods have complementary coverage: the former captures the lexical ``atmosphere'' (discourse topic, tense, etc.), while the latter captures local syntax. Yarowsky has exploited this complementarity by combining the two methods using decision lists. The idea is to pool the evidence provided by the component methods, and to then solve a target problem by applying the single strongest piece of evidence, whatever type it happens to be. This paper takes Yarowsky's work as a starting point, applying decision lists to the problem of context-sensitive spelling correction. Decision lists are found, by and large, to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Speech and dialogue systems