Combining Trigram-based and Feature-based Methods for Context-Sensitive Spelling Correction
Andrew R. Golding, Yves Schabes (Mitsubishi Electric Research, Laboratories)

TL;DR
This paper introduces a hybrid approach called Tribayes that combines trigram-based and feature-based methods to improve context-sensitive spelling correction, outperforming existing tools like Microsoft Word's grammar checker.
Contribution
The paper presents a novel hybrid method, Tribayes, that effectively combines two different approaches for better context-sensitive spelling correction.
Findings
Tribayes outperforms individual trigram and Bayes methods.
Tribayes significantly exceeds Microsoft Word's grammar checker.
The hybrid approach effectively handles both syntactic and contextual errors.
Abstract
This paper addresses the problem of correcting spelling errors that result in valid, though unintended words (such as ``peace'' and ``piece'', or ``quiet'' and ``quite'') and also the problem of correcting particular word usage errors (such as ``amount'' and ``number'', or ``among'' and ``between''). Such corrections require contextual information and are not handled by conventional spelling programs such as Unix `spell'. First, we introduce a method called Trigrams that uses part-of-speech trigrams to encode the context. This method uses a small number of parameters compared to previous methods based on word trigrams. However, it is effectively unable to distinguish among words that have the same part of speech. For this case, an alternative feature-based method called Bayes performs better; but Bayes is less effective than Trigrams when the distinction among words depends on syntactic…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpeech and dialogue systems · Natural Language Processing Techniques · Topic Modeling
