Context-aware Stand-alone Neural Spelling Correction

Xiangci Li; Hairong Liu; Liang Huang

arXiv:2011.06642·cs.CL·May 19, 2025·1 cites

Context-aware Stand-alone Neural Spelling Correction

Xiangci Li, Hairong Liu, Liang Huang

PDF

Open Access 1 Repo

TL;DR

This paper introduces a context-aware neural approach for stand-alone spelling correction that leverages pre-trained language models to jointly detect and correct misspellings, significantly outperforming previous methods.

Contribution

It proposes a novel sequence labeling framework using fine-tuned pre-trained models for stand-alone spelling correction, focusing solely on spelling errors without token insertion or deletion.

Findings

01

Outperforms previous state-of-the-art by 12.8% absolute F0.5 score

02

Effectively detects and corrects misspellings using context-aware modeling

03

Demonstrates the effectiveness of joint detection and correction approach

Abstract

Existing natural language processing systems are vulnerable to noisy inputs resulting from misspellings. On the contrary, humans can easily infer the corresponding correct words from their misspellings and surrounding context. Inspired by this, we address the stand-alone spelling correction problem, which only corrects the spelling of each token without additional token insertion or deletion, by utilizing both spelling information and global context representations. We present a simple yet powerful solution that jointly detects and corrects misspellings as a sequence labeling task by fine-turning a pre-trained language model. Our solution outperforms the previous state-of-the-art result by 12.8% absolute F0.5 score.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

jacklxc/StandAloneSpellingCorrection
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Speech and dialogue systems