Kannada Spell Checker with Sandhi Splitter
A N Akshatha, Chandana G Upadhyaya, Rajashekara S Murthy

TL;DR
This paper presents a Kannada spell checker with an integrated sandhi splitter. The splitter improves error detection and correction for complex (sandhi) words, and the combined tool is reported to be faster and more space efficient than previous spell checkers.
Contribution
It introduces the first sandhi splitter for Kannada, built on a novel splitting algorithm, and integrates it into a spell checker. The approach improves performance and can be extended to other Indian languages.
Findings
Twice as fast as previous spell checkers
200 times more space efficient
90% accuracy on complex nouns
Abstract
Spelling errors are introduced in text either during typing or when the user does not know the correct phoneme or grapheme. If a language contains complex words such as sandhi forms, in which two or more morphemes join according to certain rules, spell checking becomes very tedious. In such situations, a spell checker with a sandhi splitter that alerts the user by flagging errors and providing suggestions is very useful. A novel algorithm for sandhi splitting is proposed in this paper. The sandhi splitter can split about 7000 of the most common sandhi words in the Kannada language, which were used as test samples. The sandhi splitter was integrated with a Kannada spell checker, and a mechanism for generating suggestions was added. A comprehensive, platform-independent, standalone spell checker application with a sandhi splitter was thus developed and tested extensively for its efficiency and correctness. A…
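The abstract does not describe the splitting algorithm itself, so the following is only a minimal sketch of the general idea of undoing a sandhi rule and checking the parts against a dictionary. It is not the authors' algorithm. The lexicon, the romanized spellings, the two simplified rules (glide insertion and final-vowel deletion), and the names `split_sandhi` and `is_valid` are all assumptions made for illustration.

```python
# Hypothetical toy lexicon in romanized form (a real system would use
# Kannada script and a full dictionary).
LEXICON = {"mane", "uru", "alli", "annu"}
VOWELS = "aeiou"


def split_sandhi(word, lexicon=LEXICON):
    """Return candidate (left, right) splits whose parts are both in the lexicon.

    Two simplified, assumed rules are reversed at every split point:
      - agama: a glide 'y' or 'v' was inserted at the junction
      - lopa:  the final vowel of the left part was dropped
    """
    candidates = []
    for i in range(1, len(word)):
        left, right = word[:i], word[i:]
        # Undo agama: drop the inserted glide from the start of the right part.
        if right[0] in "yv" and left in lexicon and right[1:] in lexicon:
            candidates.append((left, right[1:]))
        # Undo lopa: try restoring each possible dropped vowel on the left part.
        if right[0] in VOWELS and right in lexicon:
            for vowel in VOWELS:
                if left + vowel in lexicon:
                    candidates.append((left + vowel, right))
    return candidates


def is_valid(word, lexicon=LEXICON):
    """Accept a word if it is in the lexicon or splits into known parts."""
    return word in lexicon or bool(split_sandhi(word, lexicon))


print(split_sandhi("maneyalli"))  # glide 'y' inserted between mane and alli
print(split_sandhi("uralli"))     # final 'u' of uru dropped before alli
print(is_valid("manealli"))       # no rule in this sketch produces this form
```

In this sketch a word that is absent from the dictionary is flagged only if no rule reversal yields known parts, which is one way a splitter could reduce false alarms on valid compound words.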
Taxonomy
Topics: Natural Language Processing Techniques · Speech Recognition and Synthesis · Handwritten Text Recognition Techniques
