Phonetic based SoundEx & ShapeEx algorithm for Sindhi Spell Checker System
Zeeshan Bhatti, Ahmad Waqas, Imdad Ali Ismaili, Dil Nawaz Hakro, Waseem Javaid Soomro

TL;DR
This paper introduces a novel phonetic and shape-based algorithm for Sindhi spell checking, addressing language-specific challenges to improve accuracy and efficiency in spell correction suggestions.
Contribution
It develops the first combined SoundEx and ShapeEx algorithms tailored for Sindhi, creating unique character categorization tables for phonetic and glyph similarity.
Findings
Enhanced suggestion accuracy for misspelled Sindhi words
First-ever Sindhi-specific phonetic and shape categorization tables
Improved efficiency in Sindhi spell checking system
Abstract
This paper presents a novel combinational phonetic algorithm for the Sindhi language, to be used in developing a Sindhi spell checker, which had not been developed prior to this work. The compound textual forms and glyphs of the Sindhi language present a substantial challenge for developing a Sindhi spell checker system and generating a list of similar suggestions for misspelled words. To implement such a system, phonetic-based Sindhi language rules and patterns must be taken into account to increase accuracy and efficiency. The proposed system is developed as a blend of the phonetic-based SoundEx algorithm and the ShapeEx algorithm for pattern or glyph matching, generating an accurate and efficient suggestion list for incorrect or misspelled Sindhi words. A table of phonetically similar-sounding Sindhi characters for the SoundEx algorithm is also generated, along with another table…
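The blend described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the character groupings below are small, hypothetical placeholders rather than the paper's tables, and the names `build_table`, `encode`, and `suggest`, as well as the ranking rule (candidates matching both keys first), are assumptions made for illustration only.

```python
# Hypothetical, deliberately tiny groupings for illustration only.
# They are NOT the SoundEx/ShapeEx tables from the paper.
PHONETIC_GROUPS = ["سصث", "زذضظ", "تط", "هح"]  # letters assumed to sound alike
SHAPE_GROUPS = ["بٻڀتٿٽٺثپ", "جڄڃچڇحخ", "دڌڏڊڍذ", "رڙز", "سشصض", "طظ"]  # letters assumed to share a base glyph


def build_table(groups):
    """Map every character in a group to that group's numeric code."""
    table = {}
    for code, group in enumerate(groups, start=1):
        for ch in group:
            table[ch] = str(code)
    return table


SOUND_TABLE = build_table(PHONETIC_GROUPS)
SHAPE_TABLE = build_table(SHAPE_GROUPS)


def encode(word, table):
    """Replace grouped characters by their code; keep all others unchanged."""
    return "-".join(table.get(ch, ch) for ch in word)


def suggest(word, dictionary):
    """Return dictionary words sharing a sound key or a shape key with `word`.

    Candidates matching both keys come first (an assumed ranking rule).
    """
    sound_key = encode(word, SOUND_TABLE)
    shape_key = encode(word, SHAPE_TABLE)
    scored = []
    for candidate in dictionary:
        score = int(encode(candidate, SOUND_TABLE) == sound_key) + int(
            encode(candidate, SHAPE_TABLE) == shape_key
        )
        if score:
            scored.append((score, candidate))
    # Stable sort: ties keep their dictionary order.
    scored.sort(key=lambda pair: -pair[0])
    return [candidate for _, candidate in scored]
```

Calling `suggest(misspelled_word, word_list)` returns the candidates whose phonetic key or shape key equals that of the input. A real system would need complete tables for the Sindhi alphabet and probably a finer ranking than this two-level score.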
Taxonomy
Topics: Music and Audio Processing · Speech and Audio Processing
