Machine Learning Driven Smishing Detection Framework for Mobile Security
Diksha Goel, Hussain Ahmad, Ankit Kumar Jain, Nikhil Kumar Goel

TL;DR
This paper introduces a machine learning framework that uses text normalization to improve smishing detection accuracy on mobile devices, achieving over 96% accuracy and outperforming existing methods.
Contribution
The paper presents an enhanced content-based smishing detection framework utilizing advanced text normalization to boost classifier performance, especially Naive Bayesian, in identifying smishing messages.
Findings
Detection accuracy of 96.2%
False Positive Rate of 3.87%
False Negative Rate of 2.85%
Abstract
The increasing reliance on smartphones for communication, financial transactions, and personal data management has made them prime targets for cyberattacks, particularly smishing, a sophisticated variant of phishing conducted via SMS. Despite the growing threat, traditional detection methods often struggle with the informal and evolving nature of SMS language, which includes abbreviations, slang, and short forms. This paper presents an enhanced content-based smishing detection framework that leverages advanced text normalization techniques to improve detection accuracy. By converting nonstandard text into its standardized form, the proposed model enhances the efficacy of machine learning classifiers, particularly the Naive Bayesian classifier, in distinguishing smishing messages from legitimate ones. Our experimental results, validated on a publicly available dataset, demonstrate a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
