A Synthetic Conversational Smishing Dataset for Social Engineering Detection
Carl Lochstampfor, Ayan Roy

TL;DR
This paper introduces a synthetic dataset of multi-turn conversational smishing attacks and evaluates various models, highlighting the effectiveness of lexical features and the challenges faced by transformer architectures.
Contribution
The creation of a large, labeled synthetic dataset for multi-stage conversational smishing detection and baseline evaluations of multiple models on this dataset.
Findings
TF-IDF models outperform engineered feature models.
XGBoost with TF-IDF achieves 72.5% accuracy.
Transformer models are limited by input length and data size.
Abstract
Smishing (SMS phishing) has become a serious cybersecurity threat, especially for elderly and cyber-unaware individuals, causing financial loss and undermining user trust. Although prior work has focused on detecting smishing at the level of individual messages, real-world attackers often rely on multi-stage social engineering, gradually manipulating victims through extended conversations before attempting to steal sensitive information. Despite the existence of several datasets for single-message smishing detection, datasets capturing conversational smishing remain largely unavailable, limiting research on multi-turn attack detection. To address this gap, this paper presents a synthetically generated dataset of 3,201 labeled multi-round conversations designed to emulate realistic conversational smishing attacks. The dataset reflects diverse attacker strategies and victim responses…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
