SCRIPTMIND: Crime Script Inference and Cognitive Evaluation for LLM-based Social Engineering Scam Detection System
Heedou Kim, Changsik Kim, Sanghwa Shin, Jaewoo Kang

TL;DR
ScriptMind is a comprehensive framework that leverages LLMs and cognitive evaluation to improve social engineering scam detection and enhance user awareness through structured reasoning and real-time cognitive impact assessment.
Contribution
It introduces a novel integrated system combining scam reasoning, dataset creation, and cognitive evaluation to advance LLM-based scam detection and user cognition.
Findings
Fine-tuned small LLMs outperform GPT-4o in detection accuracy.
The system reduces false positives and improves scammer utterance prediction.
Cognitive simulation enhances user suspicion and awareness in scam scenarios.
Abstract
Social engineering scams increasingly employ personalized, multi-turn deception, exposing the limits of traditional detection methods. While Large Language Models (LLMs) show promise in identifying deception, their cognitive assistance potential remains underexplored. We propose ScriptMind, an integrated framework for LLM-based scam detection that bridges automated reasoning and human cognition. It comprises three components: the Crime Script Inference Task (CSIT) for scam reasoning, the Crime Script-Aware Inference Dataset (CSID) for fine-tuning small LLMs, and the Cognitive Simulation-based Evaluation of Social Engineering Defense (CSED) for assessing real-time cognitive impact. Using 571 Korean phone scam cases, we built 22,712 structured scammer-sequence training instances. Experimental results show that the 11B small LLM fine-tuned with ScriptMind outperformed GPT-4o by 13%,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsSpam and Phishing Detection · Deception detection and forensic psychology · Mental Health via Writing
