Trustworthiness Calibration Framework for Phishing Email Detection Using Large Language Models
Daniyal Ganiuly, Assel Smaiyl

TL;DR
This paper introduces a comprehensive framework for evaluating the trustworthiness of large language models in phishing email detection, emphasizing calibration, consistency, and robustness beyond mere accuracy.
Contribution
The study presents the Trustworthiness Calibration Framework (TCF) and Trustworthiness Calibration Index (TCI), providing a reproducible methodology for assessing LLM reliability in security applications.
Findings
GPT-4 shows the highest trustworthiness profile.
Reliability varies independently of accuracy.
Framework enhances transparency in model dependability assessment.
Abstract
Phishing emails continue to pose a persistent challenge to online communication, exploiting human trust and evading automated filters through realistic language and adaptive tactics. While large language models (LLMs) such as GPT-4 and LLaMA-3-8B achieve strong accuracy in text classification, their deployment in security systems requires assessing reliability beyond benchmark performance. To address this, this study introduces the Trustworthiness Calibration Framework (TCF), a reproducible methodology for evaluating phishing detectors across three dimensions: calibration, consistency, and robustness. These components are integrated into a bounded index, the Trustworthiness Calibration Index (TCI), and complemented by the Cross-Dataset Stability (CDS) metric that quantifies stability of trustworthiness across datasets. Experiments conducted on five corpora, such as SecureMail 2025,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpam and Phishing Detection · Personal Information Management and User Behavior · Misinformation and Its Impacts
