Trustworthiness Calibration Framework for Phishing Email Detection Using Large Language Models

Daniyal Ganiuly; Assel Smaiyl

arXiv:2511.04728·cs.CR·November 10, 2025

Trustworthiness Calibration Framework for Phishing Email Detection Using Large Language Models

Daniyal Ganiuly, Assel Smaiyl

PDF

Open Access

TL;DR

This paper introduces a comprehensive framework for evaluating the trustworthiness of large language models in phishing email detection, emphasizing calibration, consistency, and robustness beyond mere accuracy.

Contribution

The study presents the Trustworthiness Calibration Framework (TCF) and Trustworthiness Calibration Index (TCI), providing a reproducible methodology for assessing LLM reliability in security applications.

Findings

01

GPT-4 shows the highest trustworthiness profile.

02

Reliability varies independently of accuracy.

03

Framework enhances transparency in model dependability assessment.

Abstract

Phishing emails continue to pose a persistent challenge to online communication, exploiting human trust and evading automated filters through realistic language and adaptive tactics. While large language models (LLMs) such as GPT-4 and LLaMA-3-8B achieve strong accuracy in text classification, their deployment in security systems requires assessing reliability beyond benchmark performance. To address this, this study introduces the Trustworthiness Calibration Framework (TCF), a reproducible methodology for evaluating phishing detectors across three dimensions: calibration, consistency, and robustness. These components are integrated into a bounded index, the Trustworthiness Calibration Index (TCI), and complemented by the Cross-Dataset Stability (CDS) metric that quantifies stability of trustworthiness across datasets. Experiments conducted on five corpora, such as SecureMail 2025,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSpam and Phishing Detection · Personal Information Management and User Behavior · Misinformation and Its Impacts