Does Johnny Get the Message? Evaluating Cybersecurity Notifications for Everyday Users

Victor J\"uttner; Erik Buchmann

arXiv:2505.22435·cs.CR·May 29, 2025

Does Johnny Get the Message? Evaluating Cybersecurity Notifications for Everyday Users

Victor J\"uttner, Erik Buchmann

PDF

Open Access

TL;DR

This paper introduces a framework for evaluating cybersecurity alerts generated by large language models, focusing on their clarity, accuracy, and usefulness for everyday users, and demonstrates its application through various use cases.

Contribution

The paper presents the Human-Centered Security Alert Evaluation Framework (HCSAEF), a novel method for assessing LLM-generated cybersecurity notifications for user comprehension and reliability.

Findings

01

HCSAEF effectively differentiates alerts based on intuitiveness, urgency, and correctness.

02

Prompt design and model choice significantly impact notification quality.

03

HCSAEF supports comparison and improvement of cybersecurity alerts for end users.

Abstract

Due to the increasing presence of networked devices in everyday life, not only cybersecurity specialists but also end users benefit from security applications such as firewalls, vulnerability scanners, and intrusion detection systems. Recent approaches use large language models (LLMs) to rewrite brief, technical security alerts into intuitive language and suggest actionable measures, helping everyday users understand and respond appropriately to security risks. However, it remains an open question how well such alerts are explained to users. LLM outputs can also be hallucinated, inconsistent, or misleading. In this work, we introduce the Human-Centered Security Alert Evaluation Framework (HCSAEF). HCSAEF assesses LLM-generated cybersecurity notifications to support researchers who want to compare notifications generated for everyday users, improve them, or analyze the capabilities of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMental Health via Writing · Spam and Phishing Detection · Personal Information Management and User Behavior