"Explain, Don't Just Warn!" -- A Real-Time Framework for Generating Phishing Warnings with Contextual Cues
Sayak Saha Roy, Cesar Torres, Shirin Nilizadeh

TL;DR
PhishXplain is a real-time, explainable phishing warning system that enhances user understanding and detection accuracy by providing contextual cues and visual explanations, outperforming traditional generic warnings.
Contribution
This work introduces PhishXplain, a novel real-time framework that generates contextual, explainable phishing warnings using a structured prompt architecture and a lightweight language model.
Findings
94% of phishing sites detected with 96% correctness
Participants with explainable warnings showed better phishing detection skills
Users reported higher satisfaction and trust with PhishXplain warnings
Abstract
Anti-phishing tools typically display generic warnings that offer users limited explanation on why a website is considered malicious, which can prevent end-users from developing the mental models needed to recognize phishing cues on their own. This becomes especially problematic when these tools inevitably fail - particularly against evasive threats, and users are found to be ill-equipped to identify and avoid them independently. To address these limitations, we present PhishXplain (PXP), a real-time explainable phishing warning system designed to augment existing detection mechanisms. PXP empowers users by clearly articulating why a site is flagged as malicious, highlighting suspicious elements using a memory-efficient implementation of LLaMA 3.2. It utilizes a structured two-step prompt architecture to identify phishing features, generate contextual explanations, and render annotated…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpam and Phishing Detection · Misinformation and Its Impacts · Advanced Malware Detection Techniques
