Multimodal Large Language Models for Phishing Webpage Detection and Identification
Jehyun Lee, Peiyuan Lim, Bryan Hooi, Dinil Mon Divakaran

TL;DR
This paper explores the use of multimodal large language models to detect phishing webpages by identifying brands and verifying domains, achieving high accuracy and robustness with interpretability.
Contribution
It introduces a novel two-phase LLM-based system for phishing detection that outperforms existing brand-based methods and offers interpretability and robustness.
Findings
High detection rate at high precision
Outperforms state-of-the-art brand-based systems
Robust against adversarial attacks
Abstract
To address the challenging problem of detecting phishing webpages, researchers have developed numerous solutions, in particular those based on machine learning (ML) algorithms. Among these, brand-based phishing detection that uses models from Computer Vision to detect if a given webpage is imitating a well-known brand has received widespread attention. However, such models are costly and difficult to maintain, as they need to be retrained with labeled dataset that has to be regularly and continuously collected. Besides, they also need to maintain a good reference list of well-known websites and related meta-data for effective performance. In this work, we take steps to study the efficacy of large language models (LLMs), in particular the multimodal LLMs, in detecting phishing webpages. Given that the LLMs are pretrained on a large corpus of data, we aim to make use of their…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpam and Phishing Detection · Misinformation and Its Impacts · Sentiment Analysis and Opinion Mining
