Multimodal Large Language Models for Phishing Webpage Detection and   Identification

Jehyun Lee; Peiyuan Lim; Bryan Hooi; Dinil Mon Divakaran

arXiv:2408.05941·cs.CR·August 13, 2024·2 cites

Multimodal Large Language Models for Phishing Webpage Detection and Identification

Jehyun Lee, Peiyuan Lim, Bryan Hooi, Dinil Mon Divakaran

PDF

Open Access 1 Repo

TL;DR

This paper explores the use of multimodal large language models to detect phishing webpages by identifying brands and verifying domains, achieving high accuracy and robustness with interpretability.

Contribution

It introduces a novel two-phase LLM-based system for phishing detection that outperforms existing brand-based methods and offers interpretability and robustness.

Findings

01

High detection rate at high precision

02

Outperforms state-of-the-art brand-based systems

03

Robust against adversarial attacks

Abstract

To address the challenging problem of detecting phishing webpages, researchers have developed numerous solutions, in particular those based on machine learning (ML) algorithms. Among these, brand-based phishing detection that uses models from Computer Vision to detect if a given webpage is imitating a well-known brand has received widespread attention. However, such models are costly and difficult to maintain, as they need to be retrained with labeled dataset that has to be regularly and continuously collected. Besides, they also need to maintain a good reference list of well-known websites and related meta-data for effective performance. In this work, we take steps to study the efficacy of large language models (LLMs), in particular the multimodal LLMs, in detecting phishing webpages. Given that the LLMs are pretrained on a large corpus of data, we aim to make use of their…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

jehleekr/multimodal_llm_phishing_detection
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSpam and Phishing Detection · Misinformation and Its Impacts · Sentiment Analysis and Opinion Mining