LLMs are One-Shot URL Classifiers and Explainers

Fariza Rashid; Nishavi Ranaweera; Ben Doyle; Suranga Seneviratne

arXiv:2409.14306·cs.AI·September 24, 2024

LLMs are One-Shot URL Classifiers and Explainers

Fariza Rashid, Nishavi Ranaweera, Ben Doyle, Suranga Seneviratne

PDF

Open Access

TL;DR

This paper explores using large language models with one-shot learning and Chain-of-Thought reasoning to classify URLs as benign or phishing, providing natural language explanations and achieving performance comparable to supervised models.

Contribution

It introduces a novel LLM-based one-shot URL classification framework that includes explanation generation, addressing generalisation and interpretability issues in cyber security.

Findings

01

GPT 4-Turbo achieved the best performance among tested LLMs.

02

LLM explanations aligned well with supervised classifier explanations.

03

High readability and informativeness of LLM-generated explanations.

Abstract

Malicious URL classification represents a crucial aspect of cyber security. Although existing work comprises numerous machine learning and deep learning-based URL classification models, most suffer from generalisation and domain-adaptation issues arising from the lack of representative training datasets. Furthermore, these models fail to provide explanations for a given URL classification in natural human language. In this work, we investigate and demonstrate the use of Large Language Models (LLMs) to address this issue. Specifically, we propose an LLM-based one-shot learning framework that uses Chain-of-Thought (CoT) reasoning to predict whether a given URL is benign or phishing. We evaluate our framework using three URL datasets and five state-of-the-art LLMs and show that one-shot LLM prompting indeed provides performances close to supervised models, with GPT 4-Turbo being the best…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Web Data Mining and Analysis · Spam and Phishing Detection

MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Attention Is All You Need · Linear Layer · Cosine Annealing · Multi-Head Attention · Weight Decay · Linear Warmup With Cosine Annealing · Adam · Residual Connection · Byte Pair Encoding