LLMs are One-Shot URL Classifiers and Explainers
Fariza Rashid, Nishavi Ranaweera, Ben Doyle, Suranga Seneviratne

TL;DR
This paper explores using large language models with one-shot learning and Chain-of-Thought reasoning to classify URLs as benign or phishing, providing natural language explanations and achieving performance comparable to supervised models.
Contribution
It introduces a novel LLM-based one-shot URL classification framework that includes explanation generation, addressing generalisation and interpretability issues in cyber security.
Findings
GPT 4-Turbo achieved the best performance among tested LLMs.
LLM explanations aligned well with supervised classifier explanations.
High readability and informativeness of LLM-generated explanations.
Abstract
Malicious URL classification represents a crucial aspect of cyber security. Although existing work comprises numerous machine learning and deep learning-based URL classification models, most suffer from generalisation and domain-adaptation issues arising from the lack of representative training datasets. Furthermore, these models fail to provide explanations for a given URL classification in natural human language. In this work, we investigate and demonstrate the use of Large Language Models (LLMs) to address this issue. Specifically, we propose an LLM-based one-shot learning framework that uses Chain-of-Thought (CoT) reasoning to predict whether a given URL is benign or phishing. We evaluate our framework using three URL datasets and five state-of-the-art LLMs and show that one-shot LLM prompting indeed provides performances close to supervised models, with GPT 4-Turbo being the best…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Web Data Mining and Analysis · Spam and Phishing Detection
MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Attention Is All You Need · Linear Layer · Cosine Annealing · Multi-Head Attention · Weight Decay · Linear Warmup With Cosine Annealing · Adam · Residual Connection · Byte Pair Encoding
