Comparative Analysis of Black-Box and White-Box Machine Learning Model   in Phishing Detection

Abdullah Fajar; Setiadi Yazid; Indra Budi

arXiv:2412.02084·cs.CR·December 4, 2024

Comparative Analysis of Black-Box and White-Box Machine Learning Model in Phishing Detection

Abdullah Fajar, Setiadi Yazid, Indra Budi

PDF

Open Access

TL;DR

This study compares black-box and white-box machine learning models for phishing detection, evaluating their accuracy and explainability to recommend suitable approaches based on application needs.

Contribution

It provides a systematic analysis of the pros and cons of black-box and white-box models in phishing detection, validated through experiments on public datasets.

Findings

01

White-box models like EBM offer better explainability.

02

Both models perform similarly in accuracy and interpretability.

03

Model choice depends on specific application requirements.

Abstract

Background: Explainability in phishing detection model can support a further solution of phishing attack mitigation by increasing trust and understanding how phishing can be detected. Objective: The aims of this study to determine and best recommendation to apply an approach which has several components with abilities to fulfil the critical needs Methods: A methodology starting with analyzing both black-box and white-box models to get the pros and cons specifically in phishing detection. The conclusion of the analysis will be validated by experiment using a set of well-known algorithms and public phishing datasets. Experimental metrics covers 3 measurements such as predictive accuracy and explainability metrics. Conclusion: Both models are comparable in terms of interpretability and consistency, with room for improvement in diverse datasets. EBM as an example of white-box model is…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSpam and Phishing Detection

MethodsSparse Evolutionary Training · energy-based model