Using BERT Encoding to Tackle the Mad-lib Attack in SMS Spam Detection
Sergio Rojas-Galeano

TL;DR
This study evaluates BERT's effectiveness in SMS spam detection against Mad-lib attacks involving synonym substitutions, demonstrating BERT's superior robustness over traditional models like BoW and TFIDF.
Contribution
It introduces a novel application of BERT for adversarial SMS spam detection and assesses its resilience against synonym substitution attacks.
Findings
BERT achieves 96% balanced accuracy on original data.
BERT maintains high accuracy (~95%) under Mad-lib attack with multiple substitutions.
Traditional models' performance drops to chance levels under attack.
Abstract
One of the stratagems used to deceive spam filters is to substitute vocables with synonyms or similar words that turn the message unrecognisable by the detection algorithms. In this paper we investigate whether the recent development of language models sensitive to the semantics and context of words, such as Google's BERT, may be useful to overcome this adversarial attack (called "Mad-lib" as per the word substitution game). Using a dataset of 5572 SMS spam messages, we first established a baseline of detection performance using widely known document representation models (BoW and TFIDF) and the novel BERT model, coupled with a variety of classification algorithms (Decision Tree, kNN, SVM, Logistic Regression, Naive Bayes, Multilayer Perceptron). Then, we built a thesaurus of the vocabulary contained in these messages, and set up a Mad-lib attack experiment in which we modified each…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpam and Phishing Detection · Advanced Malware Detection Techniques · Network Security and Intrusion Detection
MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · Attention Dropout · Weight Decay · Refunds@Expedia|||How do I get a full refund from Expedia? · Linear Warmup With Linear Decay · Residual Connection · Support Vector Machine · Dense Connections
