Strengthening False Information Propagation Detection: Leveraging SVM and Sophisticated Text Vectorization Techniques in comparison to BERT
Ahmed Akib Jawad Karim, Kazi Hafiz Md Asad, Aznur Azam

TL;DR
This paper compares machine learning techniques, specifically SVM with various text vectorizations and BERT, for detecting fake news, finding that SVM with BoW performs nearly as well as BERT but with lower computational costs.
Contribution
The study provides a comprehensive comparison of SVM and BERT for fake news detection using multiple text vectorization methods, highlighting the competitive performance of SVM with BoW.
Findings
BERT achieves 99.98% accuracy and 0.9998 F1-score.
SVM with BoW achieves 99.81% accuracy and 0.9980 F1-score.
SVM with BoW offers a computationally efficient alternative to BERT.
Abstract
The rapid spread of misinformation, particularly through online platforms, underscores the urgent need for reliable detection systems. This study explores the utilization of machine learning and natural language processing, specifically Support Vector Machines (SVM) and BERT, to detect fake news. We employ three distinct text vectorization methods for SVM: Term Frequency Inverse Document Frequency (TF-IDF), Word2Vec, and Bag of Words (BoW), evaluating their effectiveness in distinguishing between genuine and fake news. Additionally, we compare these methods against the transformer large language model, BERT. Our comprehensive approach includes detailed preprocessing steps, rigorous model implementation, and thorough evaluation to determine the most effective techniques. The results demonstrate that while BERT achieves superior accuracy with 99.98% and an F1-score of 0.9998, the SVM…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMisinformation and Its Impacts
MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Attention Is All You Need · Linear Layer · Multi-Head Attention · Attention Dropout · Dense Connections · Adam · Residual Connection · Weight Decay · Support Vector Machine
