A Comparative Study of Hybrid Models in Health Misinformation Text   Classification

Mkululi Sikosana; Oluwaseun Ajao; Sean Maudsley-Barton

arXiv:2410.06311·cs.IR·October 10, 2024

A Comparative Study of Hybrid Models in Health Misinformation Text Classification

Mkululi Sikosana, Oluwaseun Ajao, Sean Maudsley-Barton

PDF

TL;DR

This study compares machine learning, deep learning, and hybrid models for detecting COVID-19 health misinformation on social media, finding that deep learning and hybrid models outperform traditional ML classifiers in accuracy and robustness.

Contribution

It provides a comprehensive evaluation of various ML and DL models, including hybrid architectures, on a COVID-19 misinformation dataset, highlighting the superior performance of DL and hybrid models.

Findings

01

DL models with Word2Vec embeddings exceed 98% in all metrics

02

Hybrid CNN+LSTM models outperform pretrained models like BERT and RoBERTa

03

SVM achieves a 94.41% F1-score in misinformation detection

Abstract

This study evaluates the effectiveness of machine learning (ML) and deep learning (DL) models in detecting COVID-19-related misinformation on online social networks (OSNs), aiming to develop more effective tools for countering the spread of health misinformation during the pan-demic. The study trained and tested various ML classifiers (Naive Bayes, SVM, Random Forest, etc.), DL models (CNN, LSTM, hybrid CNN+LSTM), and pretrained language models (DistilBERT, RoBERTa) on the "COVID19-FNIR DATASET". These models were evaluated for accuracy, F1 score, recall, precision, and ROC, and used preprocessing techniques like stemming and lemmatization. The results showed SVM performed well, achieving a 94.41% F1-score. DL models with Word2Vec embeddings exceeded 98% in all performance metrics (accuracy, F1 score, recall, precision & ROC). The CNN+LSTM hybrid models also exceeded 98% across…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Linear Layer · Softmax · Multi-Head Attention · Layer Normalization · Dense Connections · Adam · WordPiece · Attention Dropout · Residual Connection