Verifying Urdu news authenticity using deep learning with concatenated BERT and GloVe embedding
Asif Feroz, Waseem Abbasi, Muhammad Zeeshan Babar, Abeer Aljohani

TL;DR
This paper presents a deep learning model using BERT and GloVe embeddings to detect fake news in Urdu, achieving high accuracy.
Contribution
A novel Urdu fake news detection system using concatenated BERT and GloVe embeddings with XLM-RoBERTa.
Findings
XLM-RoBERTa with concatenated GloVe achieved an F1 score of 0.956 and accuracy of 0.962.
The model outperformed other deep learning and machine learning approaches for Urdu fake news detection.
Abstract
Fake news has a significant effect on reader perceptions and is therefore a serious problem. It is difficult to differentiate between fake and real news as the number of News platforms on social media are growing daily. This work aims to create a complete fake news detection mechanism for Pakistani news by using many fact-checked APIs. A total of 14,178 real and fake news items from fifteen different regions and sections of reputable and authoritative Urdu newspapers and news broadcasting websites in Pakistan are included in the dataset. We evaluate the dataset via three deep learning models RoBERTa, XLM-RoBERTa and mBERT are embedded with concatenated BERT and GloVe. For the training of the selected models, we first use an extensive multilingual dataset. The results of the proposed models are as follows: evaluated via performance metrics such as the F1 score, accuracy, precision and…
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Click any figure to enlarge with its caption.
Figure 10
Figure 11
Figure 12
Figure 13
Figure 1
Figure 2
Figure 3
Figure 4
Figure 5
Figure 6
Figure 7
Figure 8
Figure 9
Figure 14Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMisinformation and Its Impacts · Spam and Phishing Detection · Big Data and Digital Economy
