Verifying Urdu news authenticity using deep learning with concatenated BERT and GloVe embedding

Asif Feroz; Waseem Abbasi; Muhammad Zeeshan Babar; Abeer Aljohani

PMC · DOI:10.1038/s41598-026-36771-0·February 5, 2026

Verifying Urdu news authenticity using deep learning with concatenated BERT and GloVe embedding

Asif Feroz, Waseem Abbasi, Muhammad Zeeshan Babar, Abeer Aljohani

PDF

Open Access

TL;DR

This paper presents a deep learning model using BERT and GloVe embeddings to detect fake news in Urdu, achieving high accuracy.

Contribution

A novel Urdu fake news detection system using concatenated BERT and GloVe embeddings with XLM-RoBERTa.

Findings

01

XLM-RoBERTa with concatenated GloVe achieved an F1 score of 0.956 and accuracy of 0.962.

02

The model outperformed other deep learning and machine learning approaches for Urdu fake news detection.

Abstract

Fake news has a significant effect on reader perceptions and is therefore a serious problem. It is difficult to differentiate between fake and real news as the number of News platforms on social media are growing daily. This work aims to create a complete fake news detection mechanism for Pakistani news by using many fact-checked APIs. A total of 14,178 real and fake news items from fifteen different regions and sections of reputable and authoritative Urdu newspapers and news broadcasting websites in Pakistan are included in the dataset. We evaluate the dataset via three deep learning models RoBERTa, XLM-RoBERTa and mBERT are embedded with concatenated BERT and GloVe. For the training of the selected models, we first use an extensive multilingual dataset. The results of the proposed models are as follows: evaluated via performance metrics such as the F1 score, accuracy, precision and…

Linked entities

Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.

Genes2

F3 NINL

Proteins2

Species1

Homo sapiens(human · species)

Chemicals1

BERT

Diseases2

FND DL

Figures14

Click any figure to enlarge with its caption.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMisinformation and Its Impacts · Spam and Phishing Detection · Big Data and Digital Economy