VERITAS-NLI : Validation and Extraction of Reliable Information Through   Automated Scraping and Natural Language Inference

Arjun Shah; Hetansh Shah; Vedica Bafna; Charmi Khandor; Sindhu Nair

arXiv:2410.09455·cs.CL·October 15, 2024

VERITAS-NLI : Validation and Extraction of Reliable Information Through Automated Scraping and Natural Language Inference

Arjun Shah, Hetansh Shah, Vedica Bafna, Charmi Khandor, Sindhu Nair

PDF

Open Access 1 Repo

TL;DR

This paper introduces VERITAS-NLI, a system combining web-scraping and Natural Language Inference to verify news headlines, significantly improving fake news detection accuracy over traditional ML and BERT models.

Contribution

The paper presents a novel approach that integrates web-scraping with NLI models for external knowledge retrieval to verify headlines, addressing generalization issues of existing models.

Findings

01

Achieved 84.3% accuracy in fake news detection

02

Outperformed classical ML models by 33.3%

03

Surpassed BERT by 31.0% in accuracy

Abstract

In today's day and age where information is rapidly spread through online platforms, the rise of fake news poses an alarming threat to the integrity of public discourse, societal trust, and reputed news sources. Classical machine learning and Transformer-based models have been extensively studied for the task of fake news detection, however they are hampered by their reliance on training data and are unable to generalize on unseen headlines. To address these challenges, we propose our novel solution, leveraging web-scraping techniques and Natural Language Inference (NLI) models to retrieve external knowledge necessary for verifying the accuracy of a headline. Our system is evaluated on a diverse self-curated evaluation dataset spanning over multiple news channels and broad domains. Our best performing pipeline achieves an accuracy of 84.3% surpassing the best classical Machine Learning…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

arjun-254/veritas-nli
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling