MMCFND: Multimodal Multilingual Caption-aware Fake News Detection for Low-resource Indic Languages
Shubhi Bansal, Nishit Sushil Singh, Shahid Shafi Dar, Nagendra Kumar

TL;DR
This paper introduces a new multimodal, multilingual dataset and a caption-aware framework for fake news detection in low-resource Indic languages, leveraging visual and textual cues to improve detection accuracy.
Contribution
The paper presents MMIFND, described as the first curated multimodal dataset for fake news detection in Indic languages, and MMCFND, a caption-aware detection framework that uses pre-trained encoders to improve fake news identification.
Findings
The proposed MMCFND framework outperforms existing methods in fake news detection accuracy.
The dataset facilitates research in low-resource multilingual fake news detection.
Caption generation enhances the detection of manipulative content.
Abstract
The widespread dissemination of false information through manipulative tactics that combine deceptive text and images threatens the integrity of reliable sources of information. While there has been research on detecting fake news in high-resource languages using multimodal approaches, methods for low-resource Indic languages primarily rely on textual analysis. This difference highlights the need for robust methods that specifically address multimodal fake news in Indic languages, where the lack of extensive datasets and tools presents a significant obstacle to progress. To this end, we introduce the Multimodal Multilingual dataset for Indic Fake News Detection (MMIFND). This meticulously curated dataset consists of 28,085 instances distributed across Hindi, Bengali, Marathi, Malayalam, Tamil, Gujarati and Punjabi. We further propose the Multimodal Multilingual Caption-aware framework…
Taxonomy
Topics: Misinformation and Its Impacts · Topic Modeling · Multimodal Machine Learning Applications
