Multimodal Detection of Fake Reviews using BERT and ResNet-50
Suhasnadh Reddy Veluru, Sai Teja Erukude, Viswa Chaitanya Marella

TL;DR
This paper introduces a multimodal fake review detection system combining textual analysis with BERT and visual analysis with ResNet-50, significantly improving accuracy over unimodal methods in identifying deceptive reviews.
Contribution
It presents a novel multimodal framework that fuses textual and visual features for more effective fake review detection, supported by a new dataset and superior experimental results.
Findings
Multimodal model achieves an F1-score of 0.934.
Outperforms unimodal baseline models.
Detects subtle inconsistencies in reviews.
Abstract
In the current digital commerce landscape, user-generated reviews play a critical role in shaping consumer behavior, product reputation, and platform credibility. However, the proliferation of fake or misleading reviews often generated by bots, paid agents, or AI models poses a significant threat to trust and transparency within review ecosystems. Existing detection models primarily rely on unimodal, typically textual, data and therefore fail to capture semantic inconsistencies across different modalities. To address this gap, a robust multimodal fake review detection framework is proposed, integrating textual features encoded with BERT and visual features extracted using ResNet-50. These representations are fused through a classification head to jointly predict review authenticity. To support this approach, a curated dataset comprising 21,142 user-uploaded images across food delivery,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpam and Phishing Detection · Misinformation and Its Impacts · Hate Speech and Cyberbullying Detection
