SocialDF: Benchmark Dataset and Detection Model for Mitigating Harmful Deepfake Content on Social Media Platforms

Arnesh Batra; Anushk Kumar; Jashn Khemani; Arush Gumber; Arhan Jain; Somil Gupta

arXiv:2506.05538·cs.LG·June 9, 2025

SocialDF: Benchmark Dataset and Detection Model for Mitigating Harmful Deepfake Content on Social Media Platforms

Arnesh Batra, Anushk Kumar, Jashn Khemani, Arush Gumber, Arhan Jain, Somil Gupta

PDF

Open Access 1 Repo

TL;DR

This paper introduces SocialDF, a comprehensive dataset of real-world deepfakes on social media, and a novel multi-factor detection model leveraging large language models to improve deepfake identification.

Contribution

The paper presents a new benchmark dataset, SocialDF, and a multi-modal detection approach using LLMs, advancing deepfake detection capabilities on social media platforms.

Findings

01

SocialDF covers diverse real-world deepfakes from online sources.

02

The LLM-based detection model effectively combines multiple verification factors.

03

Results demonstrate improved accuracy over existing detection methods.

Abstract

The rapid advancement of deep generative models has significantly improved the realism of synthetic media, presenting both opportunities and security challenges. While deepfake technology has valuable applications in entertainment and accessibility, it has emerged as a potent vector for misinformation campaigns, particularly on social media. Existing detection frameworks struggle to distinguish between benign and adversarially generated deepfakes engineered to manipulate public perception. To address this challenge, we introduce SocialDF, a curated dataset reflecting real-world deepfake challenges on social media platforms. This dataset encompasses high-fidelity deepfakes sourced from various online ecosystems, ensuring broad coverage of manipulative techniques. We propose a novel LLM-based multi-factor detection approach that combines facial recognition, automated speech transcription,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

arnesh2212/SocialDF
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · Hate Speech and Cyberbullying Detection · Adversarial Robustness in Machine Learning