Can ChatGPT Detect DeepFakes? A Study of Using Multimodal Large Language   Models for Media Forensics

Shan Jia; Reilin Lyu; Kangran Zhao; Yize Chen; Zhiyuan Yan; Yan Ju,; Chuanbo Hu; Xin Li; Baoyuan Wu; Siwei Lyu

arXiv:2403.14077·cs.AI·June 12, 2024·1 cites

Can ChatGPT Detect DeepFakes? A Study of Using Multimodal Large Language Models for Media Forensics

Shan Jia, Reilin Lyu, Kangran Zhao, Yize Chen, Zhiyuan Yan, Yan Ju,, Chuanbo Hu, Xin Li, Baoyuan Wu, Siwei Lyu

PDF

Open Access 1 Repo

TL;DR

This paper explores the potential of multimodal large language models to detect AI-generated DeepFake media, demonstrating their capabilities through experiments and discussing their limitations and possible enhancements.

Contribution

It is the first to evaluate multimodal LLMs for media forensics, showing they can identify DeepFakes without specialized programming.

Findings

01

Multimodal LLMs can detect DeepFake images with careful prompt design.

02

The models reveal AI-generated media through qualitative and quantitative tests.

03

Limitations of current models are identified and discussed.

Abstract

DeepFakes, which refer to AI-generated media content, have become an increasing concern due to their use as a means for disinformation. Detecting DeepFakes is currently solved with programmed machine learning algorithms. In this work, we investigate the capabilities of multimodal large language models (LLMs) in DeepFake detection. We conducted qualitative and quantitative experiments to demonstrate multimodal LLMs and show that they can expose AI-generated images through careful experimental design and prompt engineering. This is interesting, considering that LLMs are not inherently tailored for media forensic tasks, and the process does not require programming. We discuss the limitations of multimodal LLMs for these tasks and suggest possible improvements.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

shanface33/gpt4mf_ub
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMisinformation and Its Impacts · Hate Speech and Cyberbullying Detection