TL;DR
This paper introduces TT-DF, a large-scale diffusion-based dataset for human body forgery detection, along with a novel detection model, addressing the lack of datasets and methods in this domain.
Contribution
The paper presents the first large-scale human body forgery dataset (TT-DF) and a new detection model (TOF-Net) that exploits spatiotemporal inconsistencies for improved detection.
Findings
TOF-Net outperforms existing facial forgery detection models on TT-DF.
TT-DF includes diverse forgery methods and configurations for comprehensive training.
The dataset and model enhance the detection of human body forgeries in real-world scenarios.
Abstract
The emergence and popularity of facial deepfake methods spur the vigorous development of deepfake datasets and facial forgery detection, which to some extent alleviates the security concerns about facial-related artificial intelligence technologies. However, when it comes to human body forgery, there has been a persistent lack of datasets and detection methods, due to the later inception and complexity of human body generation methods. To mitigate this issue, we introduce TikTok-DeepFake (TT-DF), a novel large-scale diffusion-based dataset containing 6,120 forged videos with 1,378,857 synthetic frames, specifically tailored for body forgery detection. TT-DF offers a wide variety of forgery methods, involving multiple advanced human image animation models utilized for manipulation, two generative configurations based on the disentanglement of identity and pose information, as well as…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
