Collab: Fostering Critical Identification of Deepfake Videos on Social Media via Synergistic Annotation
Shuning Zhang, Linzhi Wang, Shixuan Li, Yuanyuan Wu, Yuwei Chuai, Luoxi Chen, Xin Yi, Hewu Li

TL;DR
Collab is a web tool that enables collaborative annotation of deepfake videos, improving user detection accuracy and fostering critical evaluation through innovative aggregation and demonstration strategies.
Contribution
The paper introduces Collab, a novel web plugin with confidence-weighted aggregation and hierarchical demonstrations to enhance deepfake detection and user criticality.
Findings
Collab significantly improved deepfake identification accuracy.
Participants showed increased critical reflection with Collab.
The aggregation method outperformed non-aggregation approaches.
Abstract
Identifying deepfake videos on social media platforms is challenged by dynamic spatio-temporal artifacts and inadequate user tools. This hinders both critical viewing by users and scalable moderation on platforms. Here, we present Collab, a web plugin enabling users to collaboratively annotate deepfake videos. Collab integrates three key components: (i) an intuitive interface for spatio-temporal labeling where users provide confidence scores and rationales, facilitating detailed input even from non-experts, (ii) a novel confidence-weighted spatio-temporal Intersection-over-Union (IoU) algorithm to aggregate diverse user annotations into accurate aggregations, and (iii) a hierarchical demonstration strategy presenting aggregated results to guide attention toward contentious regions and foster critical evaluation. A seven-day online study (N=90), where participants annotated suspicious…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Emotion and Mood Recognition · Multimodal Machine Learning Applications
