Misinformation Detection in Social Media Video Posts
Kehan Wang, David Chan, Seth Z. Zhao, John Canny, Avideh Zakhor

TL;DR
This paper introduces new multimodal methods leveraging contrastive learning and masked language modeling to detect semantic misinformation in social media videos, addressing data scarcity and outperforming existing techniques.
Contribution
Develops two novel multimodal detection methods for misinformation in social media videos using contrastive learning and masked language modeling.
Findings
Outperforms state-of-the-art methods on artificial and real-world datasets
Uses self-supervised learning to handle data scarcity
Introduces a large-scale dataset of 160,000 video posts
Abstract
With the growing adoption of short-form video by social media platforms, reducing the spread of misinformation through video posts has become a critical challenge for social media providers. In this paper, we develop methods to detect misinformation in social media posts, exploiting modalities such as video and text. Due to the lack of large-scale public data for misinformation detection in multi-modal datasets, we collect 160,000 video posts from Twitter, and leverage self-supervised learning to learn expressive representations of joint visual and textual data. In this work, we propose two new methods for detecting semantic inconsistencies within short-form social media video posts, based on contrastive learning and masked language modeling. We demonstrate that our new approaches outperform current state-of-the-art methods on both artificial data generated by random-swapping of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMisinformation and Its Impacts · Spam and Phishing Detection · Topic Modeling
MethodsContrastive Learning
