Time-Aware Datasets are Adaptive Knowledgebases for the New Normal
Abhijit Suprem, Sanjyot Vaidya, Joao Eduardo Ferreira, Calton Pu

TL;DR
This paper introduces time-aware misinformation datasets, including COVID-TAD, to better capture evolving misinformation phenomena, significantly improving classifier accuracy in detecting misinformation over time.
Contribution
It presents the first large-scale, multi-snapshot COVID-19 misinformation dataset and demonstrates the benefits of incorporating time-awareness into misinformation detection models.
Findings
Time-awareness improves misinformation classifier accuracy.
COVID-TAD is the largest multi-snapshot misinformation dataset.
Incorporating temporal data captures evolving misinformation trends.
Abstract
Recent advances in text classification and knowledge capture in language models have relied on availability of large-scale text datasets. However, language models are trained on static snapshots of knowledge and are limited when that knowledge evolves. This is especially critical for misinformation detection, where new types of misinformation continuously appear, replacing old campaigns. We propose time-aware misinformation datasets to capture time-critical phenomena. In this paper, we first present evidence of evolving misinformation and show that incorporating even simple time-awareness significantly improves classifier accuracy. Second, we present COVID-TAD, a large-scale COVID-19 misinformation da-taset spanning 25 months. It is the first large-scale misinformation dataset that contains multiple snapshots of a datastream and is orders of magnitude bigger than related misinformation…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMisinformation and Its Impacts · Topic Modeling · Multimodal Machine Learning Applications
