A Survey of Datasets for Information Diffusion Tasks
Fuxia Guo, Xiaowen Wang, Yanwei Xie, Zehao Wang, Jingqiu Li, Lanjun Wang

TL;DR
This paper systematically categorizes information diffusion tasks and datasets using the 5W Model, providing a comprehensive taxonomy, dataset repository, and analysis to advance research in social media information spread.
Contribution
It introduces a systematic taxonomy of diffusion tasks and compiles a publicly accessible dataset repository based on the 5W Model framework.
Findings
Categorized 10 subtasks of information diffusion
Compared datasets based on six key attributes
Identified limitations and future research directions
Abstract
Information diffusion across various new media platforms gradually influences perceptions, decisions, and social behaviors of individual users. In communication studies, the famous Five W's of Communication model (5W Model) has displayed the process of information diffusion clearly. At present, although plenty of studies and corresponding datasets about information diffusion have emerged, a systematic categorization of tasks and an integration of datasets are still lacking. To address this gap, we survey a systematic taxonomy of information diffusion tasks and datasets based on the "5W Model" framework. We first categorize the information diffusion tasks into ten subtasks with definitions and datasets analysis, from three main tasks of information diffusion prediction, social bot detection, and misinformation detection. We also collect the publicly available dataset repository of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications
MethodsDiffusion
