A New Data Integration Framework for Covid-19 Social Media Information
Lauren Ansell, Luciana Dalla Valle

TL;DR
This paper introduces a novel data integration framework using vine copulas to combine structured official Covid-19 data with unstructured social media insights, improving pandemic modeling accuracy.
Contribution
It presents a new methodology for integrating diverse data sources, enhancing Covid-19 trend assessment beyond traditional single-source approaches.
Findings
Combined data improves pandemic trend accuracy
Social media data adds valuable insights
Vine copulas effectively model dependencies
Abstract
The Covid-19 pandemic presents a serious threat to people health, resulting in over 250 million confirmed cases and over 5 million deaths globally. To reduce the burden on national health care systems and to mitigate the effects of the outbreak, accurate modelling and forecasting methods for short- and long-term health demand are needed to inform government interventions aiming at curbing the pandemic. Current research on Covid-19 is typically based on a single source of information, specifically on structured historical pandemic data. Other studies are exclusively focused on unstructured online retrieved insights, such as data available from social media. However, the combined use of structured and unstructured information is still uncharted. This paper aims at filling this gap, by leveraging historical and social media information with a novel data integration methodology. The…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMisinformation and Its Impacts · Data-Driven Disease Surveillance · Data Stream Mining Techniques
