Exploring the evolution of research topics during the COVID-19 pandemic
Francesco Invernici, Anna Bernasconi, Stefano Ceri

TL;DR
This paper introduces CORToViz, a visualization tool for analyzing the evolution of research topics in the extensive CORD-19 scientific corpus, utilizing advanced clustering, temporal mining, and interactive visualization techniques.
Contribution
The paper presents a novel method and visualization tool for exploring research topic dynamics in large scientific corpora, adaptable to various textual datasets.
Findings
Effective clustering of scientific articles by topics
Visualization of topic trends over time
Statistical testing for topic significance
Abstract
The COVID-19 pandemic has changed the research agendas of most scientific communities, resulting in an overwhelming production of research articles in a variety of domains, including medicine, virology, epidemiology, economy, psychology, and so on. Several open-access corpora and literature hubs were established; among them, the COVID-19 Open Research Dataset (CORD-19) has systematically gathered scientific contributions for 2.5 years, by collecting and indexing over one million articles. Here, we present the CORD-19 Topic Visualizer (CORToViz), a method and associated visualization tool for inspecting the CORD-19 textual corpus of scientific abstracts. Our method is based upon a careful selection of up-to-date technologies (including large language models), resulting in an architecture for clustering articles along orthogonal dimensions and extraction techniques for temporal topic…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsData Visualization and Analytics · Data Analysis with R
