EVOSCAT: Exploring Software Change Dynamics in Large-Scale Historical Datasets
Souhaila Serbout, Diana Carolina Muñoz Hurtado, Hassan Atwi, Edoardo Riggio, Cesare Pautasso

TL;DR
EvoScat is a visualization tool designed to help researchers analyze large-scale software evolution datasets by providing interactive, scalable visualizations that reveal change dynamics over time across numerous artifacts.
Contribution
The paper introduces EvoScat, a novel interactive visualization tool that enables scalable analysis of large software change datasets from open source repositories.
Findings
Supports visualization of millions of events across thousands of artifacts.
Allows tailored analysis such as change pace comparison and clone detection.
Facilitates exploration of software evolution in large datasets.
Abstract
Long-lived software projects encompass a large number of artifacts, which undergo many revisions throughout their history. Empirical software engineering researchers studying software evolution collect datasets with millions of events, representing changes introduced to specific artifacts. In this paper, we propose EvoScat, a tool that attempts to address temporal scalability through the use of interactive density scatterplots, providing a global overview of large historical datasets mined from open source repositories in a single visualization. EvoScat intends to provide researchers with a means to produce scalable visualizations that can help them explore and characterize evolution datasets, as well as compare the histories of individual artifacts, both in terms of 1) observing how rapidly different artifacts age over multiple-year-long time spans 2) how often metrics…
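To make the density scatterplot idea concrete, here is a minimal Python sketch of the general technique: events are aggregated into (artifact, time-bin) cells, and each cell's count can then be drawn as a color intensity instead of plotting every event as a separate point. This is not EvoScat's implementation. The function name `density_grid`, the `(artifact_id, timestamp)` event format, and the sample data are assumptions made for illustration only.

```python
from collections import Counter


def density_grid(events, time_bins, t_min, t_max):
    """Count events per (artifact, time-bin) cell.

    events: iterable of (artifact_id, timestamp) pairs (hypothetical format).
    Returns a Counter mapping (artifact_id, bin_index) -> number of events.
    """
    if time_bins <= 0:
        raise ValueError("time_bins must be positive")
    if t_max <= t_min:
        raise ValueError("t_max must be greater than t_min")
    width = (t_max - t_min) / time_bins
    grid = Counter()
    for artifact_id, ts in events:
        if not (t_min <= ts <= t_max):
            continue  # ignore events outside the plotted time window
        # Clamp so that an event exactly at t_max lands in the last bin.
        bin_index = min(int((ts - t_min) / width), time_bins - 1)
        grid[(artifact_id, bin_index)] += 1
    return grid


# Illustrative data only: two artifacts with timestamps in arbitrary units.
events = [
    ("a.yaml", 0),
    ("a.yaml", 5),
    ("a.yaml", 95),
    ("b.yaml", 50),
    ("b.yaml", 100),
]
grid = density_grid(events, time_bins=10, t_min=0, t_max=100)
```

Because the grid size depends on the number of artifacts and time bins rather than on the number of events, the rendering cost stays bounded as the dataset grows, which is the property that makes this kind of plot suitable for a single global overview.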
