TL;DR
This study systematically examines whether average-case analyses of graph algorithms on simple models are externally valid for real-world networks, finding that model-based performance often translates well and is influenced mainly by heterogeneity and locality.
Contribution
It provides the first systematic evaluation of external validity for average-case graph algorithm analyses on real-world networks.
Findings
Model-based algorithm performance correlates well with real-world networks.
Heterogeneity and locality are key factors affecting algorithm performance.
Performance differences are primarily driven by these network properties.
Abstract
The number one criticism of average-case analysis is that we do not actually know the probability distribution of real-world inputs. Thus, analyzing an algorithm on some random model has no implications for practical performance. At its core, this criticism doubts the existence of external validity, i.e., it assumes that algorithmic behavior on the somewhat simple and clean models does not translate beyond the models to practical performance real-world input. With this paper, we provide a first step towards studying the question of external validity systematically. To this end, we evaluate the performance of six graph algorithms on a collection of 2740 sparse real-world networks depending on two properties; the heterogeneity (variance in the degree distribution) and locality (tendency of edges to connect vertices that are already close). We compare this with the performance on generated…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
