The Ubiquity of Large Graphs and Surprising Challenges of Graph Processing: Extended Survey
Siddhartha Sahu, Amine Mhedhbi, Semih Salihoglu, Jimmy Lin, M. Tamer, \"Ozsu

TL;DR
This extensive survey reveals that real-world graph processing involves diverse, large datasets with major challenges in scalability and visualization, and highlights popular applications like data integration and fraud detection.
Contribution
The paper provides the first comprehensive empirical study on practical graph usage, identifying key challenges and application domains in real-world graph processing.
Findings
Graphs are highly diverse and often very large in practice.
Scalability and visualization are the most pressing challenges.
Popular applications include data integration, recommendations, and fraud detection.
Abstract
Graph processing is becoming increasingly prevalent across many application domains. In spite of this prevalence, there is little research about how graphs are actually used in practice. We performed an extensive study that consisted of an online survey of 89 users, a review of the mailing lists, source repositories, and whitepapers of a large suite of graph software products, and in-person interviews with 6 users and 2 developers of these products. Our online survey aimed at understanding: (i) the types of graphs users have; (ii) the graph computations users run; (iii) the types of graph software users use; and (iv) the major challenges users face when processing their graphs. We describe the participants' responses to our questions highlighting common patterns and challenges. Based on our interviews and survey of the rest of our sources, we were able to answer some new questions that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
