Improving the Graph Challenge Reference Implementation
Inna Voloshchuk, Hayden Jananthan, Chansup Byun, Jeremy Kepner

TL;DR
This paper refactors and benchmarks a section of the Graph Challenge reference implementation, significantly reducing code size and enhancing clarity, adaptability, and performance for large-scale graph data analysis.
Contribution
It presents a streamlined, efficient, and scalable refactoring of the Graph Challenge reference code, improving usability and performance for large-scale graph processing.
Findings
67% reduction in code size
Achieved scalable performance for large traffic matrices
Enhanced clarity and adaptability of the implementation
Abstract
The MIT/IEEE/Amazon Graph Challenge provides a venue for individuals and teams to showcase new innovations in large-scale graph and sparse data analysis. The Anonymized Network Sensing Graph Challenge processes over 100 billion network packets to construct privacy-preserving traffic matrices, with a GraphBLAS reference implementation demonstrating how hypersparse matrices can be applied to this problem. This work presents a refactoring and benchmarking of a section of the reference code to improve clarity, adaptability, and performance. The original Python implementation spanning approximately 1000 lines across 3 files has been streamlined to 325 lines across two focused modules, achieving a 67% reduction in code size while maintaining full functionality. Using pMatlab and pPython distributed array programming libraries, the addition of parallel maps allowed for parallel benchmarking of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSoftware-Defined Networks and 5G · Advanced Graph Neural Networks · Graph Theory and Algorithms
