Structural and Connectivity Patterns in the Maven Central Software Dependency Network
Daniel Ogenrwot, John Businge, Shaikh Arifuzzaman

TL;DR
This study analyzes the Maven Central Java library ecosystem's dependency network, revealing a scale-free, small-world structure with key hubs that enable reuse but also pose systemic risks.
Contribution
It provides a large-scale network analysis of Maven Central, identifying its topology and critical hubs using advanced graph metrics, which was not previously characterized in detail.
Findings
Maven Central has a scale-free, small-world topology.
A few hubs support most of the ecosystem's connectivity.
Critical hubs include testing frameworks and utility libraries.
Abstract
Understanding the structural characteristics and connectivity patterns of large-scale software ecosystems is critical for enhancing software reuse, improving ecosystem resilience, and mitigating security risks. In this paper, we investigate the Maven Central ecosystem, one of the largest repositories of Java libraries, by applying network science techniques to its dependency graph. Leveraging the Goblin framework, we extracted a sample consisting of the top 5,000 highly connected artifacts based on their degree centrality and then performed breadth-first search (BFS) expansion from each selected artifact as a seed node, traversing the graph outward to capture all libraries and releases reachable those seed nodes. This sampling strategy captured the immediate structural context surrounding these libraries resulted in a curated graph comprising of 1.3 million nodes and 20.9 million edges.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
