On the Dependency Heaviness of CRAN/Bioconductor Ecosystem
Zuguang Gu

TL;DR
This paper introduces a new metric called 'dependency heaviness' to analyze the complexity and spread of package dependencies in the CRAN/Bioconductor ecosystem, revealing key packages and dependency paths.
Contribution
It proposes a novel dependency heaviness metric and systematically studies its distribution and impact within the R package ecosystem, supported by a web-based database tool.
Findings
Identified key packages transmitting heavy dependencies
Mapped dependency paths spreading heaviness
Developed a web-based dependency analysis tool
Abstract
The R package ecosystem is expanding fast and dependencies among packages in the ecosystem are becoming more complex. In this study, we explored the package dependencies from a new aspect. We applied a new metric named "dependency heaviness" which measures the number of additional strong dependencies that a package uniquely contributes to its child or downstream packages. It also measures the total reduced dependencies in the ecosystem when the role of a package is changed from a strong parent to a weak parent. We systematically studied how the dependency heaviness spreads from parent to child packages, and how it further spreads to remote downstream packages in the CRAN/Bioconductor ecosystem. We extracted top packages and key paths that majorly transmit heavy dependencies in the ecosystem. Additionally, the dependency heaviness analysis on the ecosystem has been implemented as a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComplex Network Analysis Techniques · Scientific Computing and Data Management · Advanced Data Storage Technologies
