Automated Inference of Software Library Usage Patterns
Mohamed Aymen Saied, Ali Ouni, Houari Sahraoui, Raula Gaikovina Kula, Katsuro Inoue, David Lo

TL;DR
This paper introduces a novel hierarchical clustering approach to automatically identify common third-party library usage patterns in software systems, aiding developers in leveraging reuse opportunities more effectively.
Contribution
It presents a new method that uses hierarchical clustering to detect library usage patterns from client usage data, validated on large-scale Maven and GitHub datasets.
Findings
Detected 77% of highly consistent library usage patterns
Analyzed over 6,000 libraries and 38,000 client systems
Improved understanding of library co-usage in software development
Abstract
Modern software systems are increasingly dependent on third-party libraries. It is widely recognized that using mature and well-tested third-party libraries can improve developers' productivity, reduce time-to-market, and produce more reliable software. Today's open-source repositories provide a wide range of libraries that can be freely downloaded and used. However, as software libraries are documented separately but intended to be used together, developers are unlikely to fully take advantage of these reuse opportunities. In this paper, we present a novel approach to automatically identify third-party library usage patterns, i.e., collections of libraries that are commonly used together by developers. Our approach employs a hierarchical clustering technique to group together software libraries based on external client usage. To evaluate our approach, we mined a large set of over 6,000…
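To make the idea of grouping libraries by client usage concrete, here is a minimal sketch. It is not the authors' algorithm: it assumes a plain agglomerative clustering with complete linkage over Jaccard distance between the sets of clients that use each library. The function names, the `max_distance` threshold, and the toy usage data are all hypothetical and chosen only for illustration.

```python
from itertools import combinations


def jaccard_distance(a, b):
    """Distance between two client sets: 0.0 means identical, 1.0 means disjoint."""
    union = a | b
    if not union:
        return 0.0
    return 1.0 - len(a & b) / len(union)


def cluster_libraries(clients_by_library, max_distance=0.5):
    """Merge libraries bottom-up while the closest pair of clusters stays
    within max_distance (an assumed, illustrative threshold)."""
    # Start with every library in its own cluster.
    clusters = [frozenset([lib]) for lib in sorted(clients_by_library)]

    def linkage(c1, c2):
        # Complete linkage: the largest pairwise distance between members.
        return max(
            jaccard_distance(clients_by_library[x], clients_by_library[y])
            for x in c1
            for y in c2
        )

    while len(clusters) > 1:
        c1, c2 = min(combinations(clusters, 2), key=lambda pair: linkage(*pair))
        if linkage(c1, c2) > max_distance:
            break  # no remaining pair is similar enough to merge
        clusters = [c for c in clusters if c not in (c1, c2)] + [c1 | c2]

    # Only clusters with at least two libraries count as a co-usage pattern.
    return [set(c) for c in clusters if len(c) > 1]


# Hypothetical toy data: library name -> set of client systems that depend on it.
usage = {
    "junit": {"app1", "app2", "app3", "app4"},
    "mockito": {"app1", "app2", "app3"},
    "slf4j": {"app5", "app6"},
    "logback": {"app5", "app6"},
    "guava": {"app7"},
}

patterns = cluster_libraries(usage)
```

On this toy input, `junit` and `mockito` share three of four clients and `slf4j` and `logback` share all of theirs, so each pair forms a candidate pattern, while `guava` has no overlapping clients and stays on its own. Complete linkage is used here only because it keeps every library in a cluster close to every other one; the paper's own clustering criteria may differ.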
Taxonomy
Topics: Software Engineering Research · Web Data Mining and Analysis · Data Stream Mining Techniques
