Nalanda: A Socio-Technical Graph for Building Software Analytics Tools at Enterprise Scale
Chandra Maddila, Suhas Shanbhogue, Apoorva Agrawal, Thomas Zimmermann,, Chetan Bansal, Nicole Forsgren, Divyanshu Agrawal, Kim Herzig, Arie van, Deursen

TL;DR
Nalanda is a large-scale socio-technical graph platform that enhances software development by providing effective information discovery and recommendations, supporting collaboration across extensive repositories and user bases.
Contribution
This paper introduces Nalanda, a novel large-scale socio-technical graph and recommendation system designed for enterprise software analytics at scale.
Findings
Nalanda graph contains over 37 million nodes and 128 million edges.
The platform's newsfeed has 290 daily and 590 monthly active users.
The recommendation system achieves 78% accuracy in top suggestions.
Abstract
Software development is information-dense knowledge work that requires collaboration with other developers and awareness of artifacts such as work items, pull requests, and files. With the speed of development increasing, information overload is a challenge for people developing and maintaining these systems. Finding information and people is difficult for software engineers, especially when they work in large software systems or have just recently joined a project. In this paper, we build a large scale data platform named Nalanda platform, which contains two subsystems: 1. A large scale socio-technical graph system, named Nalanda graph system 2. A large scale recommendation system, named Nalanda index system that aims at satisfying the information needs of software developers. The Nalanda graph is an enterprise scale graph with data from 6,500 repositories, with 37,410,706 nodes and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSoftware Engineering Research · Software Engineering Techniques and Practices · Software System Performance and Reliability
