Large-Scale Graphs Community Detection using Spark GraphFrames
Elena-Simona Apostol, Adrian-Cosmin Cojocaru, Ciprian-Octavian, Truic\u{a}

TL;DR
This paper presents a scalable framework for large-scale community detection in graphs using Apache Spark GraphFrames, demonstrating its effectiveness on real-world datasets.
Contribution
Introduces a novel Spark GraphFrames-based framework for community detection algorithms like K-Cliques, Louvain, and Fast Greedy, addressing scalability issues.
Findings
Framework is feasible for large-scale graphs
Algorithms perform well on real-world datasets
Scalability improves with distributed processing
Abstract
With the emergence of social networks, online platforms dedicated to different use cases, and sensor networks, the emergence of large-scale graph community detection has become a steady field of research with real-world applications. Community detection algorithms have numerous practical applications, particularly due to their scalability with data size. Nonetheless, a notable drawback of community detection algorithms is their computational intensity~\cite{Apostol2014}, resulting in decreasing performance as data size increases. For this purpose, new frameworks that employ distributed systems such as Apache Hadoop and Apache Spark which can seamlessly handle large-scale graphs must be developed. In this paper, we propose a novel framework for community detection algorithms, i.e., K-Cliques, Louvain, and Fast Greedy, developed using Apache Spark GraphFrames. We test their performance…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComplex Network Analysis Techniques · Advanced Graph Neural Networks · Graph Theory and Algorithms
