Enhancing Scalability and Performance in Influence Maximization with   Optimized Parallel Processing

Hanjiang Wu; Huan Xu; Joongun Park; Jesmin Jahan Tithi; Fabio; Checconi; Jordi Wolfson-Pou; Fabrizio Petrini; Tushar Krishna

arXiv:2411.09473·cs.DC·November 15, 2024

Enhancing Scalability and Performance in Influence Maximization with Optimized Parallel Processing

Hanjiang Wu, Huan Xu, Joongun Park, Jesmin Jahan Tithi, Fabio, Checconi, Jordi Wolfson-Pou, Fabrizio Petrini, Tushar Krishna

PDF

Open Access

TL;DR

This paper introduces EFFICIENTIMM, an optimized parallel algorithm that significantly improves the scalability and performance of influence maximization methods on shared memory systems, especially on large datasets.

Contribution

The paper proposes EFFICIENTIMM, a novel parallelization and memory optimization approach that enhances the efficiency of influence maximization algorithms like IMM.

Findings

01

Achieved an average 5.9x speedup over Ripples on 8 datasets.

02

Reduced cache misses by 357.4x on the Youtube graph.

03

Demonstrated improved scalability and memory access patterns.

Abstract

Influence Maximization (IM) is vital in viral marketing and biological network analysis for identifying key influencers. Given its NP-hard nature, approximate solutions are employed. This paper addresses scalability challenges in scale-out shared memory system by focusing on the state-of-the-art Influence Maximization via Martingales (IMM) benchmark. To enhance the work efficiency of the current IMM implementation, we propose EFFICIENTIMM with key strategies, including new parallelization scheme, NUMA-aware memory usage, dynamic load balancing and fine-grained adaptive data structures. Benchmarking on a 128-core CPU system with 8 NUMA nodes, EFFICIENTIMM demonstrated significant performance improvements, achieving an average 5.9x speedup over Ripples across 8 diverse SNAP datasets, when compared to the best execution times of the original Ripples framework. Additionally, on the Youtube…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsParallel Computing and Optimization Techniques