Communication-Optimal Parallel Standard and Karatsuba Integer Multiplication in the Distributed Memory Model
Lorenzo De Stefani

TL;DR
This paper introduces communication-optimal parallel algorithms for standard and Karatsuba integer multiplication in distributed memory systems, achieving optimal or near-optimal speedup, memory usage, and I/O costs.
Contribution
It presents the first parallel algorithms for integer multiplication that are optimal in communication, memory, and I/O costs in distributed memory models.
Findings
Achieves optimal computational time of O(n^2/P) for standard multiplication.
Achieves optimal time of O(n^{log_2 3}/P) for Karatsuba multiplication.
Bandwidth cost matches known I/O lower bounds, and latency is nearly optimal.
Abstract
We present COPSIM a parallel implementation of standard integer multiplication for the distributed memory setting, and COPK a parallel implementation of Karatsuba's fast integer multiplication algorithm for a distributed memory setting. When using processors, each equipped with a local non-shared memory, to compute the product of tho -digits integer numbers, under mild conditions, our algorithms achieve optimal speedup of the computational time. That is, for COPSIM, and for COPK. The total amount of memory required across the processors is , that is, within a constant factor of the minimum space required to store the input values. We rigorously analyze the Input/Output (I/O) cost of the proposed algorithms. We show that their bandwidth cost (i.e., the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCryptography and Data Security · Complexity and Algorithms in Graphs · Advanced Data Storage Technologies
