Optimal Broadcast Schedules in Logarithmic Time with Applications to Broadcast, All-Broadcast, Reduction and All-Reduction
Jesper Larsson Tr\"aff

TL;DR
This paper presents optimal $O( ext{log} p)$ time algorithms for broadcast, reduction, and related communication problems in parallel systems, improving efficiency and answering open questions from recent research.
Contribution
It introduces new, optimal algorithms for broadcast schedules and related collectives, with schedules computable in $O( ext{log} p)$ time and space, and demonstrates their application to MPI collectives.
Findings
Achieved $O( ext{log} p)$ time for broadcast schedules.
Schedules are computed independently per processor in $O( ext{log} p)$ time.
Algorithms are simple, correct, and improve over previous methods.
Abstract
We give optimally fast time (per processor) algorithms for computing round-optimal broadcast schedules for message-passing parallel computing systems. This affirmatively answers difficult questions posed in a SPAA 2022 BA and a CLUSTER 2022 paper. We observe that the computed schedules and circulant communication graph can likewise be used for reduction, all-broadcast and all-reduction as well, leading to new, round-optimal algorithms for these problems. These observations affirmatively answer open questions posed in a CLUSTER 2023 paper. The problem is to broadcast indivisible blocks of data from a given root processor to all other processors in a (subgraph of a) fully connected network of processors with fully bidirectional, one-ported communication capabilities. In this model, communication rounds are required. Our new algorithms…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
