The Two Quadrillionth Bit of Pi is 0! Distributed Computation of Pi with Apache Hadoop
Tsz-Wo Sze

TL;DR
This paper reports a new record in computing specific bits of Pi, reaching the two quadrillionth bit position with distributed computing on Apache Hadoop, and introduces an elastic scheduling framework for efficient resource utilization.
Contribution
It presents the first computation of the two quadrillionth bit of Pi and develops an elastic framework for scheduling large-scale distributed computations on Hadoop clusters.
Findings
Computed 256 bits of Pi ending at bit position 2,000,000,000,000,252, a new record; the two quadrillionth bit is 0.
Developed an elastic scheduling framework for Hadoop-based Pi computations.
Completed the computation in 23 days of wall-clock time on a 1000-node cluster.
Abstract
We present a new record on computing specific bits of Pi, the mathematical constant, and discuss performing such computations on Apache Hadoop clusters. The specific bits represented in hexadecimal are 0E6C1294 AED40403 F56D2D76 4026265B CA98511D 0FCFFAA1 0F4D28B1 BB5392B8. These 256 bits end at the 2,000,000,000,000,252nd bit position, which doubles the position and quadruples the precision of the previous known record. The position of the first bit is 1,999,999,999,999,997 and the value of the two quadrillionth bit is 0. The computation is carried out by a MapReduce program called DistBbp. To effectively utilize available cluster resources without monopolizing the whole cluster, we develop an elastic computation framework that automatically schedules computation slices, each a DistBbp job, as either map-side or reduce-side computation based on changing cluster load condition. We have…
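The position arithmetic in the abstract can be checked directly from the reported hexadecimal string, and the idea of computing digits at a given position without computing the earlier ones can be illustrated at small scale. The sketch below is an illustration only: the helper names (`reported_bits`, `bit_at`, `last_position`, `bbp_hex_digit`) are hypothetical, and the digit extraction uses the classic Bailey-Borwein-Plouffe (BBP) formula in double precision. The name DistBbp suggests a BBP-type formula, but the abstract does not state which variant is used, and this floating-point version is only reliable for small positions.

```python
# Hex digits and first-bit position as reported in the abstract.
HEX_BITS = ("0E6C1294 AED40403 F56D2D76 4026265B "
            "CA98511D 0FCFFAA1 0F4D28B1 BB5392B8")
FIRST_BIT = 1_999_999_999_999_997
TARGET = 2_000_000_000_000_000  # the two quadrillionth bit


def reported_bits():
    """Expand the reported hex digits into a string of '0'/'1' characters."""
    digits = HEX_BITS.replace(" ", "")
    return "".join(format(int(d, 16), "04b") for d in digits)


def last_position():
    """Bit position of the final reported bit."""
    return FIRST_BIT + len(reported_bits()) - 1


def bit_at(position):
    """Value of the reported bit at an absolute bit position."""
    return int(reported_bits()[position - FIRST_BIT])


def _series(j, n):
    """Fractional part of sum over k of 16^(n-k) / (8k + j)."""
    s = 0.0
    for k in range(n + 1):
        d = 8 * k + j
        # Modular exponentiation keeps each term small and exact.
        s = (s + pow(16, n - k, d) / d) % 1.0
    k = n + 1
    while True:
        term = 16.0 ** (n - k) / (8 * k + j)
        if term < 1e-17:  # remaining tail is below double precision
            break
        s += term
        k += 1
    return s % 1.0


def bbp_hex_digit(n):
    """Hex digit of Pi after skipping n hex digits past the point (small n only)."""
    x = (4 * _series(1, n) - 2 * _series(4, n)
         - _series(5, n) - _series(6, n)) % 1.0
    return int(x * 16)
```

Here `last_position()` evaluates to 2,000,000,000,000,252 and `bit_at(TARGET)` to 0, matching the abstract. `bbp_hex_digit(0)` gives 2, the first hex digit of Pi after the point (Pi = 3.243F6A88... in hexadecimal). Reaching positions near two quadrillion requires far more careful arithmetic and a distributed job, which is what the paper addresses.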
Taxonomy
Topics: Parallel Computing and Optimization Techniques · Advanced Data Storage Technologies · Advanced Database Systems and Queries
