NavP: Enabling Navigational Programming for Science Data Processing via Application-Initiated Checkpointing
Lei Pan, Twinkle Jain

TL;DR
NavP introduces a navigational programming approach with checkpointing to improve science data processing in cloud environments, enabling efficient fault tolerance and cost-effective resource utilization.
Contribution
This paper presents NavP, a novel navigational programming framework with checkpointing for distributed science data systems, facilitating easier deployment and better cloud resource management.
Findings
Enables application programmers to navigate computation across nodes.
Allows partial result publishing for future continuation.
Facilitates leveraging Amazon EC2 Spot market efficiently.
Abstract
Science Data Systems (SDS) handle science data from acquisition through processing to distribution. They are deployed in the Cloud today, and the efficiency of Cloud instance utilization is critical to success. Conventional SDS are unable to take advantage of a cost-effective Amazon EC2 spot market, especially for long-running tasks. Some of the difficulties found in current practice at NASA/JPL are: a lack of mechanism for app programmers to save valuable partial results for future processing continuation, the heavy weight from using container-based (Singularity) sandboxes with more than 200,000 OS-level files; and the gap between scientists developing algorithms/programs on a laptop and the SDS experts deploying software in Cloud computing or supercomputing. We present a first proof-of-principle of this using NavP (Navigational Programming) and fault-tolerant computing (FTC) in SDS,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDistributed and Parallel Computing Systems · Scientific Computing and Data Management · Advanced Data Storage Technologies
