EspressoDB: A scientific database for managing high-performance   computing workflow

Chia Cheng Chang; Christopher K\"orber; Andr\'e Walker-Loud

arXiv:1912.03580·hep-lat·April 16, 2020

EspressoDB: A scientific database for managing high-performance computing workflow

Chia Cheng Chang, Christopher K\"orber, Andr\'e Walker-Loud

PDF

2 Repos

TL;DR

EspressoDB is a Python-based framework designed to streamline and manage complex scientific computing workflows, ensuring data integrity and reducing manual effort at high-performance computing facilities.

Contribution

It introduces a novel object-relational data management system tailored for scientific workflows, enhancing flexibility and ease of use compared to existing solutions.

Findings

01

Improves workflow management efficiency in scientific computing.

02

Centralizes data storage and guarantees data integrity.

03

Reduces human time spent on managing computational jobs.

Abstract

Leadership computing facilities around the world support cutting-edge scientific research across a broad spectrum of disciplines including understanding climate change, combating opioid addiction, or simulating the decay of a neutron. While the increase in computational power has allowed scientists to better evaluate the underlying model, the size of these computational projects have grown to a point where a framework is desired to facilitate managing the workflow. A typical scientific computing workflow includes: Defining all input parameters for every step of the computation; Defining dependencies of computational tasks; Storing some of the output data; Post-processing these data files; Performing data analysis on output. EspressoDB is a programmatic object-relational data management framework implemented in Python and based on the Django web framework. EspressoDB was developed to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.