SciChain: Trustworthy Scientific Data Provenance
Abdullah Al-Mamun, Dongfang Zhao

TL;DR
SciChain introduces a blockchain-based system for high-performance computing that ensures trustworthy scientific data provenance with minimal performance overhead.
Contribution
It presents SciChain, the first blockchain system tailored for HPC, featuring the POST protocol for immutable and scalable scientific data provenance.
Findings
SciChain guarantees data trustworthiness effectively.
It achieves significantly lower overhead compared to existing systems.
Experimental results validate its efficiency and reliability.
Abstract
The state-of-the-art for auditing and reproducing scientific applications on high-performance computing (HPC) systems is through a data provenance subsystem. While recent advances in data provenance lie in reducing the performance overhead and improving the user's query flexibility, the fidelity of data provenance is often overlooked: there is no such a way to ensure that the provenance data itself has not been fabricated or falsified. This paper advocates to leverage blockchains to deliver immutable and autonomous data provenance services such that scientific data are trustworthy. The challenges for adopting blockchains to HPC include designing a new blockchain architecture compatible with the HPC platforms and, more importantly, a set of new consensus protocols for scientific applications atop blockchains. To this end, we have designed the proof-of-scalable-traceability (POST)…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBlockchain Technology Applications and Security · Scientific Computing and Data Management · Innovative Microfluidic and Catalytic Techniques Innovation
