DACP: A Scientific Data Access and Collaboration Protocol
Zhihong Shen, Xiaojie Zhu, Zhenjing Cheng, Hao Ren, Zhaoji Liang, Changfa Lu

TL;DR
DACP is a new protocol designed to improve data sharing, discovery, and collaboration across scientific data centers, addressing limitations of existing network protocols in data-intensive scientific computing.
Contribution
The paper introduces DACP, a novel protocol with a core data model and a reference server, enabling efficient cross-domain scientific data collaboration.
Findings
DACP facilitates data discovery and in-situ computation.
The faird server demonstrates DACP's practical implementation.
DACP enhances interoperability in scientific data infrastructures.
Abstract
Scientific computing is rapidly entering a data-intensive era. However, existing general-purpose network protocol stacks face limitations in eliminating data silos and improving data accessibility and interoperability, making it difficult to effectively meet the demands of emerging paradigms such as AI4Science. To address these challenges, we propose the Data Access and Collaboration Protocol (DACP). DACP defines the Streaming Data Frame (SDF) as its core data model. Through Unified Resource Identification, columnar stream framing, and a reverse supply mechanism, DACP enables data discovery, in-situ computation, and the streaming return of results across scientific data centers, thereby facilitating efficient cross-domain collaboration. Furthermore, this paper introduces faird, a reference server implementation of DACP. This work provides a viable path for building scalable and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
