Deploying in-network caches in support of distributed scientific data sharing
Alex Sim, Ezra Kissel, and Chin Guok

TL;DR
This paper explores deploying in-network caches to enhance data sharing efficiency and network bandwidth preservation in scientific environments, detailing deployment strategies, performance considerations, and future scalability options.
Contribution
It introduces a novel approach to in-network caching for scientific data distribution, including deployment details and scalability methods.
Findings
Improved application performance through caching
Reduced network bandwidth usage
Scalable deployment models proposed
Abstract
The importance of intelligent data placement, management, and analysis has become apparent as scientific data volumes across the network continue to increase. To that end, we describe the use of in-network caching service deployments as a means to improve application performance and preserve available network bandwidth in a high energy physics data distribution environment. Details of the software and hardware deployments, performance considerations, and cache usage analysis will be described. We include thoughts on possible future deployment models involving caching node installations at the edge along with methods to scale our approach.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDistributed and Parallel Computing Systems · Advanced Data Storage Technologies · Distributed systems and fault tolerance
