Effectiveness and predictability of in-network storage cache for scientific workflows
Caitlin Sim, Kesheng Wu, Alex Sim, Inder Monga, Chin Guok, Frank, Wurthwein, Diego Davila, Harvey Newman, Justas Balcas

TL;DR
This study evaluates a regional data cache for scientific workflows, demonstrating significant reduction in wide-area network traffic and showing that machine learning can accurately predict cache behavior for better resource planning.
Contribution
The paper provides an empirical analysis of a regional cache system's effectiveness and introduces a machine learning model to predict cache performance in scientific data access scenarios.
Findings
67.6% of file requests removed from WAN
Reduced WAN traffic volume by 12.3TB daily
ML model accurately predicts cache behavior
Abstract
Large scientific collaborations often have multiple scientists accessing the same set of files while doing different analyses, which create repeated accesses to the large amounts of shared data located far away. These data accesses have long latency due to distance and occupy the limited bandwidth available over the wide-area network. To reduce the wide-area network traffic and the data access latency, regional data storage caches have been installed as a new networking service. To study the effectiveness of such a cache system in scientific applications, we examine the Southern California Petabyte Scale Cache for a high-energy physics experiment. By examining about 3TB of operational logs, we show that this cache removed 67.6% of file requests from the wide-area network and reduced the traffic volume on wide-area network by 12.3TB (or 35.4%) an average day. The reduction in the traffic…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
Methodstravel james
