Accuracy vs. Computational Cost Tradeoff in Distributed Computer System Simulation
Adrian Colaso, Pablo Prieto, Jose-Angel Herrero, Pablo Abad, Valentin, Puente, Jose-Angel Gregorio

TL;DR
This paper evaluates the tradeoff between accuracy and computational cost in simulating distributed computer systems, demonstrating that multi-node simulation offers accurate results with manageable overhead.
Contribution
It provides an empirical analysis of the accuracy loss and overhead in distributed system simulation, advocating for multi-node simulation as the preferred approach.
Findings
Single-node evaluation can lead to significant errors in distributed system analysis.
Multi-node simulation incurs acceptable computational overhead.
Accurate simulation is crucial for valid hardware optimization conclusions.
Abstract
Simulation is a fundamental research tool in the computer architecture field. These kinds of tools enable the exploration and evaluation of architectural proposals capturing the most relevant aspects of the highly complex systems under study. Many state-of-the-art simulation tools focus on single-system scenarios, but the scalability required by trending applications has shifted towards distributed computing systems integrated via complex software stacks. Web services with client-server architectures or distributed storage and processing of scale-out data analytics (Big Data) are among the main exponents. The complete simulation of a distributed computer system is the appropriate methodology to conduct accurate evaluations. Unfortunately, this methodology could have a significant impact on the already large computational effort derived from detailed simulation. In this work, we conduct…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsParallel Computing and Optimization Techniques · Cloud Computing and Resource Management · Distributed and Parallel Computing Systems
