On the role of benchmarking data sets and simulations in method comparison studies
Sarah Friedrich, Tim Friede

TL;DR
This paper examines the roles of benchmarking datasets and simulations in method comparison studies, highlighting their advantages and disadvantages, and proposes new combined approaches for more effective evaluation.
Contribution
It provides a comparative analysis of benchmarking data and simulation methods, introducing innovative strategies that integrate both to improve method evaluation.
Findings
Benchmarking datasets offer real-world relevance but may lack diversity.
Simulation studies provide controlled environments but may lack realism.
Combining both approaches can enhance evaluation robustness.
Abstract
Method comparisons are essential to provide recommendations and guidance for applied researchers, who often have to choose from a plethora of available approaches. While many comparisons exist in the literature, these are often not neutral but favour a novel method. Apart from the choice of design and a proper reporting of the findings, there are different approaches concerning the underlying data for such method comparison studies. Most manuscripts on statistical methodology rely on simulation studies and provide a single real-world data set as an example to motivate and illustrate the methodology investigated. In the context of supervised learning, in contrast, methods are often evaluated using so-called benchmarking data sets, i.e. real-world data that serve as gold standard in the community. Simulation studies, on the other hand, are much less common in this context. The aim of this…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStatistical Methods in Clinical Trials · Meta-analysis and systematic reviews · Statistics Education and Methodologies
