Designing a Data Science simulation with MERITS: A Primer
Corrine F Elliott, James PC Duncan, Tiffany M Tang, Merle Behr, Karl Kumbier, and Bin Yu

TL;DR
This paper introduces the MERITS framework, inspired by PCS, to guide the design of high-quality, trustworthy Data Science simulations through six key principles, a conceptual recipe, guidelines, and a case study.
Contribution
It proposes a comprehensive set of six MERITS principles for designing high-quality Data Science simulations, along with a conceptual framework, guidelines, and a practical case study.
Findings
The MERITS framework effectively guides simulation design.
The analogy with cooking helps conceptualize simulation components.
The case study demonstrates practical application of the framework.
Abstract
Simulations play a crucial role in the modern scientific process. Yet despite (or due to) this ubiquity, the Data Science community shares neither a comprehensive definition for a "high-quality" study nor a consolidated guide to designing one. Inspired by the Predictability-Computability-Stability (PCS) framework for 'veridical' Data Science, we propose six MERITS that a simulation study should satisfy. (Modularity and Efficiency support the computability of a study, encouraging clean and flexible implementation. Realism and Stability address the conceptualization of the research problem: How well does a study predict reality, such that its conclusions generalize to new data/contexts? Finally, Intuitiveness and Transparency encourage good communication and trustworthiness of study design and results.) Drawing an analogy between simulation and cooking, we moreover offer (a) a conceptual…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBig Data and Business Intelligence
