Growing the Simulation Ecosystem: Introducing Mesa Data to Provide Transparent, Accessible and Extensible Data Pipelines for Simulation Development
Thomas Pike, Samantha Golden, Daniel Lowdermilk, Brandon Luong,, Benjamin Rosado

TL;DR
This paper introduces Mesa Data, a set of transparent, accessible, and extensible data pipelines designed to improve data handling and reproducibility in agent-based modeling, demonstrated through crop yield and synthetic population examples.
Contribution
It presents a new ecosystem component that enhances data pipelines for simulation development, focusing on transparency, accessibility, and extensibility to support community collaboration.
Findings
Data pipelines from download to integration into ABM are demonstrated.
Pipelines are transparent, guiding users step-by-step.
Open-source pipelines facilitate community development.
Abstract
The Agent Based Model community has a rich and diverse ecosystem of libraries, platforms, and applications to help modelers develop rigorous simulations. Despite this robust and diverse ecosystem, the complexity of life from microbial communities to the global ecosystem still presents substantial challenges in making reusable code that can optimize the ability of the knowledge-sharing and reproducibility. This research seeks to provide new tools to mitigate some of these challenges by offering a vision of a more holistic ecosystem that takes researchers and practitioners from the data collection through validation, with transparent, accessible, and extensible subcomponents. This proposed approach is demonstrated through two data pipelines (crop yield and synthetic population) that take users from data download through the cleaning and processing until users of have data that can be…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsScientific Computing and Data Management · Distributed and Parallel Computing Systems
