# Mapping secondary data gaps for social simulation modelling: A case study of Syrian asylum migration to Europe

**Authors:** Sarah Nurse, Martin Hinsch, Jakub Bijak, Denis Kierans, Alireza Jahani, Edgar Scrase, Zaruhi Mkrtchyan, Alejandra Rodriguez-Sánchez

PMC · DOI: 10.12688/openreseurope.15583.1 · Open Research Europe · 2023-12-08

## TL;DR

This paper introduces a structured method for identifying and addressing data gaps when building simulation models of complex social processes, using Syrian asylum migration to Europe as a case study.

## Contribution

A formal process for assessing and using secondary data in agent-based social simulation models, with a focus on identifying knowledge gaps.

## Key findings

- A thematic map and qualitative uncertainty assessment of key data sources were produced for Syrian migration to Europe.
- The method helps identify data gaps that can be addressed through primary data collection or sensitivity analysis.
- The approach supports question-driven modeling rather than purely data-driven modeling.

## Abstract

Simulation models of social processes may require data that are not readily available, have low accuracy, are incomplete or biased. The paper presents a formal process for collating, assessing, selecting, and using secondary data as part of creating, validating, and documenting an agent-based simulation model of a complex social process, in this case, asylum migration to Europe. The process starts by creating an inventory of data sources, and the associated metadata, followed by assessing different aspects of data quality according to pre-defined criteria. As a result, based on the typology of available data, we are able to produce a thematic map of the area under study, and assess the uncertainty of key data sources, at least qualitatively. We illustrate the process by looking at the data on Syrian migration to Europe in 2011–21.

In parallel, successive stages of the development of a simulation model allow for identifying key types of information which are needed as input into empirically grounded modelling analysis. Juxtaposing the available evidence and model requirements allows for identifying knowledge gaps that need filling, preferably by collecting additional primary data, or, failing that, by carrying out a sensitivity analysis for the assumptions made. By doing so, we offer a way of formalising the data collection process in the context of model-building endeavours, while allowing the modelling to be predominantly question-driven rather than purely data-driven. The paper concludes with recommendations with respect to data and evidence, both for modellers, as well as model users in practice-oriented applications.

We can study migration with computer simulation models. The data we need for that may not be available or be low quality. This paper is about how to use data in modelling. We suggest how to gather the data, check their quality, and use them in models. We show how to find out where we need more data, and how to gather them in an inventory. We use an example of migration from Syria to Europe to point to different problems. How much we know about the data can help us understand what we know and do not know about migration.

## Full-text entities

- **Species:** Homo sapiens (human, species) [taxon 9606]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC10873545/full.md

## Figures

4 figures with captions in the complete paper: https://tomesphere.com/paper/PMC10873545/full.md

## References

12 references — full list in the complete paper: https://tomesphere.com/paper/PMC10873545/full.md

---
Source: https://tomesphere.com/paper/PMC10873545