# Bayesian Propagation of Record Linkage Uncertainty into Population Size   Estimation of Human Rights Violations

**Authors:** Mauricio Sadinle

arXiv: 1812.09590 · 2018-12-27

## TL;DR

This paper introduces a Bayesian linkage-averaging method to incorporate record linkage uncertainty into population size estimates, improving accuracy in human rights violation studies with imperfect data.

## Contribution

It proposes a two-stage Bayesian approach that propagates linkage uncertainty into population estimates, allowing flexible model integration and better handling of data errors.

## Key findings

- Effective propagation of linkage uncertainty demonstrated in case study
- Method accommodates various linkage and capture-recapture models
- Improves population size estimates in noisy, incomplete data contexts

## Abstract

Multiple-systems or capture-recapture estimation are common techniques for population size estimation, particularly in the quantitative study of human rights violations. These methods rely on multiple samples from the population, along with the information of which individuals appear in which samples. The goal of record linkage techniques is to identify unique individuals across samples based on the information collected on them. Linkage decisions are subject to uncertainty when such information contains errors and missingness, and when different individuals have very similar characteristics. Uncertainty in the linkage should be propagated into the stage of population size estimation. We propose an approach called linkage-averaging to propagate linkage uncertainty, as quantified by some Bayesian record linkage methodologies, into a subsequent stage of population size estimation. Linkage-averaging is a two-stage approach in which the results from the record linkage stage are fed into the population size estimation stage. We show that under some conditions the results of this approach correspond to those of a proper Bayesian joint model for both record linkage and population size estimation. The two-stage nature of linkage-averaging allows us to combine different record linkage models with different capture-recapture models, which facilitates model exploration. We present a case study from the Salvadoran civil war, where we are interested in estimating the total number of civilian killings using lists of witnesses' reports collected by different organizations. These lists contain duplicates, typographical and spelling errors, missingness, and other inaccuracies that lead to uncertainty in the linkage. We show how linkage-averaging can be used for transferring the uncertainty in the linkage of these lists into different models for population size estimation.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1812.09590/full.md

## Figures

13 figures with captions in the complete paper: https://tomesphere.com/paper/1812.09590/full.md

## References

43 references — full list in the complete paper: https://tomesphere.com/paper/1812.09590/full.md

---
Source: https://tomesphere.com/paper/1812.09590