A Bayesian Record Linkage Approach to Applications in Tree Demography Using Overlapping LiDAR Scans
L. Drew, A. Kaplan, I. Breckheimer

TL;DR
This paper presents a Bayesian hierarchical model for record linkage in spatial data, specifically applied to LiDAR scans of trees, enabling accurate individual identification and growth estimation despite noise and overlapping data sources.
Contribution
It introduces a novel two-stage Bayesian framework that links record matching with tree growth modeling, incorporating uncertainty propagation and scalability strategies.
Findings
Model performs well on simulated data.
Application to real LiDAR data reveals topographic effects on tree growth.
Provides robust uncertainty quantification in complex spatial data linkage.
Abstract
In the information age, it has become increasingly common for data containing records about overlapping individuals to be distributed across multiple sources, making it necessary to identify which records refer to the same individual. The goal of record linkage is to estimate this unknown structure in the absence of a unique identifiable attribute. We introduce a Bayesian hierarchical record linkage model for spatial location data motivated by the estimation of individual specific growth-size curves for conifer species using data derived from overlapping LiDAR scans. Annual tree growth may be estimated dependent upon correctly identifying unique individuals across scans in the presence of noise. We formalize a two-stage modeling framework, connecting the record linkage model and a flexible downstream individual tree growth model, that provides robust uncertainty quantification and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsForensic and Genetic Research · Data Quality and Management · Privacy-Preserving Technologies in Data
