A Double Machine Learning Trend Model for Citizen Science Data
Daniel Fink (1), Alison Johnston (2), Matt Strimas-Mackey (1), Tom, Auer (1), Wesley M. Hochachka (1), Shawn Ligocki (1), Lauren Oldham Jaromczyk, (1), Orin Robinson (1), Chris Wood (1), Steve Kelling (1), and Amanda D., Rodewald (1) ((1) Cornell Lab of Ornithology

TL;DR
This paper introduces a novel double machine learning approach to accurately estimate spatially detailed species population trends from citizen science data, effectively controlling for interannual confounding and data biases.
Contribution
It develops a new modeling framework combining double machine learning with simulation to adjust for confounding in citizen science datasets, enabling reliable spatial trend estimation.
Findings
Successfully distinguished spatially varying and constant trends at 27km resolution
Low error rates in trend direction estimation
High correlation in trend magnitude estimates
Abstract
1. Citizen and community-science (CS) datasets have great potential for estimating interannual patterns of population change given the large volumes of data collected globally every year. Yet, the flexible protocols that enable many CS projects to collect large volumes of data typically lack the structure necessary to keep consistent sampling across years. This leads to interannual confounding, as changes to the observation process over time are confounded with changes in species population sizes. 2. Here we describe a novel modeling approach designed to estimate species population trends while controlling for the interannual confounding common in citizen science data. The approach is based on Double Machine Learning, a statistical framework that uses machine learning methods to estimate population change and the propensity scores used to adjust for confounding discovered in the data.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpecies Distribution and Climate Change · Remote Sensing in Agriculture · Data-Driven Disease Surveillance
