Scaling multi-species occupancy models to large citizen science datasets
Martin Ingram, Damjan Vukcevic, Nick Golding

TL;DR
This paper introduces scalable approximate Bayesian inference methods using GPUs to fit multi-species occupancy models to large citizen science datasets, improving species distribution estimates and agreement with expert maps.
Contribution
It develops GPU-accelerated variational inference for multi-species occupancy models, enabling analysis of large datasets previously infeasible with traditional methods.
Findings
VI outperformed MCMC and single-species models in predictive accuracy.
Modeling detection process improved agreement with expert maps.
Multi-species models accurately estimated species distributions from large data.
Abstract
Citizen science datasets can be very large and promise to improve species distribution modelling, but detection is imperfect, risking bias when fitting models. In particular, observers may not detect species that are actually present. Occupancy models can estimate and correct for this observation process, and multi-species occupancy models exploit similarities in the observation process, which can improve estimates for rare species. However, the computational methods currently used to fit these models do not scale to large datasets. We develop approximate Bayesian inference methods and use graphics processing units (GPUs) to scale multi-species occupancy models to very large citizen science data. We fit multi-species occupancy models to one month of data from the eBird project consisting of 186,811 checklist records comprising 430 bird species. We evaluate the predictions on a spatially…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpecies Distribution and Climate Change · Wildlife Ecology and Conservation · Ecology and Vegetation Dynamics Studies
MethodsTest · Variational Inference
