Poisson point process models solve the "pseudo-absence problem" for presence-only data in ecology
David I. Warton, Leah C. Shepherd

TL;DR
This paper introduces Poisson point process models as a robust alternative to the traditional pseudo-absence approach in ecology, improving species distribution modeling with presence-only data.
Contribution
It establishes a formal link between Poisson point process modeling and logistic regression, guiding better pseudo-absence selection and model interpretation.
Findings
Poisson point process models effectively address the pseudo-absence problem.
As pseudo-absences increase, logistic regression converges to the Poisson model.
Point process modeling improves pseudo-absence choice and model accuracy.
Abstract
Presence-only data, point locations where a species has been recorded as being present, are often used in modeling the distribution of a species as a function of a set of explanatory variables---whether to map species occurrence, to understand its association with the environment, or to predict its response to environmental change. Currently, ecologists most commonly analyze presence-only data by adding randomly chosen "pseudo-absences" to the data such that it can be analyzed using logistic regression, an approach which has weaknesses in model specification, in interpretation, and in implementation. To address these issues, we propose Poisson point process modeling of the intensity of presences. We also derive a link between the proposed approach and logistic regression---specifically, we show that as the number of pseudo-absences increases (in a regular or uniform random arrangement),…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
