Learning Seasonal Phytoplankton Communities with Topic Models
Arnold Kalmbach, Heidi M. Sosik, Gregory Dudek, and Yogesh Girdhar

TL;DR
This paper introduces a probabilistic spatiotemporal topic model for phytoplankton communities, capturing co-occurrence patterns and enabling interpretable community representations from time series data.
Contribution
It develops a non-parametric, interpretable model that learns community structures without strong assumptions, improving prediction of taxon distributions from environmental variables.
Findings
Model improves community representation interpretability
Linear regression predicts community distributions effectively
Outperforms PCA in predicting taxon distributions
Abstract
In this work we develop and demonstrate a probabilistic generative model for phytoplankton communities. The proposed model takes counts of a set of phytoplankton taxa in a timeseries as its training data, and models communities by learning sparse co-occurrence structure between the taxa. Our model is probabilistic, where communities are represented by probability distributions over the species, and each time-step is represented by a probability distribution over the communities. The proposed approach uses a non-parametric, spatiotemporal topic model to encourage the communities to form an interpretable representation of the data, without making strong assumptions about the communities. We demonstrate the quality and interpretability of our method by its ability to improve performance of a simplistic regression model. We show that simple linear regression is sufficient to predict the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsEnvironmental DNA in Biodiversity Studies · Advanced Text Analysis Techniques · Species Distribution and Climate Change
