# Tracking SARS-CoV-2 genomic variants in wastewater sequencing data with LolliPop

**Authors:** David Dreifuss, Ivan Topolsky, Pelin Icer Baykal, Niko Beerenwinkel

PMC · DOI: 10.1371/journal.pcbi.1014003 · PLOS Computational Biology · 2026-02-19

## TL;DR

LolliPop is a new method for tracking SARS-CoV-2 variants in wastewater sequencing data, enabling accurate variant abundance estimates even with missing data.

## Contribution

LolliPop introduces a statistical approach using temporal regularization for robust variant deconvolution in wastewater sequencing.

## Key findings

- LolliPop accurately estimates variant abundances in wastewater data with high missingness.
- The method uses temporal regularization equivalent to kernel smoothing for robustness.
- LolliPop performs well on both simulated and real Swiss wastewater data.

## Abstract

During the COVID-19 pandemic, wastewater-based epidemiology has progressively taken a central role as a pathogen surveillance tool. Tracking viral loads and variant outbreaks in sewage offers advantages over clinical surveillance methods by providing estimates not biased by testing practices and enabling early detection. However, wastewater-based epidemiology poses new computational research questions that need to be solved in order for this approach to be implemented broadly and successfully. Here, we address the variant deconvolution problem, where we aim to estimate the relative abundances of genomic variants from next-generation sequencing data of a mixed wastewater sample. We introduce LolliPop, a computational method to solve the variant deconvolution problem. LolliPop is tailored to wastewater time series sequencing data and applies temporal regularization in the form of a fused ridge penalty. We show that this regularization is equivalent to kernel smoothing and that it makes abundance estimates robust to very high levels of missing data, which is common for wastewater sequencing. We use the bootstrap to produce confidence intervals, and develop analytical standard errors that can produce similar confidence intervals at a fraction of the computational cost. We demonstrate the application of our method to data from the Swiss wastewater surveillance efforts as well as on simulated data.

Wastewater-based epidemiology has become a valuable tool for tracking viruses like SARS-CoV-2 across entire communities. Sequencing wastewater can reveal which viral variants are circulating, offering early insights into variant dynamics while avoiding the biases of clinical testing. A central challenge is to infer the relative abundances of these variants from observed mutation data. This task is complicated by the fact that variant profiles can be highly similar, and the data is often noisy with many missing read count values from genomic positions with no coverage, especially when the incidence of the pathogen is low. We developed LolliPop, a statistical method that leverages the time series structure of wastewater data to robustly deconvolve variant abundances and compute fast confidence intervals. Using both simulated data and real data from the Swiss national variant monitoring, we show that LolliPop is accurate and robust to high levels of missing data.

## Linked entities

- **Diseases:** SARS-CoV-2 (MONDO:0100096)

## Full-text entities

- **Genes:** TAF1A (TATA-box binding protein associated factor, RNA polymerase I subunit A) [NCBI Gene 9015] {aka MGC:17061, RAFI48, SL1, TAFI48}
- **Diseases:** COVID-19 (MESH:D000086382), infection (MESH:D007239), influenza (MESH:D007251)
- **Chemicals:** lead (MESH:D007854), BA.4 (-)
- **Species:** Respiratory syncytial virus (no rank) [taxon 12814], Severe acute respiratory syndrome coronavirus 2 (no rank) [taxon 2697049], H1N1 subtype (serotype) [taxon 114727]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12945317/full.md

## Figures

3 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12945317/full.md

## References

37 references — full list in the complete paper: https://tomesphere.com/paper/PMC12945317/full.md

---
Source: https://tomesphere.com/paper/PMC12945317