# A digital twin for real-time biodiversity forecasting with citizen science data

**Authors:** Otso Ovaskainen, Steven Winter, Gleb Tikhonov, Patrik Lauha, Ari Lehtiö, Ossi Nokelainen, Nerea Abrego, Anni Aroluoma, Jesse Patrick Harrison, Mikko Heikkinen, Aleksi Kallio, Anniina Koliseva, Aleksi Lehikoinen, Tomas Roslin, Panu Somervuo, Allan Tainá Souza, Jemal Tahir, Jussi Talaskivi, Alpo Turunen, Aurélie Vancraeyenest, Gabriela Zuquim, Hannu Autto, Jari Hänninen, Jasmin Inkinen, Outa Kalttopää, Janne Koskinen, Matti Kotakorpi, Kim Kuntze, John Loehr, Marko Mutanen, Mikko Oranen, Riku Paavola, Risto Renkonen, Pauliina Schiestl-Aalto, Mikko Sipilä, Maija Sujala, Janne Sundell, Saana Tepsa, Esa-Pekka Tuominen, Joni Uusitalo, Mikko Vallinmäki, Emma Vatka, Silja Veikkolainen, Phillip C. Watts, David Dunson

PMC · DOI: 10.1038/s41559-025-02966-3 · Nature Ecology & Evolution · 2026-01-27

## TL;DR

This paper introduces a digital twin system that uses citizen science audio data and machine learning to improve real-time biodiversity monitoring and forecasting.

## Contribution

A novel digital twin approach that enables non-expert citizens to contribute to accurate biodiversity predictions using smartphone audio recordings.

## Key findings

- The app generated 15 million bird detections over two years using citizen-collected audio data.
- Digital-twin-informed models showed higher accuracy in predicting bird distributions compared to traditional methods.
- The system reduces data quality variation and sampling biases through machine learning and interval recordings.

## Abstract

Citizen science provides large amounts of biodiversity data. Key challenges in unlocking its full potential include engaging citizens with limited species identification skills and accelerating the transition from data collection to research and monitoring outputs. Here we use a large dataset from Finland to show how even citizens who cannot identify birds themselves can contribute to real-time predictions of avian distributions. This is achieved through a digital twin that combines smartphone-based citizen science with long-term knowledge in a continuously updating model. The app submits raw audio to a backend that classifies birds with machine learning, reducing variation in data quality and enabling validation and reclassification by continuously improving classifiers. We counteracted spatiotemporal sampling biases by interval recordings and permanent point count networks. Over 2 years, the app generated 15 million bird detections. Independent test data show that the digital-twin-informed models are more accurate at predicting bird spatiotemporal distributions. Because our approach is highly scalable and has the potential to generate biomonitoring data even in understudied areas, it could accelerate the flow of reliable biodiversity information and increase inclusivity in citizen science projects.

Citizen science data are increasingly used in biodiversity monitoring. This study applies a digital twin approach to biodiversity monitoring using a large citizen science dataset on birds from Finland, demonstrating its potential for ecological forecasting.

## Full-text entities

- **Diseases:** HMSC (MESH:D003147), PAM (MESH:D014202), MK (MESH:D007706)
- **Chemicals:** DT (-)
- **Species:** Acrocephalus schoenobaenus (sedge warbler, species) [taxon 52609], Cepora nerissa (common gull, species) [taxon 441320], Sylvia borin (garden warbler, species) [taxon 73324], Homo sapiens (human, species) [taxon 9606], Larus canus (mew gull, species) [taxon 28681]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12971481/full.md

## Figures

5 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12971481/full.md

## References

16 references — full list in the complete paper: https://tomesphere.com/paper/PMC12971481/full.md

---
Source: https://tomesphere.com/paper/PMC12971481