# Data-centric AI approach for automated wildflower monitoring

**Authors:** Gerard Schouten, Bas S. H. T. Michielsen, Barbara Gravendeel

PMC · DOI: 10.1371/journal.pone.0302958 · PLOS ONE · 2024-09-09

## TL;DR

This paper introduces a dataset and AI model for automatically monitoring wildflowers in various landscapes.

## Contribution

The novel contribution is a data-centric AI approach with a new dataset and model for wildflower monitoring.

## Key findings

- The EWD dataset contains 2,002 images with 65,571 annotations for 160 wildflower species.
- The baseline model achieves an mAP score of 0.82, outperforming similar models.
- The dataset and model support automated assessment of wildflower richness and abundance.

## Abstract

We present the Eindhoven Wildflower Dataset (EWD) as well as a PyTorch object detection model that is able to classify and count wildflowers. EWD, collected over two entire flowering seasons and expert annotated, contains 2,002 top-view images of flowering plants captured ‘in the wild’ in five different landscape types (roadsides, urban green spaces, cropland, weed-rich grassland, marshland). It holds a total of 65,571 annotations for 160 species belonging to 31 different families of flowering plants and serves as a reference dataset for automating wildflower monitoring and object detection in general. To ensure consistent annotations, we define species-specific floral count units and provide extensive annotation guidelines. With a 0.82 mAP (@IoU > 0.50) score the presented baseline model, trained on a balanced subset of EWD, is to the best of our knowledge superior in its class. Our approach empowers automated quantification of wildflower richness and abundance, which helps understanding and assessing natural capital, and encourages the development of standards for AI-based wildflower monitoring. The annotated EWD dataset and the code to train and run the baseline model are publicly available.

## Full-text entities

- **Diseases:** COVID-19 infection (MESH:D000086382), skin lesion (MESH:D012871), melanomas (MESH:D008545), TP (MESH:C579935), FCU (MESH:D056587), FP (MESH:D017541)
- **Chemicals:** sugar (MESH:D000073893), FCU (-), carbon dioxide (MESH:D002245), nitrogen (MESH:D009584)
- **Species:** Berteroa incana (species) [taxon 228874], Leucanthemum vulgare (species) [taxon 99072], Centaurea cyanus (bachelor's button, species) [taxon 41522], Cardamine pratensis (cuckoo flower, species) [taxon 50465], Dianthus deltoides (maiden pink, species) [taxon 202448], Matricaria chamomilla (species) [taxon 98504], Caltha palustris (species) [taxon 3449], Lotus corniculatus (species) [taxon 47247], Pedicularis palustris (species) [taxon 462877], Anthriscus sylvestris (species) [taxon 48027], Lamium purpureum (purple archangel, species) [taxon 53164], Tanacetum vulgare (tansy, species) [taxon 128002], Ranunculus aquatilis (species) [taxon 568530], Dryas integrifolia (species) [taxon 395290], Ranunculus acris (cmmon buttercup, species) [taxon 3447], Daucus carota (carrot, species) [taxon 4039], Ranunculus repens (species) [taxon 137665], Homo sapiens (human, species) [taxon 9606], Filipendula ulmaria (species) [taxon 57917], Cicuta virosa (species) [taxon 48109], Convolvulus arvensis (species) [taxon 4123], Dactylorhiza praetermissa (species) [taxon 337432], Papaver rhoeas (common poppy, species) [taxon 33128], Taraxacum officinale (dandelion, species) [taxon 50225], Galium palustre (species) [taxon 29793], Crepis capillaris (species) [taxon 13465], Veronica longifolia (species) [taxon 165317], Salvia pratensis (species) [taxon 49216], Dactylorhiza maculata (heath spotted orchid, species) [taxon 230572], Lamium album (white dead-nettle, species) [taxon 53159], Hypochaeris radicata (flatweed, species) [taxon 58656], Bellis perennis (English daisy, species) [taxon 41492], Cirsium dissectum (species) [taxon 1155260], Erigeron annuus (eastern daisy fleabane, species) [taxon 91248]
- **Cell lines:** S2 — Drosophila melanogaster (Fruit fly), Spontaneously immortalized cell line (CVCL_Z232)

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC11383241/full.md

## Figures

7 figures with captions in the complete paper: https://tomesphere.com/paper/PMC11383241/full.md

## References

88 references — full list in the complete paper: https://tomesphere.com/paper/PMC11383241/full.md

---
Source: https://tomesphere.com/paper/PMC11383241