# Extensive data engineering to the rescue: building a multi-species katydid detector from unbalanced, atypical training datasets

**Authors:** Shyam Madhusudhana, Holger Klinck, Laurel B. Symes

PMC · DOI: 10.1098/rstb.2023.0444 · Philosophical Transactions of the Royal Society B: Biological Sciences · 2024-05-06

## TL;DR

This paper presents a deep learning solution to identify 31 katydid species in a Panamanian forest using data engineering techniques to overcome challenges in unbalanced and mismatched training data.

## Contribution

The study introduces a custom deep learning model and data engineering methods tailored for katydid detection in tropical environments.

## Key findings

- Rigorous data engineering improved model performance despite limited and imbalanced training data.
- Physics-based data augmentation and signal-processing tuning enhanced species recognition accuracy.
- The developed methods are integrated into Koogu, an open-source bioacoustic analysis toolbox.

## Abstract

Passive acoustic monitoring (PAM) is a powerful tool for studying ecosystems. However, its effective application in tropical environments, particularly for insects, poses distinct challenges. Neotropical katydids produce complex species-specific calls, spanning mere milliseconds to seconds and spread across broad audible and ultrasonic frequencies. However, subtle differences in inter-pulse intervals or central frequencies are often the only discriminatory traits. These extremities, coupled with low source levels and susceptibility to masking by ambient noise, challenge species identification in PAM recordings. This study aimed to develop a deep learning-based solution to automate the recognition of 31 katydid species of interest in a biodiverse Panamanian forest with over 80 katydid species. Besides the innate challenges, our efforts were also encumbered by a limited and imbalanced initial training dataset comprising domain-mismatched recordings. To overcome these, we applied rigorous data engineering, improving input variance through controlled playback re-recordings and by employing physics-based data augmentation techniques, and tuning signal-processing, model and training parameters to produce a custom well-fit solution. Methods developed here are incorporated into Koogu, an open-source Python-based toolbox for developing deep learning-based bioacoustic analysis solutions. The parametric implementations offer a valuable resource, enhancing the capabilities of PAM for studying insects in tropical ecosystems.

This article is part of the theme issue ‘Towards a toolkit for global insect biodiversity monitoring’.

## Full-text entities

- **Diseases:** katydid call (MESH:C565984), GT (MESH:D007815), katydid sounds (MESH:D012135)
- **Species:** Anaulacomera furcata (species) [taxon 2853494], Bacillus sp. AT (species) [taxon 1196779], Montezumina bradleyi (species) [taxon 2853506], Ectemna dumicola (species) [taxon 2853529], Euceraia insignis (species) [taxon 2853504], Anaulacomera spatulata (species) [taxon 2853503], Euceraia atryx (species) [taxon 2853501], Balboana tibialis (species) [taxon 1395712], Ischnomela pulchripennis (species) [taxon 1930995], Anapolisia colossea (species) [taxon 2853488], Homo sapiens (human, species) [taxon 9606], Chiroptera (bats, order) [taxon 9397]
- **Cell lines:** S2 — Drosophila melanogaster (Fruit fly), Spontaneously immortalized cell line (CVCL_Z232)

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC11070257/full.md

## Figures

6 figures with captions in the complete paper: https://tomesphere.com/paper/PMC11070257/full.md

## References

76 references — full list in the complete paper: https://tomesphere.com/paper/PMC11070257/full.md

---
Source: https://tomesphere.com/paper/PMC11070257