Synthetic Swarm Mosquito Dataset for Acoustic Classification: A Proof of Concept
Thai-Duy Dinh, Minh-Luan Vo, Cuong Tuan Nguyen, Bich-Hien Vo

TL;DR
This paper presents a synthetic mosquito swarm audio dataset to improve acoustic classification of mosquito species, enabling scalable data generation and effective deployment on low-power devices for real-time surveillance.
Contribution
It introduces a novel synthetic dataset for mosquito acoustic classification, reducing the need for labor-intensive recordings and facilitating scalable, real-time mosquito monitoring.
Findings
Deep learning models accurately classify six mosquito species.
Synthetic data enables effective model training with less resource use.
Models are suitable for deployment on embedded low-power devices.
Abstract
Mosquito-borne diseases pose a serious global health threat, causing over 700,000 deaths annually. This work introduces a proof-of-concept Synthetic Swarm Mosquito Dataset for Acoustic Classification, created to simulate realistic multi-species and noisy swarm conditions. Unlike conventional datasets that require labor-intensive recording of individual mosquitoes, the synthetic approach enables scalable data generation while reducing human resource demands. Using log-mel spectrograms, we evaluated lightweight deep learning architectures for the classification of mosquito species. Experiments show that these models can effectively identify six major mosquito vectors and are suitable for deployment on embedded low-power devices. The study demonstrates the potential of synthetic swarm audio datasets to accelerate acoustic mosquito research and enable scalable real-time surveillance…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAnimal Vocal Communication and Behavior · Smart Agriculture and AI · Digital Imaging for Blood Diseases
