Synthetic Swarm Mosquito Dataset for Acoustic Classification: A Proof of Concept
By: Thai-Duy Dinh, Minh-Luan Vo, Cuong Tuan Nguyen and more
Mosquito-borne diseases pose a serious global health threat, causing over 700,000 deaths annually. This work introduces a proof-of-concept Synthetic Swarm Mosquito Dataset for Acoustic Classification, created to simulate realistic multi-species and noisy swarm conditions. Unlike conventional datasets, which require labor-intensive recording of individual mosquitoes, the synthetic approach enables scalable data generation while reducing manual effort. Using log-mel spectrograms as input features, we evaluate lightweight deep learning architectures for mosquito species classification. Experiments show that these models can identify six major mosquito vector species and are suitable for deployment on low-power embedded devices. The study demonstrates the potential of synthetic swarm audio datasets to accelerate acoustic mosquito research and enable scalable real-time surveillance solutions.
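To make the two technical ingredients of the abstract concrete, the following is a minimal sketch of how a synthetic swarm clip might be assembled and turned into a log-mel spectrogram. It is not the authors' pipeline: the function names (`mix_swarm`, `log_mel`), the gain range, the SNR value, the sample rate, the FFT settings, and the use of harmonic tones in place of real single-mosquito recordings are all assumptions chosen for illustration.

```python
import numpy as np

def mix_swarm(sources, noise, snr_db, rng):
    """Overlay single-source clips at random gains, then add noise at a target SNR."""
    length = min(len(s) for s in sources + [noise])
    swarm = np.zeros(length)
    for s in sources:
        gain = rng.uniform(0.3, 1.0)  # assumed range, standing in for distance to the microphone
        swarm += gain * s[:length]
    noise = noise[:length]
    signal_power = np.mean(swarm ** 2)
    noise_power = np.mean(noise ** 2)
    # Scale the noise so that 10*log10(signal_power / scaled_noise_power) equals snr_db.
    scale = np.sqrt(signal_power / (noise_power * 10 ** (snr_db / 10)))
    return swarm + scale * noise

def hz_to_mel(f):
    return 2595.0 * np.log10(1.0 + f / 700.0)

def mel_to_hz(m):
    return 700.0 * (10.0 ** (m / 2595.0) - 1.0)

def mel_filterbank(sr, n_fft, n_mels):
    """Triangular filters spaced evenly on the mel scale from 0 Hz to sr/2."""
    hz_pts = mel_to_hz(np.linspace(hz_to_mel(0.0), hz_to_mel(sr / 2), n_mels + 2))
    fft_freqs = np.linspace(0.0, sr / 2, n_fft // 2 + 1)
    fb = np.zeros((n_mels, n_fft // 2 + 1))
    for i in range(n_mels):
        lo, mid, hi = hz_pts[i], hz_pts[i + 1], hz_pts[i + 2]
        rising = (fft_freqs - lo) / (mid - lo)
        falling = (hi - fft_freqs) / (hi - mid)
        fb[i] = np.maximum(0.0, np.minimum(rising, falling))
    return fb

def log_mel(x, sr, n_fft=512, hop=256, n_mels=40):
    """Log-mel spectrogram in dB, shape (n_frames, n_mels)."""
    n_frames = 1 + (len(x) - n_fft) // hop
    window = np.hanning(n_fft)
    frames = np.stack([x[i * hop : i * hop + n_fft] * window for i in range(n_frames)])
    power = np.abs(np.fft.rfft(frames, axis=1)) ** 2
    mel = power @ mel_filterbank(sr, n_fft, n_mels).T
    return 10.0 * np.log10(mel + 1e-10)  # small offset avoids log(0)

rng = np.random.default_rng(0)
sr = 8000  # assumed sample rate
t = np.arange(sr) / sr
# Harmonic tones as placeholders for single-mosquito recordings; frequencies are illustrative.
clips = [np.sin(2 * np.pi * f0 * t) + 0.5 * np.sin(2 * np.pi * 2 * f0 * t)
         for f0 in (450.0, 600.0)]
noise = rng.standard_normal(sr)
swarm = mix_swarm(clips, noise, snr_db=10.0, rng=rng)
features = log_mel(swarm, sr)
print(features.shape)
```

Here `features` is a frames-by-mel-bands array of the kind a lightweight classifier could take as input. In practice a library such as librosa would likely replace the hand-written filterbank; NumPy alone is used here only to keep the sketch self-contained.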
Similar Papers
Learning Domain-Robust Bioacoustic Representations for Mosquito Species Classification with Contrastive Learning and Distribution Alignment
Audio and Speech Processing
Identifies mosquito species from sounds anywhere.
A Multiclass Acoustic Dataset and Interactive Tool for Analyzing Drone Signatures in Real-World Environments
Sound
Listens for drones by their sounds.
InsectSet459: an open dataset of insect sounds for bioacoustic machine learning
Sound
Helps computers identify bugs by their sounds.