InsectSet459: an open dataset of insect sounds for bioacoustic machine learning
By: Marius Faiß, Burooj Ghani, Dan Stowell
Potential Business Impact:
Helps computers identify insect species by their sounds.
Automatic recognition of insect sounds could help us understand changing biodiversity trends around the world, but insect sounds are challenging to recognize even for deep learning. We present a new dataset comprising 26,399 audio files from 459 species of Orthoptera and Cicadidae. It is the first large-scale dataset of insect sound that is readily usable for developing novel deep-learning methods. Its recordings were made with a variety of audio recorders at varying sample rates, in order to capture the extremely broad range of frequencies that insects produce. We benchmark performance with two state-of-the-art deep learning classifiers, demonstrating good performance but also significant room for improvement in acoustic insect classification. This dataset can serve as a realistic test case for implementing insect monitoring workflows, and as a challenging basis for the development of audio representation methods that can handle highly variable frequencies and/or sample rates.
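To illustrate what handling variable sample rates can involve, the following minimal sketch computes a magnitude spectrogram whose analysis window is specified in seconds rather than in samples, so that the spacing between frequency bins (in Hz) stays the same whatever the recording's sample rate. This is an illustrative assumption and not the method used in the paper's benchmarks; the function names and the default window and hop durations are hypothetical.

```python
import numpy as np


def stft_params(sample_rate, window_s=0.01, hop_s=0.005):
    """Convert window and hop durations (seconds) to lengths in samples."""
    n_fft = int(round(sample_rate * window_s))
    hop = int(round(sample_rate * hop_s))
    return n_fft, hop


def magnitude_spectrogram(signal, sample_rate, window_s=0.01, hop_s=0.005):
    """Magnitude spectrogram with a fixed-duration window (hypothetical helper).

    Returns (spec, freqs): spec has shape (n_frames, n_bins) and freqs holds
    the centre frequency of each bin in Hz.
    """
    signal = np.asarray(signal, dtype=float)
    n_fft, hop = stft_params(sample_rate, window_s, hop_s)
    if len(signal) < n_fft:
        raise ValueError("signal is shorter than one analysis window")

    window = np.hanning(n_fft)
    n_frames = 1 + (len(signal) - n_fft) // hop
    # Slice the signal into overlapping, windowed frames.
    frames = np.stack(
        [signal[i * hop : i * hop + n_fft] * window for i in range(n_frames)]
    )
    spec = np.abs(np.fft.rfft(frames, axis=1))
    # Bin spacing is sample_rate / n_fft, i.e. about 1 / window_s Hz.
    freqs = np.fft.rfftfreq(n_fft, d=1.0 / sample_rate)
    return spec, freqs
```

With a 10 ms window, a 44.1 kHz recording and a 96 kHz recording both yield bins roughly 100 Hz apart. The higher sample rate simply adds more bins at the top, up to its higher Nyquist frequency, so a model still has to cope with inputs whose frequency axes differ in length.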