Identifying birdsong syllables without labelled data
By: Mélisande Teng , Julien Boussard , David Rolnick and more
Potential Business Impact:
Teaches computers to understand bird songs.
Identifying sequences of syllables within birdsongs is key to tackling a wide array of challenges, including bird individual identification and better understanding of animal communication and sensory-motor learning. Recently, machine learning approaches have demonstrated great potential to alleviate the need for experts to label long audio recordings by hand. However, they still typically rely on the availability of labelled data for model training, restricting applicability to a few species and datasets. In this work, we build the first fully unsupervised algorithm to decompose birdsong recordings into sequences of syllables. We first detect syllable events, then cluster them to extract templates -- syllable representations -- before performing matching pursuit to decompose the recording as a sequence of syllables. We evaluate our automatic annotations against human labels on a dataset of Bengalese finch songs and find that our unsupervised method achieves high performance. We also demonstrate that our approach can distinguish individual birds within a species through their unique vocal signatures, for both Bengalese finches and another species, the great tit.
Similar Papers
A Bird Song Detector for improving bird identification through Deep Learning: a case study from Doñana
Sound
Helps scientists find birds by listening to sounds.
Unsupervised outlier detection to improve bird audio dataset labels
Machine Learning (CS)
Cleans bird sounds so computers can learn them.
An Automated Pipeline for Few-Shot Bird Call Classification: A Case Study with the Tooth-Billed Pigeon
Machine Learning (CS)
Helps find rare birds with just a few sounds.