Learning from Limited Labels: Transductive Graph Label Propagation for Indian Music Analysis
By: Parampreet Singh , Akshay Raina , Sayeedul Islam Sheikh and more
Potential Business Impact:
**Labels music automatically, saving time and effort.**
Supervised machine learning frameworks rely on extensive labeled datasets for robust performance on real-world tasks. However, there is a lack of large annotated datasets in audio and music domains, as annotating such recordings is resource-intensive, laborious, and often require expert domain knowledge. In this work, we explore the use of label propagation (LP), a graph-based semi-supervised learning technique, for automatically labeling the unlabeled set in an unsupervised manner. By constructing a similarity graph over audio embeddings, we propagate limited label information from a small annotated subset to a larger unlabeled corpus in a transductive, semi-supervised setting. We apply this method to two tasks in Indian Art Music (IAM): Raga identification and Instrument classification. For both these tasks, we integrate multiple public datasets along with additional recordings we acquire from Prasar Bharati Archives to perform LP. Our experiments demonstrate that LP significantly reduces labeling overhead and produces higher-quality annotations compared to conventional baseline methods, including those based on pretrained inductive models. These results highlight the potential of graph-based semi-supervised learning to democratize data annotation and accelerate progress in music information retrieval.
Similar Papers
Boosting Generic Semi-Supervised Medical Image Segmentation via Diverse Teaching and Label Propagation
CV and Pattern Recognition
Helps doctors see inside bodies better with less training data.
Bridging Domain Adaptation and Graph Neural Networks: A Tensor-Based Framework for Effective Label Propagation
Machine Learning (CS)
Teaches computers to learn from less labeled data.
ProLAP: Probabilistic Language-Audio Pre-Training
Audio and Speech Processing
Helps computers understand sounds and words better.