Score: 0

Learning from Limited Labels: Transductive Graph Label Propagation for Indian Music Analysis

Published: January 7, 2026 | arXiv ID: 2601.03626v1

By: Parampreet Singh , Akshay Raina , Sayeedul Islam Sheikh and more

Potential Business Impact:

**Labels music automatically, saving time and effort.**

Business Areas:
Natural Language Processing Artificial Intelligence, Data and Analytics, Software

Supervised machine learning frameworks rely on extensive labeled datasets for robust performance on real-world tasks. However, there is a lack of large annotated datasets in audio and music domains, as annotating such recordings is resource-intensive, laborious, and often require expert domain knowledge. In this work, we explore the use of label propagation (LP), a graph-based semi-supervised learning technique, for automatically labeling the unlabeled set in an unsupervised manner. By constructing a similarity graph over audio embeddings, we propagate limited label information from a small annotated subset to a larger unlabeled corpus in a transductive, semi-supervised setting. We apply this method to two tasks in Indian Art Music (IAM): Raga identification and Instrument classification. For both these tasks, we integrate multiple public datasets along with additional recordings we acquire from Prasar Bharati Archives to perform LP. Our experiments demonstrate that LP significantly reduces labeling overhead and produces higher-quality annotations compared to conventional baseline methods, including those based on pretrained inductive models. These results highlight the potential of graph-based semi-supervised learning to democratize data annotation and accelerate progress in music information retrieval.

Page Count
12 pages

Category
Electrical Engineering and Systems Science:
Audio and Speech Processing