Melody-Lyrics Matching with Contrastive Alignment Loss
By: Changhong Wang, Michel Olvera, Gaël Richard
Potential Business Impact:
Finds matching words for a song's tune.
The connection between music and lyrics is far beyond semantic bonds. Conceptual pairs in the two modalities such as rhythm and rhyme, note duration and syllabic stress, and structure correspondence, raise a compelling yet seldom-explored direction in the field of music information retrieval. In this paper, we present melody-lyrics matching (MLM), a new task which retrieves potential lyrics for a given symbolic melody from text sources. Rather than generating lyrics from scratch, MLM essentially exploits the relationships between melody and lyrics. We propose a self-supervised representation learning framework with contrastive alignment loss for melody and lyrics. This has the potential to leverage the abundance of existing songs with paired melody and lyrics. No alignment annotations are required. Additionally, we introduce sylphone, a novel representation for lyrics at syllable-level activated by phoneme identity and vowel stress. We demonstrate that our method can match melody with coherent and singable lyrics with empirical results and intuitive examples. We open source code and provide matching examples on the companion webpage: https://github.com/changhongw/mlm.
Similar Papers
Versatile Symbolic Music-for-Music Modeling via Function Alignment
Sound
AI writes music by learning music's own language.
Large Language Models' Internal Perception of Symbolic Music
Computation and Language
Computers learn music from text descriptions.
AudioCodecBench: A Comprehensive Benchmark for Audio Codec Evaluation
Sound
Helps computers understand sounds and music better.