The Inverse Drum Machine: Source Separation Through Joint Transcription and Analysis-by-Synthesis
By: Bernardo Torres, Geoffroy Peeters, Gael Richard
Potential Business Impact:
Separates music into drum sounds without needing full songs.
We introduce the Inverse Drum Machine (IDM), a novel approach to drum source separation that combines analysis-by-synthesis with deep learning. Unlike recent supervised methods that rely on isolated stems, IDM requires only transcription annotations. It jointly optimizes automatic drum transcription and one-shot drum sample synthesis in an end-to-end framework. By convolving synthesized one-shot samples with estimated onsets-mimicking a drum machine-IDM reconstructs individual drum stems and trains a neural network to match the original mixture. Evaluations on the StemGMD dataset show that IDM achieves separation performance on par with state-of-the-art supervised methods, while substantially outperforming matrix decomposition baselines.
Similar Papers
MGE-LDM: Joint Latent Diffusion for Simultaneous Music Generation and Source Extraction
Sound
Creates and separates any music sounds.
Musical Source Separation of Brazilian Percussion
Audio and Speech Processing
Separates samba drums from music using smart computer.
Enhanced Automatic Drum Transcription via Drum Stem Source Separation
Sound
Makes drum music sound more real.