EngravingGNN: A Hybrid Graph Neural Network for End-to-End Piano Score Engraving
By: Emmanouil Karystinaios, Francesco Foscarin, Gerhard Widmer
Potential Business Impact:
Turns music into written notes automatically.
This paper focuses on automatic music engraving, i.e., the creation of a humanly-readable musical score from musical content. This step is fundamental for all applications that include a human player, but it remains a mostly unexplored topic in symbolic music processing. In this work, we formalize the problem as a collection of interdependent subtasks, and propose a unified graph neural network (GNN) framework that targets the case of piano music and quantized symbolic input. Our method employs a multi-task GNN to jointly predict voice connections, staff assignments, pitch spelling, key signature, stem direction, octave shifts, and clef signs. A dedicated postprocessing pipeline generates print-ready MusicXML/MEI outputs. Comprehensive evaluation on two diverse piano corpora (J-Pop and DCML Romantic) demonstrates that our unified model achieves good accuracy across all subtasks, compared to existing systems that only specialize in specific subtasks. These results indicate that a shared GNN encoder with lightweight task-specific decoders in a multi-task setting offers a scalable and effective solution for automatic music engraving.
Similar Papers
AnalysisGNN: Unified Music Analysis with Graph Neural Networks
Sound
Helps computers understand music better.
A Hybrid Supervised and Self-Supervised Graph Neural Network for Edge-Centric Applications
Machine Learning (CS)
Finds hidden connections between things.
Towards An Integrated Approach for Expressive Piano Performance Synthesis from Music Scores
Sound
Makes music sound like a real piano player.