Score: 1

Exploring System Adaptations For Minimum Latency Real-Time Piano Transcription

Published: September 9, 2025 | arXiv ID: 2509.07586v1

By: Patricia Hu , Silvan David Peter , Jan Schlüter and more

Potential Business Impact:

Lets computers hear piano music instantly.

Business Areas:

Natural Language Processing Artificial Intelligence, Data and Analytics, Software

Advances in neural network design and the availability of large-scale labeled datasets have driven major improvements in piano transcription. Existing approaches target either offline applications, with no restrictions on computational demands, or online transcription, with delays of 128-320 ms. However, most real-time musical applications require latencies below 30 ms. In this work, we investigate whether and how the current state-of-the-art online transcription model can be adapted for real-time piano transcription. Specifically, we eliminate all non-causal processing, and reduce computational load through shared computations across core model components and variations in model size. Additionally, we explore different pre- and postprocessing strategies, and related label encoding schemes, and discuss their suitability for real-time transcription. Evaluating the adaptions on the MAESTRO dataset, we find a drop in transcription accuracy due to strictly causal processing as well as a tradeoff between the preprocessing latency and prediction accuracy. We release our system as a baseline to support researchers in designing models towards minimum latency real-time transcription.

TART: A Comprehensive Tool for Technique-Aware Audio-to-Tab Guitar Transcription

Sound

Turns guitar music into written notes.

2 Oct 2025 0

87%

Joint Estimation of Piano Dynamics and Metrical Structure with a Multi-task Multi-Scale Network

Audio and Speech Processing

Helps computers understand piano music's loudness.

21 Oct 2025 0

87%

Efficient Transformer-Based Piano Transcription With Sparse Attention Mechanisms

Sound

Turns piano music into notes faster.

11 Sep 2025 2

View PDF Login to Bookmark

Country of Origin

🇦🇹 Austria

Page Count

8 pages

Exploring System Adaptations For Minimum Latency Real-Time Piano Transcription

Lets computers hear piano music instantly.

Technical Abstract

TART: A Comprehensive Tool for Technique-Aware Audio-to-Tab Guitar Transcription

Joint Estimation of Piano Dynamics and Metrical Structure with a Multi-task Multi-Scale Network

Efficient Transformer-Based Piano Transcription With Sparse Attention Mechanisms