Exploring Perceptual Audio Quality Measurement on Stereo Processing Using the Open Dataset of Audio Quality
By: Pablo M. Delgado , Sascha Dick , Christoph Thompson and more
Potential Business Impact:
Improves how computers judge sound quality.
ODAQ (Open Dataset of Audio Quality) provides a comprehensive framework for exploring both monaural and binaural audio quality degradations across a range of distortion classes and signals, accompanied by subjective quality ratings. A recent update of ODAQ, focusing on the impact of stereo processing methods such as Mid/Side (MS) and Left/Right (LR), provides test signals and subjective ratings for the in-depth investigation of state-of-the-art objective audio quality metrics. Our evaluation results suggest that, while timbre-focused metrics often yield robust results under simpler conditions, their prediction performance tends to suffer under the conditions with a more complex presentation context. Our findings underscore the importance of modeling the interplay of bottom-up psychoacoustic processes and top-down contextual factors, guiding future research toward models that more effectively integrate both timbral and spatial dimensions of perceived audio quality.
Similar Papers
Evaluating Objective Speech Quality Metrics for Neural Audio Codecs
Sound
Helps pick the best way to test audio quality.
SpeechQualityLLM: LLM-Based Multimodal Assessment of Speech Quality
Sound
Lets computers explain why audio sounds bad.
Non-verbal Perception of Room Acoustics using Multi Dimensional Scaling Metho
Sound
Finds 5 ways rooms sound good for music.