Leveraging Compression to Construct Transferable Bitrate Ladders
By: Krishna Srikar Durbha , Hassene Tmar , Ping-Hao Wu and more
Potential Business Impact:
Makes videos look better with less data.
Over the past few years, per-title and per-shot video encoding techniques have demonstrated significant gains as compared to conventional techniques such as constant CRF encoding and the fixed bitrate ladder. These techniques have demonstrated that constructing content-gnostic per-shot bitrate ladders can provide significant bitrate gains and improved Quality of Experience (QoE) for viewers under various network conditions. However, constructing a convex hull for every video incurs a significant computational overhead. Recently, machine learning-based bitrate ladder construction techniques have emerged as a substitute for convex hull construction. These methods operate by extracting features from source videos to train machine learning (ML) models to construct content-adaptive bitrate ladders. Here, we present a new ML-based bitrate ladder construction technique that accurately predicts the VMAF scores of compressed videos, by analyzing the compression procedure and by making perceptually relevant measurements on the source videos prior to compression. We evaluate the performance of our proposed framework against leading prior methods on a large corpus of videos. Since training ML models on every encoder setting is time-consuming, we also investigate how per-shot bitrate ladders perform under different encoding settings. We evaluate the performance of all models against the fixed bitrate ladder and the best possible convex hull constructed using exhaustive encoding with Bjontegaard-delta metrics.
Similar Papers
Content Adaptive Encoding For Interactive Game Streaming
Image and Video Processing
Makes game streaming look better with less delay.
Bitrate-Controlled Diffusion for Disentangling Motion and Content in Video
CV and Pattern Recognition
Separates video's movement from its pictures.
Degradation-Aware Hierarchical Termination for Blind Quality Enhancement of Compressed Video
CV and Pattern Recognition
Improves blurry videos without knowing how blurry they are.