New VVC profiles targeting Feature Coding for Machines
By: Md Eimran Hossain Eimon , Ashan Perera , Juan Merlos and more
Modern video codecs have been extensively optimized to preserve perceptual quality, leveraging models of the human visual system. However, in split inference systems-where intermediate features from neural network are transmitted instead of pixel data-these assumptions no longer apply. Intermediate features are abstract, sparse, and task-specific, making perceptual fidelity irrelevant. In this paper, we investigate the use of Versatile Video Coding (VVC) for compressing such features under the MPEG-AI Feature Coding for Machines (FCM) standard. We perform a tool-level analysis to understand the impact of individual coding components on compression efficiency and downstream vision task accuracy. Based on these insights, we propose three lightweight essential VVC profiles-Fast, Faster, and Fastest. The Fast profile provides 2.96% BD-Rate gain while reducing encoding time by 21.8%. Faster achieves a 1.85% BD-Rate gain with a 51.5% speedup. Fastest reduces encoding time by 95.6% with only a 1.71% loss in BD-Rate.
Similar Papers
Real-Time Neural Video Compression with Unified Intra and Inter Coding
CV and Pattern Recognition
Makes videos smaller with better quality.
Block-Partitioning Strategies for Accelerated Multi-rate Encoding in Adaptive VVC Streaming
Multimedia
Makes video streaming faster and better quality.
A Lightweight Dual-Mode Optimization for Generative Face Video Coding
CV and Pattern Recognition
Makes video calls look better on phones.