Feature Coding for Scalable Machine Vision
By: Md Eimran Hossain Eimon, Juan Merlos, Ashan Perera and more
Potential Business Impact:
Shrinks computer vision data for faster, private use.
Deep neural networks (DNNs) drive modern machine vision but are challenging to deploy on edge devices due to high compute demands. The traditional approaches, running the full model on-device or offloading it to the cloud, face trade-offs in latency, bandwidth, and privacy. Splitting the inference workload between the edge and the cloud offers a balanced solution, but transmitting the intermediate features that such splitting requires introduces new bandwidth challenges. To address this, the Moving Picture Experts Group (MPEG) initiated the Feature Coding for Machines (FCM) standard, establishing a bitstream syntax and codec pipeline tailored for compressing intermediate features. This paper presents the design and performance of the Feature Coding Test Model (FCTM), showing significant bitrate reductions, averaging 85.14%, across multiple vision tasks while preserving accuracy. FCM offers a scalable path for efficient and interoperable deployment of intelligent features in bandwidth-limited and privacy-sensitive consumer applications.
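To make the split-inference idea concrete, here is a toy Python sketch of coding an intermediate feature vector before it leaves the edge device and measuring the resulting bitrate reduction. It is not the FCTM pipeline or the FCM bitstream syntax: the function names (`edge_part`, `encode_features`, `decode_features`, `bitrate_reduction`), the 8-bit uniform quantizer, and the use of `zlib` as the entropy coder are all assumptions chosen only for illustration, and the random features stand in for a real network's activations.

```python
import random
import struct
import zlib


def edge_part(n, seed=0):
    """Hypothetical stand-in for the on-device half of a split network.

    Returns n ReLU-like activations (non-negative, roughly half zeros).
    """
    rng = random.Random(seed)
    return [max(0.0, rng.gauss(0.0, 1.0)) for _ in range(n)]


def encode_features(features):
    """Quantize features to 8 bits and compress them (illustrative only)."""
    peak = max(features) or 1.0  # avoid dividing by zero for all-zero input
    quantized = bytes(round(f / peak * 255) for f in features)
    # A 4-byte header carries the scale needed to undo the quantization.
    return struct.pack("<f", peak) + zlib.compress(quantized, 9)


def decode_features(blob):
    """Invert encode_features on the cloud side, up to quantization error."""
    (peak,) = struct.unpack("<f", blob[:4])
    quantized = zlib.decompress(blob[4:])
    return [q / 255 * peak for q in quantized]


def bitrate_reduction(raw_bytes, coded_bytes):
    """Percentage of bytes saved relative to the uncompressed size."""
    return 100.0 * (1.0 - coded_bytes / raw_bytes)


if __name__ == "__main__":
    features = edge_part(4096)
    blob = encode_features(features)
    raw_size = 4 * len(features)  # features sent as raw float32 values
    print(f"raw: {raw_size} B, coded: {len(blob)} B")
    print(f"reduction: {bitrate_reduction(raw_size, len(blob)):.2f}%")
```

Read this way, the 85.14% average reduction reported in the abstract means the coded features occupy about 14.86% of the reference bitrate. The savings in this toy come from a far cruder scheme than the one the paper describes, so its numbers are not comparable to the reported results.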
Similar Papers
Enabling Next-Generation Consumer Experience with Feature Coding for Machines
CV and Pattern Recognition
Lets small gadgets use big smart computer brains.
Emerging Standards for Machine-to-Machine Video Coding
CV and Pattern Recognition
Lets computers share video data faster and safer.
New VVC profiles targeting Feature Coding for Machines
CV and Pattern Recognition
Compresses computer "thoughts" for faster AI.