ROI-Packing: Efficient Region-Based Compression for Machine Vision
By: Md Eimran Hossain Eimon , Alena Krause , Ashan Perera and more
Potential Business Impact:
Makes computer vision smarter by shrinking image files.
This paper introduces ROI-Packing, an efficient image compression method tailored specifically for machine vision. By prioritizing regions of interest (ROI) critical to end-task accuracy and packing them efficiently while discarding less relevant data, ROI-Packing achieves significant compression efficiency without requiring retraining or fine-tuning of end-task models. Comprehensive evaluations across five datasets and two popular tasks-object detection and instance segmentation-demonstrate up to a 44.10% reduction in bitrate without compromising end-task accuracy, along with an 8.88 % improvement in accuracy at the same bitrate compared to the state-of-the-art Versatile Video Coding (VVC) codec standardized by the Moving Picture Experts Group (MPEG).
Similar Papers
ROI-Guided Point Cloud Geometry Compression Towards Human and Machine Vision
CV and Pattern Recognition
Shrinks 3D scans without losing important details.
ROI-based Deep Image Compression with Implicit Bit Allocation
Image and Video Processing
Makes important picture parts clearer, saves space.
Region-Adaptive Video Sharpening via Rate-Perception Optimization
CV and Pattern Recognition
Makes videos clearer while saving space.