Lightweight Embedded FPGA Deployment of Learned Image Compression with Knowledge Distillation and Hybrid Quantization
By: Alaa Mazouz , Sumanta Chaudhuri , Marco Cagnanzzo and more
Potential Business Impact:
Makes video files smaller with less effort.
Learnable Image Compression (LIC) has shown the potential to outperform standardized video codecs in RD efficiency, prompting the research for hardware-friendly implementations. Most existing LIC hardware implementations prioritize latency to RD-efficiency and through an extensive exploration of the hardware design space. We present a novel design paradigm where the burden of tuning the design for a specific hardware platform is shifted towards model dimensioning and without compromising on RD-efficiency. First, we design a framework for distilling a leaner student LIC model from a reference teacher: by tuning a single model hyperparameters, we can meet the constraints of different hardware platforms without a complex hardware design exploration. Second, we propose a hardware-friendly implementation of the Generalized Divisive Normalization - GDN activation that preserves RD efficiency even post parameter quantization. Third, we design a pipelined FPGA configuration which takes full advantage of available FPGA resources by leveraging parallel processing and optimizing resource allocation. Our experiments with a state of the art LIC model show that we outperform all existing FPGA implementations while performing very close to the original model.
Similar Papers
A Multi-Stage Optimization Framework for Deploying Learned Image Compression on FPGAs
CV and Pattern Recognition
Makes phone pictures smaller without losing quality.
Security and Real-time FPGA integration for Learned Image Compression
Cryptography and Security
Makes pictures smaller and keeps them safe.
Efficient Learned Image Compression Through Knowledge Distillation
CV and Pattern Recognition
Makes AI image compression faster and use less power.