A Multi-Stage Optimization Framework for Deploying Learned Image Compression on FPGAs
Jiaxun Fang, Li Chen

TL;DR
This paper introduces a multi-stage optimization framework that enables efficient deployment of deep learning-based image compression models on FPGAs by combining quantization, pruning, and layer-specific bit-width optimization.
Contribution
The work presents a novel Dynamic Range-Aware Quantization method and hardware-aware optimization techniques tailored for FPGA deployment of LIC models, improving efficiency without sacrificing quality.
Findings
BD-rate overhead reduced from 30% to 6.3% with DRAQ
Over 20% reduction in computational complexity through hardware-aware optimizations
Achieved state-of-the-art efficiency and quality in FPGA-based LIC implementations
Abstract
Deep learning-based image compression (LIC) has achieved state-of-the-art rate-distortion (RD) performance, yet deploying these models on resource-constrained FPGAs remains a major challenge. This work presents a complete, multi-stage optimization framework to bridge the gap between high-performance floating-point models and efficient, hardware-friendly integer-based implementations. First, we address the fundamental problem of quantization-induced performance degradation. We propose a Dynamic Range-Aware Quantization (DRAQ) method that uses statistically-calibrated activation clipping and a novel weight regularization scheme to counteract the effects of extreme data outliers and large dynamic ranges, successfully creating a high-fidelity 8-bit integer model. Second, building on this robust foundation, we introduce two hardware-aware optimization techniques tailored for FPGAs. A…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Data Compression Techniques · Video Coding and Compression Technologies · Numerical Methods and Algorithms
