A Multi-Stage Optimization Framework for Deploying Learned Image Compression on FPGAs

Jiaxun Fang; Li Chen

arXiv:2511.17135·cs.CV·November 24, 2025

A Multi-Stage Optimization Framework for Deploying Learned Image Compression on FPGAs

Jiaxun Fang, Li Chen

PDF

Open Access

TL;DR

This paper introduces a multi-stage optimization framework that enables efficient deployment of deep learning-based image compression models on FPGAs by combining quantization, pruning, and layer-specific bit-width optimization.

Contribution

The work presents a novel Dynamic Range-Aware Quantization method and hardware-aware optimization techniques tailored for FPGA deployment of LIC models, improving efficiency without sacrificing quality.

Findings

01

BD-rate overhead reduced from 30% to 6.3% with DRAQ

02

Over 20% reduction in computational complexity through hardware-aware optimizations

03

Achieved state-of-the-art efficiency and quality in FPGA-based LIC implementations

Abstract

Deep learning-based image compression (LIC) has achieved state-of-the-art rate-distortion (RD) performance, yet deploying these models on resource-constrained FPGAs remains a major challenge. This work presents a complete, multi-stage optimization framework to bridge the gap between high-performance floating-point models and efficient, hardware-friendly integer-based implementations. First, we address the fundamental problem of quantization-induced performance degradation. We propose a Dynamic Range-Aware Quantization (DRAQ) method that uses statistically-calibrated activation clipping and a novel weight regularization scheme to counteract the effects of extreme data outliers and large dynamic ranges, successfully creating a high-fidelity 8-bit integer model. Second, building on this robust foundation, we introduce two hardware-aware optimization techniques tailored for FPGAs. A…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Data Compression Techniques · Video Coding and Compression Technologies · Numerical Methods and Algorithms