COLI: A Hierarchical Efficient Compressor for Large Images

Haoran Wang; Hanyu Pei; Yang Lyu; Kai Zhang; Li Li; Feng-Lei Fan

arXiv:2507.11443·cs.CV·November 25, 2025

COLI: A Hierarchical Efficient Compressor for Large Images

Haoran Wang, Hanyu Pei, Yang Lyu, Kai Zhang, Li Li, Feng-Lei Fan

PDF

Open Access

TL;DR

COLI introduces a hierarchical compression framework for large images that combines accelerated neural representation training with a novel hyper-compression technique, achieving high-quality results with reduced storage and faster processing.

Contribution

The paper presents COLI, a new method that significantly improves the efficiency and compression ratios of INR-based large image compression through pretraining, parallel training, and hyper-compression.

Findings

01

Achieves superior PSNR and SSIM at lower bits per pixel.

02

Speeds up NeRV training by up to 4 times.

03

Maintains minimal distortion with high compression ratios.

Abstract

The escalating adoption of high-resolution, large-field-of-view imagery amplifies the need for efficient compression methodologies. Conventional techniques frequently fail to preserve critical image details, while data-driven approaches exhibit limited generalizability. Implicit Neural Representations (INRs) present a promising alternative by learning continuous mappings from spatial coordinates to pixel intensities for individual images, thereby storing network weights rather than raw pixels and avoiding the generalization problem. However, INR-based compression of large images faces challenges including slow compression speed and suboptimal compression ratios. To address these limitations, we introduce COLI (Compressor for Large Images), a novel framework leveraging Neural Representations for Videos (NeRV). First, recognizing that INR-based compression constitutes a training process,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsComputer Graphics and Visualization Techniques · Advanced Vision and Imaging · CCD and CMOS Imaging Sensors

MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings