C3: High-performance and low-complexity neural compression from a single   image or video

Hyunjik Kim; Matthias Bauer; Lucas Theis; Jonathan Richard Schwarz,; Emilien Dupont

arXiv:2312.02753·eess.IV·December 6, 2023·1 cites

C3: High-performance and low-complexity neural compression from a single image or video

Hyunjik Kim, Matthias Bauer, Lucas Theis, Jonathan Richard Schwarz,, Emilien Dupont

PDF

Open Access 1 Repo

TL;DR

C3 is a neural compression method that overfits a small model to each image or video, achieving high rate-distortion performance with significantly lower decoding complexity compared to existing neural codecs.

Contribution

The paper introduces C3, a neural compression approach that overfits a small model per data item, reducing decoding complexity while maintaining competitive RD performance, and extends it to videos.

Findings

01

Matches VTM RD performance on CLIC2020 with <3k MACs/pixel

02

Matches Video Compression Transformer RD on UVG with <5k MACs/pixel

03

Decoding complexity is an order of magnitude lower than neural baselines

Abstract

Most neural compression models are trained on large datasets of images or videos in order to generalize to unseen data. Such generalization typically requires large and expressive architectures with a high decoding complexity. Here we introduce C3, a neural compression method with strong rate-distortion (RD) performance that instead overfits a small model to each image or video separately. The resulting decoding complexity of C3 can be an order of magnitude lower than neural baselines with similar RD performance. C3 builds on COOL-CHIC (Ladune et al.) and makes several simple and effective improvements for images. We further develop new methodology to apply C3 to videos. On the CLIC2020 image benchmark, we match the RD performance of VTM, the reference implementation of the H.266 codec, with less than 3k MACs/pixel for decoding. On the UVG video benchmark, we match the RD performance of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

orange-opensource/cool-chic
pytorch

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications · Image and Signal Denoising Methods · Model Reduction and Neural Networks

MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · Absolute Position Encodings · Dropout · Dense Connections · Byte Pair Encoding · Softmax · Layer Normalization · Position-Wise Feed-Forward Layer