OMR-NET: a two-stage octave multi-scale residual network for screen   content image compression

Shiqi Jiang; Ting Ren; Congrui Fu; Shuai Li; Hui Yuan

arXiv:2407.08545·eess.IV·July 12, 2024

OMR-NET: a two-stage octave multi-scale residual network for screen content image compression

Shiqi Jiang, Ting Ren, Congrui Fu, Shuai Li, Hui Yuan

PDF

TL;DR

This paper introduces OMR-NET, a novel two-stage multi-scale residual network tailored for screen content image compression, effectively capturing high-contrast and repetitive patterns to outperform existing learned image compression methods.

Contribution

It proposes a new two-stage octave convolutional residual architecture with attention mechanisms and a diverse SC dataset, specifically optimized for screen content image compression.

Findings

01

Outperforms existing LIC methods in rate-distortion on SC images.

02

Effective multi-scale learning improves compression quality.

03

New dataset enhances training for screen content images.

Abstract

Screen content (SC) differs from natural scene (NS) with unique characteristics such as noise-free, repetitive patterns, and high contrast. Aiming at addressing the inadequacies of current learned image compression (LIC) methods for SC, we propose an improved two-stage octave convolutional residual blocks (IToRB) for high and low-frequency feature extraction and a cascaded two-stage multi-scale residual blocks (CTMSRB) for improved multi-scale learning and nonlinearity in SC. Additionally, we employ a window-based attention module (WAM) to capture pixel correlations, especially for high contrast regions in the image. We also construct a diverse SC image compression dataset (SDU-SCICD2K) for training, including text, charts, graphics, animation, movie, game and mixture of SC images and NS images. Experimental results show our method, more suited for SC than NS data, outperforms existing…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsSoftmax · Attention Is All You Need