Multi-Task Learning for Screen Content Image Coding

Rashid Zamanshoar Heris; Ivan V. Baji\'c

arXiv:2302.02014·eess.IV·February 7, 2023·ISCA

Multi-Task Learning for Screen Content Image Coding

Rashid Zamanshoar Heris, Ivan V. Baji\'c

PDF

Open Access 1 Repo

TL;DR

This paper introduces a learning-based image coding model specifically designed for screen content images that contain both synthetic and natural regions, enhancing compression efficiency by jointly learning reconstruction and segmentation.

Contribution

The paper presents a novel multi-task learning approach that produces a segmentation-friendly latent representation for improved screen content image coding, applicable even without the segmentation task during inference.

Findings

01

The proposed codec outperforms traditional methods on mixed-content SCIs.

02

Joint training for reconstruction and segmentation improves compression quality.

03

Segmentation information enhances the latent representation for better coding efficiency.

Abstract

With the rise of remote work and collaboration, compression of screen content images (SCI) is becoming increasingly important. While there are efficient codecs for natural images, as well as codecs for purely-synthetic images, those SCIs that contain both synthetic and natural content pose a particular challenge. In this paper, we propose a learning-based image coding model developed for such SCIs. By training an encoder to provide a latent representation suitable for two tasks -- input reconstruction and synthetic/natural region segmentation -- we create an effective SCI image codec whose strong performance is verified through experiments. Once trained, the second task (segmentation) need not be used; the codec still benefits from the segmentation-friendly latent representation.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

sfu-multimedia-lab/mlscic
tfOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Data Compression Techniques · Video Coding and Compression Technologies · Advanced Steganography and Watermarking Techniques