Machines Serve Human: A Novel Variable Human-machine Collaborative Compression Framework

Zifu Zhang; Shengxi Li; Xiancheng Sun; Mai Xu; Zhengyuan Liu; Jingyuan Xia

arXiv:2511.08915·cs.CV·November 13, 2025

Machines Serve Human: A Novel Variable Human-machine Collaborative Compression Framework

Zifu Zhang, Shengxi Li, Xiancheng Sun, Mai Xu, Zhengyuan Liu, Jingyuan Xia

PDF

Open Access

TL;DR

This paper introduces a novel collaborative compression framework where machine vision guides human vision compression, utilizing a diffusion prior to enhance detail restoration, resulting in superior performance over existing methods.

Contribution

It presents the first machine-vision-oriented collaborative compression method, with a plug-and-play variable bit-rate strategy and diffusion-prior based feature aggregation for improved image/video compression.

Findings

01

Superior performance on machine-vision compression tasks

02

Enhanced human-vision image/video quality

03

Effective semantic aggregation from machine to human vision

Abstract

Human-machine collaborative compression has been receiving increasing research efforts for reducing image/video data, serving as the basis for both human perception and machine intelligence. Existing collaborative methods are dominantly built upon the de facto human-vision compression pipeline, witnessing deficiency on complexity and bit-rates when aggregating the machine-vision compression. Indeed, machine vision solely focuses on the core regions within the image/video, requiring much less information compared with the compressed information for human vision. In this paper, we thus set out the first successful attempt by a novel collaborative compression method based on the machine-vision-oriented compression, instead of human-vision pipeline. In other words, machine vision serves as the basis for human vision within collaborative compression. A plug-and-play variable bit-rate…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Data Compression Techniques · Video Coding and Compression Technologies · Image and Video Quality Assessment