Perception-Aware Video Semantic Communication

Yinhuan Huang; Zhijin Qin

arXiv:2605.19397·eess.IV·May 20, 2026

TL;DR

This paper introduces PVSC, a perception-aware video semantic communication framework that significantly reduces bandwidth usage while maintaining high perceptual quality for real-time wireless video streaming.

Contribution

PVSC is a novel framework that eliminates explicit motion-vector transmission and uses spatio-temporal feature coding for efficient, perception-aligned wireless video transmission.

Findings

01

PVSC saves up to 75% of the bandwidth relative to the baseline.

02

PVSC achieves comparable perceptual quality with less bandwidth.

03

PVSC enables real-time inference on a single GPU.

Abstract

Ultra-high-resolution streaming and emerging immersive services are driving rapidly increasing wireless video traffic. However, perceptually pleasing video transmission over bandwidth-limited and latency-constrained wireless links remains challenging for conventional separated source-channel systems, which primarily target bit-level reliability and often suffer performance degradation under short-blocklength transmission. In addition, pixel-level distortion optimization does not necessarily align with human perception, while existing learned video codecs may incur high complexity and raise deployment issues. This paper proposes PVSC, a perception-aware video semantic communication framework for real-time wireless video transmission. PVSC eliminates explicit motion-vector transmission and exploits spatio-temporal feature coding to generate compact and channel-robust symbol streams. It…
