Perception-Aware Video Semantic Communication
Yinhuan Huang, Zhijin Qin

TL;DR
This paper introduces PVSC, a perception-aware video semantic communication framework that reduces bandwidth usage by up to 75% relative to the baseline while maintaining high perceptual quality for real-time wireless video streaming.
Contribution
PVSC is a novel framework that eliminates explicit motion-vector transmission and uses spatio-temporal feature coding for efficient, perception-aligned wireless video transmission.
Findings
PVSC saves up to 75% bandwidth compared to the baseline.
PVSC achieves perceptual quality comparable to the baseline while using less bandwidth.
PVSC enables real-time inference on a single GPU.
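To make the "up to 75%" figure concrete, here is a minimal arithmetic sketch. It assumes the saving is measured as a fraction of the baseline's bandwidth at matched perceptual quality. The function name `bandwidth_after_saving` and the baseline figure of 40,000 channel symbols per frame are hypothetical, chosen only for illustration, and do not come from the paper.

```python
def bandwidth_after_saving(baseline: float, saving_fraction: float) -> float:
    """Return the bandwidth left after saving `saving_fraction` of `baseline`.

    `saving_fraction` is a value in [0, 1]; 0.75 corresponds to a 75% saving.
    """
    if not 0.0 <= saving_fraction <= 1.0:
        raise ValueError("saving_fraction must lie in [0, 1]")
    return baseline * (1.0 - saving_fraction)


# Hypothetical baseline: 40,000 channel symbols per frame (illustrative only).
baseline_symbols = 40_000

# In the best case reported (75% saving), one quarter of the baseline remains.
best_case_symbols = bandwidth_after_saving(baseline_symbols, 0.75)
print(best_case_symbols)  # 10000.0
```

Because the finding is stated as "up to" 75%, this is the best case: under that reading, PVSC would need at least one quarter of the baseline's bandwidth, and more in less favorable settings.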
Abstract
Ultra-high-resolution streaming and emerging immersive services are driving rapidly increasing wireless video traffic. However, perceptually pleasing video transmission over bandwidth-limited and latency-constrained wireless links remains challenging for conventional separated source-channel systems, which primarily target bit-level reliability and often suffer performance degradation under short-blocklength transmission. In addition, pixel-level distortion optimization does not necessarily align with human perception, while existing learned video codecs may incur high complexity and raise deployment issues. This paper proposes PVSC, a perception-aware video semantic communication framework for real-time wireless video transmission. PVSC eliminates explicit motion-vector transmission and exploits spatio-temporal feature coding to generate compact and channel-robust symbol streams. It…
