Wireless Deep Video Semantic Transmission
Sixian Wang, Jincheng Dai, Zijian Liang, Kai Niu, Zhongwei Si, Chao, Dong, Xiaoqi Qin, Ping Zhang

TL;DR
This paper introduces a novel deep joint source-channel coding framework called DVST for wireless video transmission, leveraging semantic feature extraction and adaptive rate control to improve efficiency and quality over traditional methods.
Contribution
The paper presents a new deep learning-based semantic transmission framework that adaptively encodes video features for wireless channels, outperforming traditional schemes in rate-distortion performance.
Findings
DVST surpasses traditional wireless video coding schemes in experiments.
The framework effectively adapts to different video content and communication scenarios.
DVST supports future semantic communication with content-aware and task-integrated capabilities.
Abstract
In this paper, we design a new class of high-efficiency deep joint source-channel coding methods to achieve end-to-end video transmission over wireless channels. The proposed methods exploit nonlinear transform and conditional coding architecture to adaptively extract semantic features across video frames, and transmit semantic feature domain representations over wireless channels via deep joint source-channel coding. Our framework is collected under the name deep video semantic transmission (DVST). In particular, benefiting from the strong temporal prior provided by the feature domain context, the learned nonlinear transform function becomes temporally adaptive, resulting in a richer and more accurate entropy model guiding the transmission of current frame. Accordingly, a novel rate adaptive transmission mechanism is developed to customize deep joint source-channel coding for video…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSparse and Compressive Sensing Techniques · Speech and Audio Processing · Indoor and Outdoor Localization Technologies
