Goal-Oriented Framework for Optical Flow-based Multi-User Multi-Task Video Transmission

Yujie Xu; Shutong Chen; Nan Li; Yansha Deng; Jinhong Yuan; and Robert Schober

arXiv:2603.19995·eess.IV·March 26, 2026

Goal-Oriented Framework for Optical Flow-based Multi-User Multi-Task Video Transmission

Yujie Xu, Shutong Chen, Nan Li, Yansha Deng, Jinhong Yuan, and Robert Schober

PDF

Open Access

TL;DR

This paper introduces a goal-oriented semantic communication framework for multi-user multi-task video transmission using optical flow, improving video quality, classification accuracy, and bandwidth efficiency in wireless systems.

Contribution

The paper presents a novel OF-GSC framework with a semantic encoder, transformer decoder, and DDPG-based bandwidth allocation, enhancing multi-task video transmission performance.

Findings

01

13.47% increase in SSIM for video reconstruction

02

Top-1 accuracy surpassing VideoMAE with 25% data

03

Bandwidth reduction of 25.97% with DDPG algorithm

Abstract

Efficient multi-user multi-task video transmission is an important research topic within the realm of current wireless communication systems. To reduce the transmission burden and save communication resources, we propose a goal-oriented semantic communication framework for optical flow-based multi-user multi-task video transmission (OF-GSC). At the transmitter, we design a semantic encoder that consists of a motion extractor and a patch-level optical flow-based semantic representation extractor to effectively identify and select important semantic representations. At the receiver, we design a transformer-based semantic decoder for high-quality video reconstruction and video classification tasks. To minimize the communication time, we develop a deep deterministic policy gradient (DDPG)-based bandwidth allocation algorithm for multi-user transmission. For video reconstruction tasks, our…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsVideo Coding and Compression Technologies · Image and Video Quality Assessment · Human Pose and Action Recognition