ChronoSC: Task-Oriented Semantic Communication via Temporal-to-Color Encoding

Phuc H. Nguyen; Trung T. Nguyen; Quy N. Duong; and Van-Dinh Nguyen

arXiv:2605.16388·cs.CV·May 19, 2026

ChronoSC: Task-Oriented Semantic Communication via Temporal-to-Color Encoding

Phuc H. Nguyen, Trung T. Nguyen, Quy N. Duong, and Van-Dinh Nguyen

PDF

TL;DR

ChronoSC introduces a lightweight, lossless encoding scheme that compresses temporal video information into a static image for efficient task-oriented semantic communication in VideoQA, significantly reducing bandwidth.

Contribution

It proposes Chrono-Color Stacking and DeepJSCC for extreme temporal compression and explicit visual reconstruction, enabling effective VideoQA with minimal bandwidth.

Findings

01

Achieves up to 192x bandwidth reduction on CLEVRER dataset.

02

Maintains high VideoQA accuracy despite extreme compression.

03

Enables reuse of pre-trained vision-language models with reconstructed images.

Abstract

Semantic communication (SC) aims to reduce transmission overhead by conveying task-relevant information rather than raw data. However, existing SC approaches for video largely focus on pixel-level reconstruction or rely on complex spatiotemporal pipelines, leading to excessive bandwidth usage and latency that are unsuitable for low-resource deployments. In this paper, we propose ChronoSC, a task-oriented semantic communication framework for Video Question Answering (VideoQA). ChronoSC introduces Chrono-Color Stacking, a lightweight and lossless projection scheme that encodes temporal video dynamics into a single static image, enabling extreme temporal compression before transmission. This compact semantic representation is transmitted using a lightweight Deep Joint Source-Channel Coding (DeepJSCC) transceiver and explicitly reconstructed at the receiver. Unlike latent-space methods,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.