ChronoSC: Task-Oriented Semantic Communication via Temporal-to-Color Encoding
Phuc H. Nguyen, Trung T. Nguyen, Quy N. Duong, and Van-Dinh Nguyen

TL;DR
ChronoSC introduces a lightweight, lossless encoding scheme that compresses temporal video information into a static image for efficient task-oriented semantic communication in VideoQA, significantly reducing bandwidth.
Contribution
It proposes Chrono-Color Stacking and DeepJSCC for extreme temporal compression and explicit visual reconstruction, enabling effective VideoQA with minimal bandwidth.
Findings
Achieves up to 192x bandwidth reduction on CLEVRER dataset.
Maintains high VideoQA accuracy despite extreme compression.
Enables reuse of pre-trained vision-language models with reconstructed images.
Abstract
Semantic communication (SC) aims to reduce transmission overhead by conveying task-relevant information rather than raw data. However, existing SC approaches for video largely focus on pixel-level reconstruction or rely on complex spatiotemporal pipelines, leading to excessive bandwidth usage and latency that are unsuitable for low-resource deployments. In this paper, we propose ChronoSC, a task-oriented semantic communication framework for Video Question Answering (VideoQA). ChronoSC introduces Chrono-Color Stacking, a lightweight and lossless projection scheme that encodes temporal video dynamics into a single static image, enabling extreme temporal compression before transmission. This compact semantic representation is transmitted using a lightweight Deep Joint Source-Channel Coding (DeepJSCC) transceiver and explicitly reconstructed at the receiver. Unlike latent-space methods,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
