GenAI-enabled Residual Motion Estimation for Energy-Efficient Semantic Video Communication
Shavbo Salehi, Pedro Enrique Iturria-Rivera, Medhat Elsayed, Majid Bavand, Yigit Ozcan, Melike Erol-Kantarci

TL;DR
This paper introduces PENME, a neural motion estimation method for semantic video communication that reduces bandwidth, latency, and power consumption while maintaining high video quality through adaptive residual motion encoding and refinement.
Contribution
The paper presents a novel, predictability-aware, and entropy-adaptive neural motion estimation framework that improves resource efficiency in semantic video transmission.
Findings
Achieves 40% lower latency and 90% less transmitted data compared to traditional methods.
Improves PSNR by about 40% and MS-SSIM by roughly 19% over baseline techniques.
Reduces LPIPS by nearly 35%, indicating better perceptual quality.
Abstract
Semantic communication addresses the limitations of the Shannon paradigm by focusing on transmitting meaning rather than exact representations, thereby reducing unnecessary resource consumption. This is particularly beneficial for video, which dominates network traffic and demands high bandwidth and power, making semantic approaches ideal for conserving resources while maintaining quality. In this paper, we propose a Predictability-aware and Entropy-adaptive Neural Motion Estimation (PENME) method to address challenges related to high latency, high bitrate, and power consumption in video transmission. PENME makes per-frame decisions to select a residual motion extraction model, convolutional neural network, vision transformer, or optical flow, using a five-step policy based on motion strength, global motion consistency, peak sharpness, heterogeneity, and residual error. The residual…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVideo Coding and Compression Technologies · Image and Video Quality Assessment · Advanced Data Compression Techniques
