T-GVC: Trajectory-Guided Generative Video Coding at Ultra-Low Bitrates

Zhitao Wang; Hengyu Man; Wenrui Li; Xingtao Wang; Xiaopeng Fan; Debin Zhao

arXiv:2507.07633·cs.CV·November 12, 2025

T-GVC: Trajectory-Guided Generative Video Coding at Ultra-Low Bitrates

Zhitao Wang, Hengyu Man, Wenrui Li, Xingtao Wang, Xiaopeng Fan, Debin Zhao

PDF

1 Video

TL;DR

T-GVC introduces a novel generative video coding framework that combines semantic-aware sparse motion sampling with trajectory-aligned loss constraints, enabling ultra-low bitrate video compression with realistic motion and high semantic fidelity.

Contribution

It proposes a trajectory-guided approach that reduces bitrate while maintaining motion quality, integrating motion tracking with semantic understanding in a training-free guidance mechanism.

Findings

01

Outperforms traditional and neural codecs at ultra-low bitrates

02

Achieves more precise motion control than text-guided methods

03

Preserves critical semantic and motion details in compressed videos

Abstract

Recent advances in video generation techniques have given rise to an emerging paradigm of generative video coding for Ultra-Low Bitrate (ULB) scenarios by leveraging powerful generative priors. However, most existing methods are limited by domain specificity (e.g., facial or human videos) or excessive dependence on high-level text guidance, which tend to inadequately capture fine-grained motion details, leading to unrealistic or incoherent reconstructions. To address these challenges, we propose Trajectory-Guided Generative Video Coding (dubbed T-GVC), a novel framework that bridges low-level motion tracking with high-level semantic understanding. T-GVC features a semantic-aware sparse motion sampling pipeline that extracts pixel-wise motion as sparse trajectory points based on their semantic importance, significantly reducing the bitrate while preserving critical temporal semantic…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

T-GVC: Trajectory-Guided Generative Video Coding at Ultra-Low Bitrates· underline