Vega-Video: Integrating Video into the Grammar of Graphics
Dominik Winecki, Arnab Nandi

TL;DR
This paper introduces Vega-Video, a system that integrates video data into the Vega visualization grammar, enabling high-performance, synchronized, and interactive video visualizations with real-time updates.
Contribution
It presents novel abstractions and a split-signal architecture that allow seamless, responsive, and declarative integration of video into data visualizations.
Findings
Achieved up to 4x responsiveness improvements during scrubbing interactions.
Delivered sub-200ms updates for multi-hour videos using VOD protocols.
Enabled high-performance, synchronized video visualization within Vega.
Abstract
Video data is increasingly used alongside conventional data for interactive data exploration, necessitating interfaces for exploring and presenting mixed-modality data. However, integrating video into visualizations remains difficult due to its distinct paradigms and inherent performance challenges. We identify three classes of video data visualization - synchronization, annotation, and transformation - and integrate them into the Vega declarative grammar. We show that these abstractions enable high-performance implementation. To reconcile Vega's instantaneous dataflow with video player state, we introduce a split-signal architecture that preserves declarative semantics while masking video update delays. We detect continuous scrubbing interactions at compile time to apply encoding-aware optimizations that improve responsiveness by up to 4x. We also repurpose VOD protocols to transform…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
