StoryNavi: On-Demand Narrative-Driven Reconstruction of Video Play With   Generative AI

Alston Lantian Xu; Tianwei Ma; Tianmeng Liu; Can Liu; Alvaro; Cassinelli

arXiv:2410.03207·cs.HC·October 7, 2024

StoryNavi: On-Demand Narrative-Driven Reconstruction of Video Play With Generative AI

Alston Lantian Xu, Tianwei Ma, Tianmeng Liu, Can Liu, Alvaro, Cassinelli

PDF

Open Access

TL;DR

StoryNavi leverages generative AI and large language models to create customized, narrative-driven, non-linear video plays from lengthy videos, enhancing user engagement and retrieval efficiency.

Contribution

The paper introduces a novel system that constructs personalized, narrative-based video sequences from original videos using retrieval and generative AI, enabling flexible playback modes.

Findings

01

Adequate retrieval performance compared to human retrieval

02

Narrative coherence improves user engagement

03

User preferences vary based on video genre and query

Abstract

Manually navigating lengthy videos to seek information or answer questions can be a tedious and time-consuming task for users. We introduce StoryNavi, a novel system powered by VLLMs for generating customised video play experiences by retrieving materials from original videos. It directly answers users' query by constructing non-linear sequence with identified relevant clips to form a cohesive narrative. StoryNavi offers two modes of playback of the constructed video plays: 1) video-centric, which plays original audio and skips irrelevant segments, and 2) narrative-centric, narration guides the experience, and the original audio is muted. Our technical evaluation showed adequate retrieval performance compared to human retrieval. Our user evaluation shows that maintaining narrative coherence significantly enhances user engagement when viewing disjointed video segments. However, factors…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsArtificial Intelligence in Games · Digital Games and Media · Data Visualization and Analytics