Generating Narrated Lecture Videos from Slides with Synchronized Highlights
Alexander Holmberg

TL;DR
This paper presents an automated system that converts slide decks into engaging narrated video lectures with synchronized visual highlights, leveraging advanced alignment techniques for precise attention guidance.
Contribution
The novel highlight alignment module accurately maps spoken phrases to slide locations using multiple strategies, enabling scalable and cost-effective video lecture generation.
Findings
High location accuracy with F1 > 92% using LLM-based alignment
Significant cost reduction to under $1 per hour of video
Effective handling of complex, math-heavy slide content
Abstract
Turning static slides into engaging video lectures takes considerable time and effort, requiring presenters to record explanations and visually guide their audience through the material. We introduce an end-to-end system designed to automate this process entirely. Given a slide deck, this system synthesizes a video lecture featuring AI-generated narration synchronized precisely with dynamic visual highlights. These highlights automatically draw attention to the specific concept being discussed, much like an effective presenter would. The core technical contribution is a novel highlight alignment module. This module accurately maps spoken phrases to locations on a given slide using diverse strategies (e.g., Levenshtein distance, LLM-based semantic analysis) at selectable granularities (line or word level) and utilizes timestamp-providing Text-to-Speech (TTS) for timing synchronization.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMultimedia Communication and Technology · Video Analysis and Summarization · Subtitles and Audiovisual Media
MethodsSoftmax · Attention Is All You Need
