Generative AI for Video Trailer Synthesis: From Extractive Heuristics to Autoregressive Creativity

Abhishek Dharmaratnakar; Srivaths Ranganathan; Debanshu Das; Anushree Sinha

arXiv:2604.04953·cs.CV·April 8, 2026

Generative AI for Video Trailer Synthesis: From Extractive Heuristics to Autoregressive Creativity

Abhishek Dharmaratnakar, Srivaths Ranganathan, Debanshu Das, Anushree Sinha

PDF

TL;DR

This paper reviews the evolution of automatic video trailer generation from heuristic methods to advanced generative models, highlighting recent AI techniques like LLMs, diffusion models, and foundation models.

Contribution

It provides a comprehensive survey of generative techniques for trailer synthesis, introduces a new taxonomy, and discusses future directions beyond extractive methods.

Findings

01

Deep generative models enable coherent, emotionally resonant trailers.

02

Transition from heuristic extraction to autoregressive and foundation models.

03

Discussion of ethical and economic implications of neural synthesis.

Abstract

The domain of automatic video trailer generation is currently undergoing a profound paradigm shift, transitioning from heuristic-based extraction methods to deep generative synthesis. While early methodologies relied heavily on low-level feature engineering, visual saliency, and rule-based heuristics to select representative shots, recent advancements in Large Language Models (LLMs), Multimodal Large Language Models (MLLMs), and diffusion-based video synthesis have enabled systems that not only identify key moments but also construct coherent, emotionally resonant narratives. This survey provides a comprehensive technical review of this evolution, with a specific focus on generative techniques including autoregressive Transformers, LLM-orchestrated pipelines, and text-to-video foundation models like OpenAI's Sora and Google's Veo. We analyze the architectural progression from Graph…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.