LLMPopcorn: Exploring LLMs as Assistants for Popular Micro-video Generation
Junchen Fu, Xuri Ge, Kaiwen Zheng, Alexandros Karatzoglou, Ioannis Arapakis, Xin Xin, Yongxin Ni, Joemon M. Jose

TL;DR
This paper investigates how large language models can be used to autonomously generate popular micro-videos, demonstrating that advanced LLMs can produce content rivaling human-created videos and exploring prompt-based enhancements for better results.
Contribution
It is the first to explore LLM-assisted micro-video generation, benchmarking various models and proposing prompt enhancements to improve video popularity.
Findings
DeepSeek-V3 generates videos with popularity comparable to human content
Prompt enhancements significantly improve video popularity
DeepSeek-V3 and R1 outperform other LLMs, LTX-Video and HunyuanVideo in benchmarks
Abstract
In an era where micro-videos dominate platforms like TikTok and YouTube, AI-generated content is nearing cinematic quality. The next frontier is using large language models (LLMs) to autonomously create viral micro-videos, a largely untapped potential that could shape the future of AI-driven content creation. To address this gap, this paper presents the first exploration of LLM-assisted popular micro-video generation (LLMPopcorn). We selected popcorn as the icon for this paper because it symbolizes leisure and entertainment, aligning with this study on leveraging LLMs as assistants for generating popular micro-videos that are often consumed during leisure time. Specifically, we empirically study the following research questions: (i) How can LLMs be effectively utilized to assist popular micro-video generation? (ii) To what extent can prompt-based enhancements optimize the LLM-generated…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMultimedia Communication and Technology · Video Analysis and Summarization
