Probabilistic Adaptation of Text-to-Video Models
Mengjiao Yang, Yilun Du, Bo Dai, Dale Schuurmans, Joshua B. Tenenbaum, Pieter Abbeel

TL;DR
This paper introduces Video Adapter, a probabilistic method to adapt large text-to-video models to specific tasks without finetuning, enabling high-quality, task-specific video generation in domains like animation and robotics.
Contribution
Proposes Video Adapter, a novel probabilistic approach that leverages the score function of a large pretrained video diffusion model for task-specific adaptation without finetuning.
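To make the idea of using a pretrained score function as a prior concrete, here is a toy sketch, not the paper's implementation. It assumes that the large model's score is simply added to the score of a small task-specific model (a product of the two distributions), which the summary above does not spell out. Both "models" are 1D Gaussians with analytic scores, and the names gaussian_score, composed_score, langevin_sample, and prior_weight are hypothetical.

```python
import numpy as np


def gaussian_score(x, mean, std):
    """Score (gradient of the log-density) of a 1D Gaussian N(mean, std^2)."""
    return -(x - mean) / std**2


def composed_score(x, prior_weight=1.0):
    """Sum of a 'small model' score and a weighted 'pretrained prior' score.

    Assumption for illustration only: the broad prior is N(0, 2^2) and the
    narrow task-specific model is N(3, 1^2). Neither stands in for a real
    video diffusion model.
    """
    prior = gaussian_score(x, mean=0.0, std=2.0)   # frozen, broad prior
    small = gaussian_score(x, mean=3.0, std=1.0)   # small task-specific model
    return small + prior_weight * prior


def langevin_sample(score_fn, n_samples=5000, n_steps=2000, step=0.01, seed=0):
    """Unadjusted Langevin dynamics driven only by a score function."""
    rng = np.random.default_rng(seed)
    x = rng.standard_normal(n_samples)
    for _ in range(n_steps):
        noise = rng.standard_normal(n_samples)
        x = x + step * score_fn(x) + np.sqrt(2.0 * step) * noise
    return x


if __name__ == "__main__":
    samples = langevin_sample(composed_score)
    # Analytically, the product of the two Gaussians has mean 2.4,
    # so the sample mean should land between the two model means.
    print("sample mean:", samples.mean())
```

In this toy setting the large model is never updated: only its score is queried during sampling, which mirrors the "without finetuning" claim. Raising prior_weight pulls samples toward the broad prior, while lowering it favors the task-specific model.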
Findings
Effective in adapting to diverse domains like animation and robotics
Maintains high fidelity and broad knowledge of large models
Enables high-quality, specialized video generation
Abstract
Large text-to-video models trained on internet-scale data have demonstrated exceptional capabilities in generating high-fidelity videos from arbitrary textual descriptions. However, adapting these models to tasks with limited domain-specific data, such as animation or robotics videos, poses a significant computational challenge, since finetuning a pretrained large model can be prohibitively expensive. Inspired by how a small modifiable component (e.g., prompts, prefix-tuning) can adapt a large language model to perform new tasks without requiring access to the model weights, we investigate how to adapt a large pretrained text-to-video model to a variety of downstream domains and tasks without finetuning. In answering this question, we propose Video Adapter, which leverages the score function of a large pretrained video diffusion model as a probabilistic prior to guide the generation of…
Taxonomy
Topics: Multimodal Machine Learning Applications · Topic Modeling · Natural Language Processing Techniques
Methods: Diffusion · Adapter
