Zero-Shot Dynamic Concept Personalization with Grid-Based LoRA

Rameen Abdal; Or Patashnik; Ekaterina Deyneka; Hao Chen; Aliaksandr Siarohin; Sergey Tulyakov; Daniel Cohen-Or; Kfir Aberman

arXiv:2507.17963·cs.GR·July 25, 2025

Zero-Shot Dynamic Concept Personalization with Grid-Based LoRA

Rameen Abdal, Or Patashnik, Ekaterina Deyneka, Hao Chen, Aliaksandr Siarohin, Sergey Tulyakov, Daniel Cohen-Or, Kfir Aberman

PDF

Open Access

TL;DR

This paper presents a zero-shot framework for dynamic concept personalization in text-to-video generation, enabling high-quality, identity-preserving videos without per-instance fine-tuning by using structured video grids and lightweight adapters.

Contribution

It introduces a novel zero-shot approach using Grid-LoRA adapters and a Grid Fill module for dynamic concept editing in text-to-video models, eliminating the need for test-time optimization.

Findings

01

High-quality, consistent results across diverse subjects.

02

Effective generalization to unseen dynamic concepts.

03

Operates in a single forward pass without fine-tuning.

Abstract

Recent advances in text-to-video generation have enabled high-quality synthesis from text and image prompts. While the personalization of dynamic concepts, which capture subject-specific appearance and motion from a single video, is now feasible, most existing methods require per-instance fine-tuning, limiting scalability. We introduce a fully zero-shot framework for dynamic concept personalization in text-to-video models. Our method leverages structured 2x2 video grids that spatially organize input and output pairs, enabling the training of lightweight Grid-LoRA adapters for editing and composition within these grids. At inference, a dedicated Grid Fill module completes partially observed layouts, producing temporally coherent and identity preserving outputs. Once trained, the entire system operates in a single forward pass, generalizing to previously unseen dynamic concepts without…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsImage Retrieval and Classification Techniques · Video Analysis and Summarization · Advanced Image and Video Retrieval Techniques