Loading paper
VIVID-10M: A Dataset and Baseline for Versatile and Interactive Video Local Editing | Tomesphere