Loading paper
Planner-Refiner: Dynamic Space-Time Refinement for Vision-Language Alignment in Videos | Tomesphere