$R^2$-Tuning: Efficient Image-to-Video Transfer Learning for Video Temporal Grounding