Loading paper
Temporal-Oriented Recipe for Transferring Large Vision-Language Model to Video Understanding | Tomesphere