Loading paper
IF-VidCap: Can Video Caption Models Follow Instructions? | Tomesphere