Act2See: Emergent Active Visual Perception for Video Reasoning