Exploring the Zero-Shot Capabilities of Vision-Language Models for Improving Gaze Following