Loading paper
Mechanisms of Object Localization in Vision-Language Models | Tomesphere