Loading paper
V*: Guided Visual Search as a Core Mechanism in Multimodal LLMs | Tomesphere