Exploring Primitive Visual Measurement Understanding and the Role of Output Format in Learning in Vision-Language Models