Loading paper
Assessing the Visual Enumeration Abilities of Specialized Counting Architectures and Vision-Language Models | Tomesphere