Loading paper
Efficient Vocabulary-Free Fine-Grained Visual Recognition in the Age of Multimodal LLMs | Tomesphere