Loading paper
A Training-Free Guess What Vision Language Model from Snippets to Open-Vocabulary Object Detection | Tomesphere