Localized Vision-Language Matching for Open-vocabulary Object Detection