Simple Open-Vocabulary Object Detection with Vision Transformers