Loading paper
FILIP: Fine-grained Interactive Language-Image Pre-Training | Tomesphere