Loading paper
Interpretable Zero-Shot Learning with Locally-Aligned Vision-Language Model | Tomesphere