Loading paper
SILC: Improving Vision Language Pretraining with Self-Distillation | Tomesphere