Loading paper
From Multimodal to Unimodal Attention in Transformers using Knowledge Distillation | Tomesphere