Loading paper
CanKD: Cross-Attention-based Non-local operation for Feature-based Knowledge Distillation | Tomesphere