A Transformer-in-Transformer Network Utilizing Knowledge Distillation for Image Recognition