Transformers perform adaptive partial pooling | Tomesphere