Loading paper
Strassen Attention, Split VC Dimension and Compositionality in Transformers | Tomesphere