Loading paper
Self-attention vector output similarities reveal how machines pay attention | Tomesphere