Self-attention as an attractor network: transient memories without backpropagation