Loading paper
Stream separation improves Bregman conditioning in transformers | Tomesphere