Loading paper
TransfoRNN: Capturing the Sequential Information in Self-Attention Representations for Language Modeling | Tomesphere