Loading paper
RecurFormer: Not All Transformer Heads Need Self-Attention | Tomesphere