Loading paper
Spark Transformer: Reactivating Sparsity in FFN and Attention | Tomesphere