Loading paper
SPION: Layer-Wise Sparse Training of Transformer via Convolutional Flood Filling | Tomesphere