Loading paper
Data-Free Pruning of Self-Attention Layers in LLMs | Tomesphere