Loading paper
SDMPrune: Self-Distillation MLP Pruning for Efficient Large Language Models | Tomesphere