Loading paper
Olica: Efficient Structured Pruning of Large Language Models without Retraining | Tomesphere