Loading paper
One-for-All Pruning: A Universal Model for Customized Compression of Large Language Models | Tomesphere