Loading paper
On the Limits of Layer Pruning for Generative Reasoning in Large Language Models | Tomesphere