Loading paper
OPTIMA: Optimal One-shot Pruning for LLMs via Quadratic Programming Reconstruction | Tomesphere