Loading paper
DOTResize: Reducing LLM Width via Discrete Optimal Transport-based Neuron Merging | Tomesphere