CompeteSMoE -- Effective Training of Sparse Mixture of Experts via Competition