Loading paper
47B Mixture-of-Experts Beats 671B Dense Models on Chinese Medical Examinations | Tomesphere