MM-Eval: A Hierarchical Benchmark for Modern Mongolian Evaluation in LLMs