Loading paper
M3Exam: A Multilingual, Multimodal, Multilevel Benchmark for Examining Large Language Models | Tomesphere