Loading paper
Bilingual Evaluation of Language Models on General Knowledge in University Entrance Exams with Minimal Contamination | Tomesphere