Loading paper
UrBLiMP: A Benchmark for Evaluating the Linguistic Competence of Large Language Models in Urdu | Tomesphere