Loading paper
tinyBenchmarks: evaluating LLMs with fewer examples | Tomesphere