Loading paper
SimpleDevQA: Benchmarking Large Language Models on Development Knowledge QA | Tomesphere