Loading paper
MLB: A Scenario-Driven Benchmark for Evaluating Large Language Models in Clinical Applications | Tomesphere