Loading paper
Assessing Large Language Models for Medical QA: Zero-Shot and LLM-as-a-Judge Evaluation | Tomesphere