Automated Rubrics for Reliable Evaluation of Medical Dialogue Systems