Explanatory Argument Extraction of Correct Answers in Resident Medical Exams
Iakes Goenaga, Aitziber Atutxa, Koldo Gojenola, Maite Oronoz, Rodrigo, Agerri

TL;DR
This paper introduces a new Spanish medical exam dataset with explanations for correct and incorrect answers, enabling automated evaluation of language models' ability to extract relevant medical explanations, aiding practitioners.
Contribution
The work presents a novel dataset with doctor-written explanations for both correct and incorrect answers in Spanish medical exams, facilitating extractive QA tasks and automated model evaluation.
Findings
Multilingual models sometimes outperform monolingual models.
Smaller models can perform competitively despite being inferior.
The dataset effectively helps identify relevant evidence-based medical explanations.
Abstract
Developing the required technology to assist medical experts in their everyday activities is currently a hot topic in the Artificial Intelligence research field. Thus, a number of large language models (LLMs) and automated benchmarks have recently been proposed with the aim of facilitating information extraction in Evidence-Based Medicine (EBM) using natural language as a tool for mediating in human-AI interaction. The most representative benchmarks are limited to either multiple-choice or long-form answers and are available only in English. In order to address these shortcomings, in this paper we present a new dataset which, unlike previous work: (i) includes not only explanatory arguments for the correct answer, but also arguments to reason why the incorrect answers are not correct; (ii) the explanations are written originally by medical doctors to answer questions from the Spanish…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Explainable Artificial Intelligence (XAI) · Artificial Intelligence in Healthcare and Education
