Effects of Cross-lingual Evidence in Multilingual Medical Question Answering
Anar Yeginbergen, Maite Oronoz, Rodrigo Agerri

TL;DR
This study explores how different external evidence sources and retrieval strategies impact multilingual medical question answering, revealing that effectiveness varies with language resource availability and model size.
Contribution
It systematically evaluates the effects of curated, web, and LLM explanations across languages and retrieval methods, highlighting nuanced strategies for low-resource settings.
Findings
Larger models perform better in English across evaluations.
Web-retrieved data benefits high-resource languages most.
Combining English and target language retrieval improves low-resource language performance.
Abstract
This paper investigates Multilingual Medical Question Answering across high-resource (English, Spanish, French, Italian) and low-resource (Basque, Kazakh) languages. We evaluate three types of external evidence sources across models of varying size: curated repositories of specialized medical knowledge, web-retrieved content, and explanations from LLM's parametric knowledge. Moreover, we conduct experiments with multilingual, monolingual and cross-lingual retrieval. Our results demonstrate that larger models consistently achieve superior performance in English across baseline evaluations. When incorporating external knowledge, web-retrieved data in English proves most beneficial for high-resource languages. Conversely, for low-resource languages, the most effective strategy combines retrieval in both English and the target language, achieving comparable accuracy to high-resource…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
