Automating Clinical Information Retrieval from Finnish Electronic Health Records Using Large Language Models

Mikko Saukkoriipi; Nicole Hernandez; Jaakko Sahlsten; Kimmo Kaski; Otso Arponen

arXiv:2603.26434·cs.CL·March 30, 2026

Automating Clinical Information Retrieval from Finnish Electronic Health Records Using Large Language Models

Mikko Saukkoriipi, Nicole Hernandez, Jaakko Sahlsten, Kimmo Kaski, Otso Arponen

PDF

TL;DR

This study demonstrates that open-source large language models can accurately retrieve patient-specific information from Finnish electronic health records through a locally deployable question answering framework, emphasizing clinical validation.

Contribution

The paper introduces a fully offline, open-source LLM-based framework for clinical question answering from EHRs, with benchmarking on Finnish clinical data and analysis of deployment considerations.

Findings

01

Llama-3.1-70B achieved 95.3% accuracy in free-text generation.

02

Models maintained performance with low-precision quantization.

03

Clinically significant errors occurred in 2.9% of outputs.

Abstract

Clinicians often need to retrieve patient-specific information from electronic health records (EHRs), a task that is time-consuming and error-prone. We present a locally deployable Clinical Contextual Question Answering (CCQA) framework that answers clinical questions directly from EHRs without external data transfer. Open-source large language models (LLMs) ranging from 4B to 70B parameters were benchmarked under fully offline conditions using 1,664 expert-annotated question-answer pairs derived from records of 183 patients. The dataset consisted predominantly of Finnish clinical text. In free-text generation, Llama-3.1-70B achieved 95.3% accuracy and 97.3% consistency across semantically equivalent question variants, while the smaller Qwen3-30B-A3B-2507 model achieved comparable performance. In a multiple-choice setting, models showed similar accuracy but variable calibration.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.