# Explaining care need assessment surveys: qualitative and quantitative evaluation of state-of-the-art local and global explainable artificial intelligence methods

**Authors:** Necip Oğuz Şerbetci, Stefan Blüher, Paul Gellert, Ulf Leser

PMC · DOI: 10.1093/jamiaopen/ooaf064 · JAMIA Open · 2025-07-29

## TL;DR

This paper explores how explainable AI methods can help identify key factors influencing care need assessments using thousands of German care benefits applications.

## Contribution

A systematic comparison of traditional and state-of-the-art XAI methods for extracting insights from unstructured care assessment text data.

## Key findings

- Transformer-based models outperformed traditional baselines in predicting care need assessments.
- XAI methods extracted key phrases that balance rare and common terms, improving module-specific insights.
- Both traditional and deep learning XAI approaches provided useful phrases for understanding care need factors.

## Abstract

With extended life expectancy, the number of people in need of care has been growing. To optimally support them, it is important to know the patterns and conditions of their daily life that influence the need for support, and thus, the classification of the care need. In this study, we aim to utilize a large corpus consisting of care benefits applications to do an explorative analysis of factors affecting care need to support the tedious work of experts gathering reliable criteria for a care need assessment.

We compare state-of-the-art methods from explainable artificial intelligence (XAI) as means to extract such patterns from over 72 000 German care benefits applications. We train transformer models to predict assessment results as decided by a Medical Service Unit from accompanying text notes. To understand the key factors for care need assessment and its constituent modules (such as mobility and self-therapy), we apply feature attribution methods to extract the key phrases for each prediction. These local explanations are then aggregated into global insights to derive key phrases for different modules and severity of care need over the dataset.

Our experiments show that transformers-based models perform slightly better than traditional bag-of-words baselines in predicting care need. We find that the bag-of-words baseline also provides useful care-relevant phrases, whereas phrases obtained through transformer explanations better balance rare and common phrases, such as diagnoses mentioned only once, and are better in assigning the correct assessment module.

Even though XAI results can become unwieldy, they let us get an understanding of thousands of documents with no extra annotations other than existing assessment outcomes.

This work provides a systematic application and comparison of both traditional and state-of-the-art deep learning based XAI approaches to extract insights from a large corpus of text. Both traditional and deep learning approaches provide useful phrases, and we recommend using both to explore and understand large text corpora better. We will make our code available at https://github.com/oguzserbetci/explainer.

## Full-text entities

- **Diseases:** MD (MESH:C535955), impaired memory (MESH:D008569), XAI (MESH:C538243), visual hallucinations (MESH:D006212)
- **Chemicals:** AOPCglobal (-), water (MESH:D014867)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12307913/full.md

## Figures

3 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12307913/full.md

## References

26 references — full list in the complete paper: https://tomesphere.com/paper/PMC12307913/full.md

---
Source: https://tomesphere.com/paper/PMC12307913