# LLM-based feature selection and counterfactual explanations applied to functional connectivity analysis in schizophrenia

**Authors:** Xinyan Yuan, Tiantian Chen, Yanyan He, Lingling Gu, Ying Sun, Shaolong Wei

PMC · DOI: 10.3389/fnins.2025.1732013 · Frontiers in Neuroscience · 2026-01-12

## TL;DR

This paper combines feature selection guided by a large language model (LLM) with counterfactual explanations to make functional connectivity (FC) analysis in schizophrenia both more accurate and easier to interpret.

## Contribution

The framework integrates LLM-guided feature selection with counterfactual explanations to improve the interpretability and biological relevance of functional connectivity (FC) analysis in schizophrenia.

## Key findings

- The proposed framework improves classification performance on five real-world schizophrenia datasets.
- The method provides biologically plausible and interpretable insights into functional connectivity patterns.
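- Key limitations: data heterogeneity hinders direct clinical application, the method cannot accurately locate FC abnormalities in individual patients, and the hyperparameters for counterfactual generation are not yet optimized.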

## Abstract

Schizophrenia (SZ) is a complex psychiatric disorder whose neural mechanisms are still unclear. Functional connectivity (FC) provides a unique perspective for understanding its pathology, but its high-dimensional nature poses significant challenges for feature selection and model interpretation. Traditional feature selection methods, while predictive, lack the integration of prior neuroscience knowledge, resulting in limited clinical relevance.
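To make the dimensionality problem concrete, here is a minimal sketch of how an FC feature vector is often built. It assumes FC is estimated as the Pearson correlation between regional time series, which is a common choice but is not stated in this summary. The function name `fc_features`, the 90-region parcellation, and the synthetic signals are all illustrative assumptions.

```python
import numpy as np

def fc_features(ts):
    """Vectorize a functional connectivity matrix.

    ts: array of shape (n_timepoints, n_regions) holding one signal per region.
    Returns the upper triangle (diagonal excluded) of the Pearson correlation
    matrix, i.e. n_regions * (n_regions - 1) / 2 features.
    """
    corr = np.corrcoef(ts, rowvar=False)     # (n_regions, n_regions)
    iu = np.triu_indices_from(corr, k=1)     # indices above the diagonal
    return corr[iu]

# Synthetic data only: 150 time points, 90 hypothetical brain regions.
rng = np.random.default_rng(0)
ts = rng.standard_normal((150, 90))
features = fc_features(ts)
print(features.shape)  # (4005,)
```

With 90 regions a single subject already yields 90 × 89 / 2 = 4005 features, typically far more than the number of subjects in a clinical dataset, which is why feature selection is needed before classification.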

To address this, we propose a framework that combines feature selection guided by a large language model (LLM) with counterfactual explanation. The framework leverages brain disease knowledge encoded in the LLM to guide dimensionality reduction of high-dimensional FC, ensuring that the selected features are both statistically significant and biologically plausible. Counterfactual explanations then generate causal intervention examples, which the LLM translates into intuitive natural-language explanations, providing understandable and actionable clinical insights for individual patients and their physicians.
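The counterfactual step can be illustrated with a minimal sketch. This summary does not describe the paper's classifier or counterfactual generator, so the example below swaps in the simplest possible case: a linear classifier with score `w·x + b`, and the smallest L2 change to a subject's features, restricted to a set of selected features, that flips the predicted class. The function `linear_counterfactual`, the `mask` of selected features, and all numbers are hypothetical.

```python
import numpy as np

def linear_counterfactual(x, w, b, mask, margin=1e-3):
    """Closest point to x (in L2) with the opposite sign of w.x + b,
    changing only the features where mask == 1 (assumed selected features).

    This is a closed-form projection for a linear model, not the paper's method.
    """
    score = float(w @ x + b)
    w_m = w * mask                        # weights of the features we may change
    norm_sq = float(w_m @ w_m)
    if norm_sq == 0.0:
        raise ValueError("no selected feature has a nonzero weight")
    # Aim just past the decision boundary, on the opposite side of the score.
    target = -np.sign(score) * margin if score != 0 else margin
    delta = (target - score) / norm_sq * w_m
    return x + delta, delta

# Synthetic example: 10 FC features, 3 of them hypothetically selected.
rng = np.random.default_rng(0)
n = 10
x = rng.uniform(-1, 1, n)
w = rng.standard_normal(n)
b = 0.1
mask = np.zeros(n)
mask[[1, 4, 7]] = 1.0

x_cf, delta = linear_counterfactual(x, w, b, mask)
changed = np.flatnonzero(delta)           # indices of the altered connections
```

The nonzero entries of `delta` say which connections would need to change, and by how much, for the prediction to flip; according to the abstract, it is this kind of intervention example that the LLM then phrases in natural language. A realistic generator would also keep the altered values inside the valid correlation range of [-1, 1], which this sketch does not enforce.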

We validate our approach on five real-world SZ datasets and demonstrate that it not only improves model classification performance but also provides new insights into SZ analysis.

The LLM-based FC analysis method proposed in this study achieves good feature selection and interpretability on multiple SZ datasets. Its main advantage is that it effectively identifies the key FC features between brain regions. However, the method has limitations: data heterogeneity makes direct clinical application difficult, it cannot accurately locate FC abnormalities in individual patients, and the hyperparameters for counterfactual generation have not yet been optimized.

## Linked entities

- **Diseases:** schizophrenia (MONDO:0005090)

## Full-text entities

- **Diseases:** SZ (MESH:D012559), psychiatric disorder (MESH:D001523), brain disease (MESH:D001927), FC abnormalities (MESH:D000014)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12832681/full.md

## Figures

8 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12832681/full.md

## References

53 references — full list in the complete paper: https://tomesphere.com/paper/PMC12832681/full.md

---
Source: https://tomesphere.com/paper/PMC12832681