Reconstructing brain causal dynamics for subject and task fingerprints using fMRI time-series data

Dachuan Song; Li Shen; Duy Duong-Tran; Xuan Wang

PMC · DOI:10.1007/s13755-025-00388-w·October 28, 2025

Reconstructing brain causal dynamics for subject and task fingerprints using fMRI time-series data

Dachuan Song, Li Shen, Duy Duong-Tran, Xuan Wang

PDF

Open Access

TL;DR

This paper introduces a new method using fMRI data to identify individuals and tasks based on brain causal dynamics, showing biological relevance and potential for neuroscience applications.

Contribution

A novel two-timescale state-space model and brain reachability landscape visualization for capturing causal brain dynamics in subject and task fingerprinting.

Findings

01

The model captures causal brain region interactions and disentangled dynamic modes from fMRI data.

02

Causal signatures improve subject identification and task classification compared to non-causal methods.

03

The brain reachability landscape provides a new way to visualize and quantify brain activation under different tasks.

Abstract

Recently, there has been a revived interest in system neuroscience causation models, driven by their unique capability to unravel complex relationships in multi-scale brain networks. In this paper, we present a novel method that leverages causal dynamics to achieve effective fMRI-based subject and task fingerprinting. By applying an implicit-explicit discretization scheme, we develop a two-timescale linear state-space model. Through data-driven identification of its parameters, the model captures causal signatures, including directed interactions among brain regions from a spatial perspective, and disentangled fast and slow dynamic modes of brain activity from a temporal perspective. These causal signatures are then integrated with: (i) a modal decomposition and projection method for model-based subject identification, and (ii) a Graph Neural Network (GNN) framework for learning-based…

Linked entities

Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.

Species1

Homo sapiens(human · species)

Diseases1

neurodegenerative diseases

Figures8

Click any figure to enlarge with its caption.

Two timescale partition of a single fMRI sampling period $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$[(k-1)\Delta t,\,k\Delta t]$$\end{document}$ . The interval is split into a fast sub-interval of duration $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{

Architecture of the five-layer GNN used for task fingerprinting

Three-tier illustration mapping the reachable-set activation surface (top) onto a flattened 12 $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\times$$\end{document}$ 12 cortical grid (middle) and, from there, onto its anatomical locations in a 3-D brain rendering (bottom). Colored tiles mark four representative regions

Left: Subject Fingerprinting based on CM+MD&P: The identification accuracy is compared with (black line) using the FC+CoR method. Right: Task Fingerprinting based on CM+GNN: The figure shows the correct/incorrect classification for the same/different fMRI tasks. The y-axis is split. The top segment (95–100%) shows the near-perfect accuracies, while the bottom segment (0–2%) zooms in on the small error rates

Visualization of Brain Regions with Strongest Connection Weights, Showing Association Strengths Across Multiple Cognitive Tasks (Importance Scores Indicated by Color-Bar). LH refers to the left hemisphere, and RH refers to the right hemisphere of the brain. Arrows indicate directed relationships, illustrating which regions exert causal influence on others. For clarity of presentation, we employ a two-color scheme and apply a threshold to highlight only the most active brain areas and their directional interactions

Reachability-based heatmaps for one subject across eight cognitive tasks (Rest, Emot, Gamb, Lang, Moto, Rela, Soci, and WMem). Brighter colors (yellow) indicate stronger maximum neural activations, while darker tones (blue) reflect lower excitability. Red dashed circles highlight representative activation hotspots corresponding to critical brain regions known for each cognitive task. The black boundary outlines the edge of the flattened cortical map. The correspondence between this reachability representation and anatomical brain regions is illustrated in Fig. [4](#Fig4). These patterns highli

Funding3

—http://dx.doi.org/10.13039/100000084Directorate for Engineering
—National Institute of Health
—National Institute of Health

Keywords

FMRI fingerprintingBrain causal dynamicsTwo-timescale state-space modelReachability analysis

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsFunctional Brain Connectivity Studies · EEG and Brain-Computer Interfaces · Action Observation and Synchronization

Full text

Introduction

Advancements in brain imaging technologies [1], especially Functional Magnetic Resonance Imaging (fMRI) of Blood-Oxygen-Level-Dependent (BOLD) signals, have enabled precise and quantitative measures of brain activity, leading to a significant boost in neuroscience research [2]. Among various brain modeling approaches, causation models have recently received a revamped interest [3, 4] in recent years, due to their capability to reveal complex relationships without multi-scale brain networks. These models have been extended in multiple directions to understand the brain’s underlying processes [5], explaining how these processes support cognition and behavior [6–8], and propose new methods to regulate brain activity [9, 10] for enhancing normal functions and mitigating disorders [11]. However, most fingerprinting pipelines still rely on static, symmetric connectivity maps and thus miss the directed, multi-timescale dynamics that underlie neural communication.

Inspired by these developments, this paper explores the use of causal dynamic models to extract informative features from the brain for fMRI fingerprinting. Specifically, we aim to leverage causal signatures derived from fMRI time-series data to identify both the individual subject (subject fingerprinting) and the cognitive task being performed (task fingerprinting). A major challenge in this process arises from the variability across subjects and tasks, along with the limited number of recordings available for each subject-task combination [12]. In other words, the problem combines heterogeneity (across subjects/tasks) with data sparsity, which calls for models that are both interpretable and sample-efficient. Our approach focuses on uncovering and utilizing latent causal signatures in the brain. While this requires more specialized model structures, it can offer improved data efficiency. This distinguishes our method from traditional correlation-based approaches [7, 10, 13–19], which are limited in their ability to capture the directional, evolving cause-and-effect relationships that are fundamental to cognitive dynamics.

Statement of contribution: In this paper, our result provides a solid ‘YES’ answer to the feasibility and effectiveness of utilizing brain causal signatures for subject and task fingerprinting. Our main contributions are summarized as follows:

Introducing a two-timescale linear state-space model, which (i) captures directed interactions among brain regions from a spatial perspective, and (ii) disentangles fast and slow dynamic modes of brain activity from a temporal perspective. Model parameters are identified using a data-driven, implicit-explicit discretization scheme.
Integrating the causal signatures with (i) a modal decomposition and projection method for model-based subject identification, and (ii) a Graph Neural Network (GNN) framework for learning-based task classification.
Leveraging the control-theoretic interpretation of the causal-based model to introduce the concept of Reachability landscape as a novel visualization tool, which quantitatively characterizes the maximum possible activation levels of brain regions under various fMRI tasks. Extensive experiments and comparative analyses are conducted to validate the proposed approach. This research also lays the groundwork for future exploration of causal fingerprinting in both healthy individuals and those affected by neurodegenerative disorders.

Related work

The concept of an fMRI “fingerprint” [7, 10, 13–17, 20–22] can be considered as a classification problem (identifying which subject or task corresponds to a given fMRI sample). In conventional applications, data richness has a major impact on classification accuracy. However, in data-scarce settings, model structure and algorithmic interpretability become critical for maintaining robust performance. For example, in visual person re-identification (re-ID) tasks, incorporating auxiliary features such as gender, color, and texture can significantly improve identification accuracy [23]. However, a similar idea is not directly applicable to fMRI-based applications, as raw brain signals are inherently difficult to interpret or to translate into distinguishable features.

Static connectivity models

In neuroscience research, one strategy to address this challenge is to use the functional connectome (FC) [12]. The FC measures the correlations of the fMRI time-series data among pair-wise brain regions during rest or task conditions, and it has been successfully utilized as individual-specific neural signatures to classify various cognitive states and neurological disorders [18, 24]. However, FC is static and undirected, thus missing causal directionality. Another common brain-network modeling technique is Independent Component Analysis (ICA). This approach separates fMRI data into spatially distinct, statistically independent components corresponding to different brain networks [25, 26]. It has proven especially useful for revealing resting-state networks and explaining their roles in cognitive function and pathology [27–29]. Although FC and ICA have achieved notable success in many applications, they still have several limitations. In particular, they do not fully capture the dynamic nature of brain activity, as they lack the temporal resolution needed to detail interactions among regions, and cannot represent the directionality of connections between regions [14, 16, 24, 30]. Each of these aspects, dynamics, temporal detail, and directionality, is essential for understanding the cause-and-effect interactions within brain networks.

Causal connectivity models

In contrast to static correlational methods, causality-based approaches such as Dynamic Causal Modeling (DCM) and Granger causality offer alternative insights into patterns of brain interactions. However, these directed connectivity methods have been relatively under-explored in the context of fMRI fingerprinting. Analyzing causality in fMRI entails examining directional influences among brain regions over time, which provides a window into the brain’s dynamic modes of interaction. For example, DCM uses a Bayesian framework to model interactions among latent neural states with bilinear differential equations, which describes how activity in one region influences others over time [31, 32]. Similarly, Granger causality evaluates whether one time series can predict another, thereby identifying potential directed relationships between neural signals [33–35]. Yet such directed methods remain under-explored for fMRI fingerprinting. Most Dynamic Causal Modeling (DCM) or Granger-causality studies restrict the analysis to a small number of regions of interest (ROIs) and assume multiple sessions per subject. Consequently, it is unclear how well these approaches scale to whole-cortex parcellations with few runs per subject, or to joint subject and task fingerprinting. To further enhance these causal models and better capture complex neural dynamics, recent research has introduced several modifications. For instance, threshold-based structures have been incorporated to reflect the saturation of neural activation [36–38], and multifactorial dynamics have been proposed to represent systems that operate across multiple time scales [39].

Learning-based approaches

Beyond these correlation-based or causality-based methodologies, classical machine-learning algorithms also remain popular. Random Forest (RF) [40] and Support Vector Machines (SVM) [41] provide high interpretability and easy deployment across diverse datasets. Recent studies on fMRI fingerprinting have also explored a variety of advanced techniques to capture different facets of brain dynamics and connectivity patterns. These include deep-learning architectures. Deep Neural Networks (DNN) capture high-dimensional connectivity patterns [42]. Deep Bidirectional Recurrent Neural Networks (DBRNN) model temporal dynamics in brain signals [43]. Brain Attend and Decode (BAnD) applies attention mechanisms to highlight salient brain regions [44]. A recent variant, TCN–BiLSTM, combines dilated Temporal Convolutional Networks (TCNs) with bidirectional Long Short-Term Memory (BiLSTM) layers to model short- and long-range dependencies in fMRI sequences [45]. However, it still operates on undirected regional time series and lacks an explicit multi-timescale causal framework, which limits its ability to characterize directed network interactions. Geometry- and topology-based pipelines have also been proposed. Tangent-Space Fingerprinting (TSF) maps functional-connectivity matrices to the Riemannian tangent space of the symmetric positive-definite (SPD) manifold, capturing global correlation geometry while still treating connectivity as static and undirected [46]. The PH-H0 landscape describes networks via zero-dimensional persistent homology (H0), providing a noise-robust topological summary yet discarding temporal scale and edge direction [47]. Classical classifiers such as Random Forests and Support Vector Machines are trained on pre-specified connectivity vectors. Without directionality or temporal context, their accuracy soon plateaus and lags behind causal models [40, 41]. End-to-end deep networks (DNN, DBRNN, BAnD) can harvest richer patterns, yet they require large training sets and remain hard to interpret. Geometry-based TSF and topology-based PH-H0 offer elegant dimensionality reduction, but both treat connectivity as static and undirected [47]. Across these lines of work, directed interactions and the multiple time scales that shape cognition are largely ignored. Our method fills this gap: a single scan yields compact causal signatures that encode directionality and tempo, and a lightweight GNN turns them into accurate fingerprints while preserving biophysical meaning.

Methods: causal fingerprint

In this paper, the concept of causal fingerprint is defined as follows: (later referred to as fingerprint, for simplicity.)

Definition 1

Causal fingerprint is the degree to which a subject or an fMRI task can be identified from a labeled database, particularly based on the subjects’ or fMRI tasks’ unique cause-and-effect cognitive signatures.

To summarize our method, we introduce an implicit-explicit discretization method that yields a two-timescale state-space model to capture causal signatures. Then we combine the causal signatures with a modal decomposition and projection method for subject fingerprinting and with a GNN model for task fingerprinting. Finally, building on the state-space model, we propose a new visualization tool that characterizes the reachability of the brain state, which quantitatively represents the maximally possible excitation level for different brain regions.

Notations. Throughout this section, suppose we are given an fMRI data set with $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${\mathcal {S}}$$\end{document}$ representing the set of all subjects, and $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${\mathcal {H}}$$\end{document}$ representing the set of all tasks performed by the subjects. Let $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$d_{s,h}\in {\mathbb {R}}^{p\times T}$$\end{document}$ represent the time-series recording of a subject $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$s\in {\mathcal {S}}$$\end{document}$ performing an fMRI task $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$h\in {\mathcal {H}}$$\end{document}$ . The dimension p is the number of brain parcellations and T is the testing duration. Let $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$d_{s,h}(k)\in {\mathbb {R}}^{p\times T}$$\end{document}$ represent the $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$k^{th}$$\end{document}$ column of $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$d_{s,h}$$\end{document}$ . Let $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\text {vec}(\cdot )$$\end{document}$ represent the vectorization (stacking all columns) of a matrix into a single-column vector.Fig. 1. Two timescale partition of a single fMRI sampling period $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$[(k-1)\Delta t,\,k\Delta t]$$\end{document}$ . The interval is split into a fast sub-interval of duration $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\tau$$\end{document}$ (short solid red) and a slow sub-interval of duration $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\Delta t-\tau$$\end{document}$ (long solid red), with $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$0<\tau \ll \Delta t$$\end{document}$ . Driven by the slower input $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$u(k-1)$$\end{document}$ and the current input u(k), the state evolves from the previous scan $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$x(k-1)$$\end{document}$ , passes the intermediate point $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$z(k)=x(k-\tau )$$\end{document}$ , and reaches the current scan x(k)Fig. 2a Brain activity (time-series data) captured using Schaefer Parcellation. b Causal dynamics modeling and data-driven parameter identification

A two-timescale state-space model for causal dynamics

To capture the variability of subjects performing different fMRI tasks, we propose a two-time-scale linear state-space model that characterizes the brain’s activity both spatially and temporally. Spatially, the model captures directed interactions among brain regions, identifying which areas causally influence others. Temporally, it disentangles brain dynamics across two time-scales, characterizing both rapid neural signaling and slower hemodynamic responses.

Although fMRI only records data at a low sampling rate, the brain operates on multiple timescales [48]: fast neuronal activities such as synaptic firing and local synchrony occur on the order of milliseconds; while slow hemodynamic responses unfold over several seconds. As a result, the observed fMRI time-series data ( $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$d_{s,h}$$\end{document}$ ) represents an aggregation of these intertwined processes, making it challenging to reconstruct the parameters of the underlying dynamics. To address this, we propose an implicit-explicit discretization method that partitions each sampling period into a fast and a slow sub-interval. The fast dynamics capture concurrent interactions that reflect rapid neuronal activity, while the slow dynamics capture hemodynamic responses and long-timescale network interactions among brain regions.

In the following, we first present the proposed two-time-scale model, then we validate the model by showing how it is mathematically derived and how the model parameters can be obtained through a data-driven method. Since this modeling framework applies generically to any subject and any task, we omit the subscripts s, h for clarity in what follows.

Consider fMRI data with p brain parcellations. We divide the parcellations into two subsets: system states $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$x(k)\in {\mathbb {R}}^m$$\end{document}$ and inputs $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$u(k)\in {\mathbb {R}}^n$$\end{document}$ , with $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$m+n=p$$\end{document}$ . Each entry of x(k) or u(k) represents the activity of a specific brain region at discrete time k. We assume that $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$k\in {\mathbb {Z}}$$\end{document}$ indexes successive fMRI scans, typically spaced $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\Delta t$$\end{document}$ seconds apart (e.g. $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\Delta t=0.72s$$\end{document}$ for the dataset used in this paper). Our aim is to construct a two-timescale state-space model of the following form:

\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\begin{aligned} x(k) = Qx(k) \!+\! Ax(k-1) \!+\! B_{1}u(k)\!+\! B_{2}u(k-1). \end{aligned}$$\end{document}

Here, $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$Q \in {\mathbb {R}}^{m \times m}$$\end{document}$ captures the fast sub-interval (concurrent) interaction among the neuronal states; $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$A \in {\mathbb {R}}^{m \times m}$$\end{document}$ encodes the slow sub-interval (cross-lagged) transition from $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$x(k-1)$$\end{document}$ ; $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$B_{1}\,u(k)$$\end{document}$ and $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$B_{2}\,u(k-1)$$\end{document}$ incorporate the inputs into the system for fast and slow dynamics, respectively.

Implicit-Explicit discretization: To systematically derive the two-timescale discrete model (1), we divide each fMRI sampling period into slow and fast sub-intervals, as illustrated in Fig. 1. Consider a continuous-time dynamical system with both slow and fast modes:

\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\begin{aligned} \frac{d}{dt} x(t)=F_\text{s}x(t)+G_\text{s} u(t)+ \frac{1}{\epsilon }(F_\text{f} x(t)+G_\text{f} u(t)), \end{aligned}$$\end{document}

where $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$0<\epsilon \ll 1$$\end{document}$ indicates the separation of timescales. To discretize the system, we divide the full sampling interval $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\Delta t$$\end{document}$ into a slow component $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\Delta t - \tau$$\end{document}$ and a fast component $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\tau$$\end{document}$ , such that the ratio of their lengths satisfies:1 $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\displaystyle \frac{\Delta t - \tau }{\tau }=\frac{1}{\epsilon }$$\end{document}$ .

The slow dynamics are approximated using an explicit Euler approximation from $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$x(k-1)$$\end{document}$ to an intermediate state z(k):

\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\begin{aligned} \frac{z(k)-x(k-1)}{\Delta t - \tau }=\Bigl [F_{\text{s}}\,x(k-1) + G_{\text{s}}\,u(t-1)\Bigr ]. \end{aligned}$$\end{document}

which yields

\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\begin{aligned} z(k) \!=\! x(k\!-\!1) \!+\! (\Delta t \!-\! \tau )\left[ F_{\text{s}}x(k\!-\!1) \!+\! G_{\text{s}}u(k\!-\!1)\right] \end{aligned}$$\end{document}

The fast dynamics are then modeled as an update over the interval $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\tau$$\end{document}$ , which starts from z(k) and evolves to x(k):

\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\begin{aligned} \frac{x(k)-z(k)}{ \tau }=\Bigl [F_\text{f} x(k)+G_\text{f} u(k)\Bigr ]. \end{aligned}$$\end{document}

which yields:

\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\begin{aligned} x(k)=\tau \Bigl [F_\text{f} x(k)+G_\text{f} u(k)\Bigr ]+ z(k). \end{aligned}$$\end{document}

Note that the intermediate state z(k) is not directly observable in the fMRI time series, which only samples the system at the slower rate $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\Delta t$$\end{document}$ . Thus, we introduce an implicit Euler method in (5) that uses x(k) and u(k) to approximate the update. This allows us to cancel out the intermediate state z(k) and use the one-time scale fMRI time-series data to reconstruct the parameters of the two-time-scale systems. Specifically, by incorporating (4) and (6), one has

\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\begin{aligned} x(k)=&~\tau F_\text{f} x(k) + \bigl [I + (\Delta t - \tau )\,F_{\text{s}}\bigr ]\,x(t{-}1)\nonumber \\&+ \tau G_\text{f} u(k) + (\Delta t - \tau )G_{\text{s}} u(k-1). \end{aligned}$$\end{document}

Now, define

\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\begin{aligned} Q&\triangleq \tau \,F_{\text{f}}, \quad A \triangleq I + (\Delta t - \tau )\,F_{\text{s}}, \\ B_{1}&\triangleq \tau \,G_{\text{f}}, \quad B_{2} \triangleq (\Delta t - \tau )\,G_{\text{s}}. \end{aligned}$$\end{document}

The update equation aligns with model (1).

Data-driven reconstruction of model parameters: Given fMRI data $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\{x(k),u(k)\}$$\end{document}$ for $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$k=0,\dots ,T$$\end{document}$ , each time $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$k\ge 1$$\end{document}$ should satisfy equation (1). Define the data matrices $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$X^{1:T}=\begin{bmatrix} x(1)&x(2)&\cdots x(T) \end{bmatrix}$$\end{document}$ and $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$U^{0:T-1}=\begin{bmatrix} u(0)&u(1)&\cdots u(T-1) \end{bmatrix}$$\end{document}$ . A compact representation of the dynamics follows

\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\begin{aligned} X^{1:T}=QX^{1:T} + AX^{0:T-1} + B_1U^{0:T-1}+ B_2U^{1:T} \end{aligned}$$\end{document}

To determine the model parameters in the presence of model imperfections and measurement noise, we formulate the generalized least squares problem:

\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\begin{aligned}&\underset{Q, A,B_1,B_2}{\arg \min }~ \lambda (\Vert Q\Vert _F+\Vert A\Vert _F+\Vert B_1\Vert _F+\Vert B_2\Vert _F) \nonumber \\&\!+\! \big \Vert (Q\!-\!I)X^{1:T} \!\!+\! AX^{0:T\!-\!1}\!\!+\! B_1U^{0:T\!-\!1}\!\!+\! B_2U^{1:T}\big \Vert _F \nonumber \\&\text {subject to}\quad Q_{ii}=0,~\forall i\in {\textbf{m}} \end{aligned}$$\end{document}

Here $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$Q,A\in {\mathbb {R}}^{m\times m}$$\end{document}$ , $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$B_{1},B_{2}\in {\mathbb {R}}^{m\times n}$$\end{document}$ , $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$X^{1:T},X^{0:T-1}\in {\mathbb {R}}^{m\times T}$$\end{document}$ , and $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$U^{0:T-1}, U^{1:T}\in {\mathbb {R}}^{n\times T}$$\end{document}$ , where $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\lambda>0$$\end{document}$ controls the strength of the regularization to prevent over-fitting, and $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\Vert \cdot \Vert _{F}$$\end{document}$ denotes the Frobenius norm (for any matrix $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$M\!\in \!{\mathbb {R}}^{p\times q}$$\end{document}$ , $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\Vert M\Vert _{F}=\sqrt{\sum _{i=1}^{p}\sum _{j=1}^{q}M_{ij}^{2}}$$\end{document}$ ) [50]. The residual term $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\bigl \Vert (Q-I)X^{1:T}+AX^{0:T-1}+B_{1}U^{0:T-1}+B_{2}U^{1:T}\bigr \Vert _{F}$$\end{document}$ is the Frobenius norm of an $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$m\times T$$\end{document}$ matrix, which aims to extract system parameters as causal signatures by minimizing the difference between the system’s measured next time state with the next time state predicted by our two-time scale model. The $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\Vert Q\Vert _{F}$$\end{document}$ , $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\Vert A\Vert _{F}$$\end{document}$ , $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\Vert B_{1}\Vert _{F}$$\end{document}$ and $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\Vert B_{2}\Vert _{F}$$\end{document}$ regularization terms to prevent over-fitting. By solving (8), we transform a complex fMRI time-series data $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$d(t)=\{x(t), u(t)\}$$\end{document}$ into a structured and meaningful representation $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${R}= [Q~A~B_1~B_2]\in {\mathbb {R}}^{m\times (2\,m+2n)}$$\end{document}$ of brain dynamics signature, as visualized in Fig. 2. This transformation not only makes the data more accessible for subsequent analysis but also reveals the patterns of ‘spatio-temporal’ causality relationships that underpin cognitive processes. In the following, we will build on R to perform subject and task fingerprints.

Remark 1

(Model choice:) In contrast to traditional functional connectome (FC) representations used for brain fingerprinting, the proposed causal signature offers two key advancements: it captures directional interactions (i.e., which region inhibits or excites another) and temporal dependencies (i.e., how earlier neural activities influence subsequent ones across two time-scales). In comparison, the FC method only captures undirected and concurrent relationships between the activities of pairwise brain regions. From a modeling perspective, we note that this work adopts a linear state-space model (1), which represents a relatively simple form of causal modeling compared to the broader literature that incorporates more complex, nonlinear structures—such as bilinear dynamics [31], threshold dynamics [36], and multifactorial time-varying dynamics [39]. However, we argue that the two-timescale structure enabled by implicit-explicit discretization already encodes sufficiently rich features. The use of a simpler model (linear v.s. nonlinear) does not diminish the results of the paper; instead, it provides compelling evidence for the utility of cause-and-effect signatures in fingerprinting applications. Moreover, simpler models generally have mild requirements on data richness. As will be demonstrated in experiments, the proposed approach achieves accuracy comparable to more complex, non-causal methods (e.g., FC and end-to-end learning), even when using low-resolution fMRI data (Schaefer-100 parcellation).

Subject causal fingerprint via modal decomposition and projection

Based on the model (1) and the parameter reconstruction (8), we can obtain a representation $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$R_{s,h}=[Q~A~B_1~B_2]_{s,h}$$\end{document}$ for any subject-task pair that extracts its spatial and temporal signatures across two time-scales. In the following, we incorporate these signatures with a state-space modal decomposition [51] and projection method to perform subject fingerprinting.

For each fMRI task $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$h \in {\mathcal {H}}$$\end{document}$ , we construct a labeled reference set:

\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\begin{aligned} \widehat{{\mathcal {R}}}_h=\{R_{s,h}^D\}, \quad s\in {\mathcal {S}}. \end{aligned}$$\end{document}

where the superscript D denotes data samples with known subject identities. We assume that the dataset includes at least two recordings for every subject-task pair. This allows one sample to be used in the reference set $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\widehat{{\mathcal {R}}}_h$$\end{document}$ , while the remaining samples serve as query instances for evaluation. Given a query sample with causal signature $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$R_{h}^Q$$\end{document}$ of an unknown identity, the objective is to identify the subject label s by comparing it against the labeled reference set $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$R_{s,h}^D$$\end{document}$ .

The key challenge for subject causal fingerprinting lies in the limited data for each subject, as the fMRI data collection is time-consuming. Essentially, we need to solve a one-shot classification problem [23], i.e., only one sample per subject is available to determine their identity from a large number of candidates. To address this, we leverage the control-theoretic interpretation of the model (1), in which the causal system signature $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$R = [Q~A~B_1~B_2]$$\end{document}$ naturally encodes the system’s dynamic modes [51, Chapter 12], reflecting invariant subspaces of brain dynamics. These dynamic modes serve as augmented features for robust subject identification.

We extract dynamic modes through eigendecomposition and coordinate transformation by applying them, respectively, to the fast and slow timescales. For example, to analyze the slow dynamic mode, we treat the system as

\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\begin{aligned} x(k) = Ax(k-1) + B_{2}u(k-1) + o(k). \end{aligned}$$\end{document}

where $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$o(k)=Qx(k)+ B_{1}u(k)$$\end{document}$ is considered as a perturbation term. To perform modal decomposition, we express $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$A={\widehat{T}}^{-1}{\widehat{\Lambda }} ~{\widehat{T}}$$\end{document}$ , where $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${\widehat{\Lambda }}$$\end{document}$ is the diagonal matrix (or a Jordan matrix if not diagonalizable [52]) corresponding to the eignevalues of A.

Left multiplying $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${\widehat{T}}^{-1}$$\end{document}$ to (9) yields

\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\begin{aligned} {\widehat{x}}(t) = {{\widehat{\Lambda }}} {\widehat{x}}(t-1) + {\widehat{B}}_2u(t-1) + {\widehat{o}}(k), \end{aligned}$$\end{document}

where $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${\widehat{x}}(t)={\widehat{T}}^{-1}{x}(t)$$\end{document}$ , $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${\widehat{B}}={\widehat{T}}^{-1}B$$\end{document}$ , and $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${\widehat{o}}(k)={\widehat{T}}^{-1} o(k)$$\end{document}$ . Since $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${\widehat{\Lambda }}$$\end{document}$ is diagonal, each state in (10) evolves independently along its corresponding dynamic mode embedded in $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${\widehat{T}}$$\end{document}$ . Similarly, for the fast dynamics, we compute $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${{\bar{\Lambda }}}$$\end{document}$ and $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${\bar{T}}$$\end{document}$ such that $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$Q={\bar{T}}^{-1}{\bar{\Lambda }} ~{\bar{T}}$$\end{document}$ . These capture the concurrent interactions among brain regions by treating the slow dynamics as perturbations.

We use transformation matrices $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${\widehat{T}}$$\end{document}$ and $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${\bar{T}}$$\end{document}$ as augmented features for subjects to perform fingerprinting, since they encode the dynamic modes of (1). Specifically, we determine the identity of a query sample by minimizing an alignment-based distance:

\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\begin{aligned} \underset{s}{\arg \min }\quad \text {Dist}({\mathcal {T}}_{R_{h}^Q},~{\mathcal {T}}_{R_{s,h}^D}) \end{aligned}$$\end{document}

where $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${\mathcal {T}}$$\end{document}$ is a feature set composing $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$[{\widehat{T}},~{\bar{T}}]\in {\mathbb {R}}^{m\times 2m}$$\end{document}$ , or a subset of these features, depending on which features best capture inter-subject variation. A detailed discussion on this will be given in the Results section. The distance metric $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\text {Dist}(\cdot ,\cdot )$$\end{document}$ is computed as the permutation-aligned pairwise vector distance, which compares two sets of vectors, where the elements are unordered. The key idea is to compute the cosine distance between elements in the two sets after finding an optimal permutation alignment that minimize overall distances, specifically, if defining $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${\mathcal {T}}_{R_{h}^Q}=[x_1,\cdots ,x_{2m}],~{\mathcal {T}}_{R_{s,h}^D}=[y_1,\cdots ,y_{2m}]$$\end{document}$ ,

\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\begin{aligned} \text {Dist}({\mathcal {T}}_{R_{h}^Q},~{\mathcal {T}}_{R_{s,h}^D})=\min _{\pi \in {\mathcal {Z}} } \sum _{i=1}^n \left( 1- \frac{x_i^{\top } y_{\pi (i)}}{\Vert x_i\Vert \Vert y_{\pi (i)}\Vert }\right) \end{aligned}$$\end{document}

where $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${\mathcal {Z}}$$\end{document}$ denotes the set of all permutations of $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\{1, 2, \ldots , 2\,m\}$$\end{document}$ . Note that the eigenvalue matrices $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${{\bar{\Lambda }}}$$\end{document}$ and $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${{\widehat{\Lambda }}}$$\end{document}$ quantify the significance of each mode in the dynamics. We, however, do not incorporate them directly in the fingerprinting task. Instead, our method treats all dynamic modes as equally informative, regardless of their eigenvalues. This contrasts with dimensionality-reduction methods like Principal Component Analysis (PCA), which retain only the most significant components. Our method, via (11), operates in the full space spanned by all dynamic modes, under the assumption that when different subjects perform the same fMRI task, their major dynamic modes might be similar but vary in certain minor dynamic modes.Fig. 3. Architecture of the five-layer GNN used for task fingerprinting

Task causal fingerprint via graph neural network

Unlike subject fingerprint, where the primary challenge arises from the large number of subjects and the limited data per subject, the goal of task fingerprint is to identify a task from a finite set of candidates. In this case, the data is richer, as each task is performed by all participating subjects. Nevertheless, task fingerprinting still faces intrinsic challenges due to inter-subject variability in brain structure and functional performance. As a result, methods relying solely on predefined algebraic operators, as discussed in Sect. 3.2, are no longer sufficient.

In this subsection, we implement a causal-based task fingerprinting by leveraging the extracted signatures $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$R_{s,h}$$\end{document}$ from Sect. 3.1. These signatures provide a graph-theoretic representation of brain networks, which we incorporate into a Graph Neural Network (GNN)-based learning model. To explain our method, we construct a labeled database:

\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\begin{aligned} \widehat{{\mathcal {R}}}=\{R^{D}_{s,h}\}, \quad s\in {\mathcal {S}}, \text { and } h\in {\mathcal {H}} \end{aligned}$$\end{document}

where $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${\mathcal {S}}$$\end{document}$ and $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${\mathcal {H}}$$\end{document}$ are subject sets and fMRI task sets, respectively. Given a query $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$R^Q$$\end{document}$ data with unknown identity and task, our goal is to determine the task index h of the query using the database $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\widehat{{\mathcal {R}}}$$\end{document}$ .

A key advantage of Graph Neural Networks (GNNs) is their ability to model input data as graph structures [53]. This aligns naturally with our data representation, where the matrix $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$R = [Q~A~B_{1}~B_{2}]$$\end{document}$ encodes causal interactions between brain regions. Each row of R captures how a specific brain region is influenced by other regions and external inputs. We define the GNN model $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${\mathcal {G}}({\mathcal {V}}, {\mathcal {E}})$$\end{document}$ , where $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${\mathcal {V}}$$\end{document}$ represents the set of nodes with $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$|{\mathcal {V}}| = m$$\end{document}$ . The feature of each node is a vector in $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${\mathbb {R}}^{(2m+2n)}$$\end{document}$ , corresponding to each row of R. The normalized matrix A in R is used as the adjacency matrix to initialize the edge features $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${\mathcal {E}}$$\end{document}$ .

Building on this input structure, we employ a five-layer Graph Neural Network architecture tailored for task fingerprinting from fMRI data. The network starts with a GATv2Conv layer that re-weights each node’s neighbours, emphasising task-relevant connections [54] (input 200, out 32 features with 4 attention heads, dropout 0.30). A parallel linear skip path (output 128 channels) is summed with the convolution result so the original signal remains visible to deeper layers, improving gradient flow and protecting against over-smoothing [55]. TopKPooling then selectively retains the most informative nodes (ratio 0.8), reducing graph complexity and focusing computation on salient sub-structures. This sparsification step improves both efficiency and performance. A second GATv2Conv block, again accompanied by a skip connection, further refines the retained features and maps them into a more discriminative space for task recognition (input 128, out 32 features with 2 heads, dropout 0.30; $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$32\times 2=64$$\end{document}$ output channels; skip output 64 channels). Global-mean pooling aggregates these refined node features across the graph, producing a single 64-dimensional vector. A linear head of size $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$64\times 8$$\end{document}$ , followed by log-softmax, converts this vector into task probabilities for training and evaluation. During training, we randomly drop $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$10\%$$\end{document}$ of edges to enhance robustness. The overall model structure is visualised in Fig. 3. The model is trained using supervised learning with data batches drawn from $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\widehat{{\mathcal {R}}}$$\end{document}$ . The rows of $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$R^D_{s,h}$$\end{document}$ serve as the input features for each node, while h is used as the corresponding fMRI task label. Detailed training parameters are provided in the Appendix.

New visualization: brain reachability landscape

In control theory, reachability refers to the ability of a control system to move from one state to another using admissible control inputs. In this subsection, we apply this concept to the causal dynamics in (1), which enables a novel visualization tool that characterizes the reachability of the brain state x(t). This generates a quantitative landscape of regional brain excitation level, which complements traditional connectivity measures that focus on the interaction patterns among brain regions.

To achieve this, we first rewrite the model (1) into an equivalent state evolution form:

\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\begin{aligned} x(t)&= f(x(t-1),u(t),u(t-1))\nonumber \\&= {\widehat{A}}\,x(t-1) + {\widehat{B}}_{1}\,u(t)+ {\widehat{B}}_{2}\,u(t-1), \end{aligned}$$\end{document}

where

\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\begin{aligned} {\widehat{A}}&\triangleq (I - Q)^{-1}A, \\ {\widehat{B}}_2&\triangleq (I - Q)^{-1}B_2, \\ {\widehat{B}}_1&\triangleq (I - Q)^{-1}B_1. \end{aligned}$$\end{document}

Compared to the original model (1), which explicitly represents the fast-slow interactions in brain dynamics, the reformulated model (12) integrates fast interactions directly into the state evolution. This facilitates a direct interpretation of how the system state evolves over time in response to control inputs.

Given a terminal time $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$T_M$$\end{document}$ , bounded energy on the control input sequence $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$U^{0:T_M-1}=~\begin{bmatrix} u(0)&u(1)&\cdots u(T_M-1) \end{bmatrix}$$\end{document}$ , and zero initial state, we define the fMRI reachability set as:

\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\begin{aligned}&\quad {\mathcal {B}}\triangleq \bigcup x(T_M)\\ s.t.&\quad x(t) = f(x(t-1),u(t),u(t-1)),\\&\quad \Vert U^{0:T_M-1}\Vert _2\le 1,\\&\quad x(0)=0 \end{aligned}$$\end{document}

Here, the set $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${\mathcal {B}}$$\end{document}$ quantifies the maximum possible activation of each brain region under bounded input energy. To compute $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${\mathcal {B}}$$\end{document}$ , since $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$x(T_M)$$\end{document}$ is a vector, one can optimize each of its entries individually (corresponding to specific brain regions). Given that the system dynamics $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$f(\cdot )$$\end{document}$ are linear and the constraints are convex, this optimization can be efficiently solved using linear programming.Fig. 4. Three-tier illustration mapping the reachable-set activation surface (top) onto a flattened 12 $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\times$$\end{document}$ 12 cortical grid (middle) and, from there, onto its anatomical locations in a 3-D brain rendering (bottom). Colored tiles mark four representative regions

To visualize $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${\mathcal {B}}$$\end{document}$ , we map its values onto a two-dimensional grid corresponding to the cortical parcellations (using a 90-region atlas arranged in a 12 $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\times$$\end{document}$ 12 layout; see Fig. 4). Each region is colored according to its normalized reachability value, producing a reachability-based heatmap. An example of this visualization, using real-world fMRI data, is shown in Fig. 8, Sec. 5.2. Such heatmaps can be generated for each subject and each task condition to reveal the distinct spatial patterns of maximal reachability across brain regions. This offers a new way of brain network visualization, complementing standard correlation-based or connectivity-focused analyses that emphasize inter-regional interaction patterns.

Results

This section uses real-world datasets to verify the effectiveness of the proposed causal-based fingerprints for both subject and task fingerprints, as well as the reachability visualization.Fig. 5a Functional areas of the brain. b–d Ten Schaefer Parcellation Areas (blue dots) that are chosen as system inputs: two areas are associated with the prefrontal cortex; two areas are associated with the pre-motor cortex; two areas are associated with the somatosensory cortex; two areas are associated with the visual cortex; two areas are associated with the auditory cortex. We assume these are areas likely to ‘initiate’ certain brain activitiesTable 1Subject fingerprinting using multiple approachesMethodCM+MD&P (%)CM+CoR (%)CM+FN (%)CM+GNN (%)FC+CoR (%)FC+MD&P (%)FC+FN (%)FC+GNN (%)TSF (%)PH-H0 (%)Rest1 LR95.39658.39742.251 $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$<\!5$$\end{document}$ 45.43969.73633.952 $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$<\!5$$\end{document}$ 88.785.6Rest2 LR96.33461.29547.175 $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$<\!5$$\end{document}$ 45.43670.92937.574 $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$<\!5$$\end{document}$ 89.486.2Rest1 RL94.97057.11841.333 $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$<\!5$$\end{document}$ 41.60367.94538.698 $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$<\!5$$\end{document}$ 87.984.7Rest2 RL94.20358.97843.463 $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$<\!5$$\end{document}$ 46.80367.77536.737 $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$<\!5$$\end{document}$ 88.584.3CM causal dynamics model, FC functional connectivity, CoR correlation, MD&P modal decomposition and projection, GNN graph neural network, FN frobenius norm, PH-H0 persistent homology (H0 landscape) [47], TSF tangent-space fingerprinting [46]Table 2. Subject fingerprinting accuracy (%) comparing single- vs. two-timescale approachesMethodOne timescale (%)Two timescale fast + slow (%)Two timescale slow only (%)Rest1 LR31.54281.24595.396Rest2 LR30.93780.90496.334Rest1 RL29.88678.51794.970Rest2 RL30.12379.71094.203

Dataset and modeling

We utilize the Human Connectome Project (HCP) dataset [56], which provides fMRI time-series data for 391 unrelated subjects. Each subject participated in two resting-state fMRI sessions and seven distinct task-based fMRI sessions: emotional response (Emot), gambling (Gamb), language processing (Lang), motor function (Moto), relational processing (Rela), social cognition (Soci), and working memory (WMem). Each subject-task pair has two recordings, scanned in LR (left to right) or RL (right to left) patterns. All fMRI data were parcellated using the Schaefer-100 atlas [57], which partitions the cerebral cortex into $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$p = 100$$\end{document}$ distinct regions. The fMRI time series for each region (parcel) was recorded at 720 ms intervals. In line with Remark 1, although higher-resolution parcellations are available, the 100-region resolution used here is sufficient for our purposes. This level of granularity allows us to investigate how the brain’s causal signatures can distinguish individual subjects and identify specific fMRI tasks, and our results confirm that this resolution is adequate for capturing those distinguishing signatures. In accordance with our model (Equation (1)), we select $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$n = 10$$\end{document}$ out of the 100 brain regions as input nodes. These selected regions are located in the brain’s premotor and sensory cortices (see Fig. 5), which is consistent with known functional hubs identified in [58]. Specifically, the input set includes two parcels from each of the following cortical areas: prefrontal cortex, premotor cortex, somatosensory cortex, visual cortex, and auditory cortex. They were chosen due to their pivotal roles in various cognitive and sensory functions, including high-level task management, memory recall, spatial processing, somatosensory processing, visual perception, and auditory processing [58]. The remaining $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$m = 90$$\end{document}$ brain regions are treated as the system’s state in our model.

Specifically, we use the minimally pre-processed resting-state runs released by the UCSD Library [59, 60] based on the HCP Young Adult raw data [61]. These scans have already undergone the standard HCP pipeline [62], which includes gradient-distortion and motion correction, EPI distortion correction, alignment to MNI space, high-pass filtering with a 2000-s cut-off, and ICA-FIX denoising [63]. After parcellation with the Schaefer-100 atlas, each run yields a $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$100\times 1190$$\end{document}$ matrix of regional time-series sampled every 0.72 s [64].Fig. 6. Left: Subject Fingerprinting based on CM+MD&P: The identification accuracy is compared with (black line) using the FC+CoR method. Right: Task Fingerprinting based on CM+GNN: The figure shows the correct/incorrect classification for the same/different fMRI tasks. The y-axis is split. The top segment (95–100%) shows the near-perfect accuracies, while the bottom segment (0–2%) zooms in on the small error rates

Subject causal fingerprinting

We first verify the effectiveness of the proposed Causal-Dynamics Model combined with Modal Decomposition and Projection (CM+MD&P) for subject fingerprinting. Previous research has established that resting-state fMRI shows considerable differences in brain activation patterns among individuals [65]. The dataset consists of two resting sessions, Rest1 and Rest2, each with two scanning orders, LR and RL. Preprocessing follows Sect. 4.1.

We run four cross-condition folds: each of the four resting scans (Rest1-LR, Rest1-RL, Rest2-LR, Rest2-RL) is taken in turn as the reference database, denoted as $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${\mathcal {R}}_h$$\end{document}$ , while the remaining three scans serve as queries. This guarantees that reference and query data are always disjoint. Within each fold, model parameters $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$(Q,A,B_1,B_2)$$\end{document}$ are estimated only from the reference run; the ensuing modal matrices serve as one-shot “causal signatures” and are not updated during querying. Both baselines and the proposed method use the same four-fold split and identical preprocessing.

The subject fingerprint is obtained by solving (11). For the results presented below, we chose the feature set as $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${\mathcal {T}}= [{\widehat{T}}]$$\end{document}$ , which relies only on the dynamic models of slow interaction and ignores the fast one. More details about this will be discussed in Remark 2. The subject fingerprint result, presented in Table 1, demonstrate that the causal fingerprinting method, specifically CM+MD&P, consistently achieves an accuracy rate above 94% for all resting sessions. The highest accuracy observed is 96.334% for the Rest2 LR session. This performance significantly surpasses that of the correlation-based baseline method, FC+CoR, which averages approximately 45% accuracy. Even though capturing causal interactions is harder, our method not only outperforms the FC+CoR baseline but also other alternative approaches such as CM+CoR, CM+FN, and FC+MD&P. Tangent-Space Fingerprinting (TSF) achieves $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$88\%$$\end{document}$ accuracy, about 6 percentage points behind CM+MD&P, because it models only the Riemannian geometry of static functional connectivity. The PH-H0 landscape method attains $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$85\%$$\end{document}$ ; its zero-dimensional topological summary likewise ignores directionality and temporal scale, leaving it 10 percentage points short of our causal approach. To ensure a fair comparison, we applied the CM+MD&P method as described in Sec. 3.2 and the FC+CoR method as outlined in the 2015 study by FES et al. [18] directly to our dataset, without any further adjustments or preprocessing. Although additional data processing could potentially improve the accuracy of all methods tested, our main goal is to confirm the viability and efficacy of causal signatures for individual identification. Therefore, we have deliberately avoided such optimizations to maintain an impartial comparison. To assess the generalizability of our approach, we ran experiments on a public dataset [66]. This dataset contains resting-state fMRI scans from healthy controls and Alzheimer’s disease patients. It does not include repeated sessions that match our original design. To work around this, we split one long resting-state scan into two separate segments. This produced two distinct datasets for analysis. Our method achieved nearly 99% accuracy in subject fingerprinting. These results highlight its strong performance and replicability across different populations and scanning protocols.

A detailed comparison is presented in Table 1, showing the average accuracy (across cases from Fig. 6-Left) for combinations of feature extraction methods (CM and FC) and classification methods (MD&P, CoR, FN, and GNN). Here, FN refers to the Frobenius norm $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\Vert R^Q - R^D\Vert _F$$\end{document}$ , used to measure distances between query and database matrices for classification. The proposed CM+MD&P method achieves the highest performance among all combinations. Comparing FC+CoR and FC+MD&P suggests that MD&P can effectively evaluate similarities between FC matrices, leading to improved fingerprinting accuracy. The FN method is less robust than MD&P for classification. And the GNN method is generally unsuitable for subject fingerprinting due to limited data availability. We also tested other deep learning models with convolution or attention mechanisms and various layer designs, but they similarly showed poor performance.

Remark 2

(Single vs. two-timescale modeling and feature selection) To implement (11), the above result considers a two-timescale model, but uses only the slow dynamical modes, i.e., $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$(A, B_2)$$\end{document}$ , for subject fingerprinting. In addition to this configuration, one can also consider: (i) using a single-timescale state-space model, or (ii) using the full two-timescale dynamics, i.e., $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$(Q, A, B_1, B_2)$$\end{document}$ . In Table 2, we report the subject fingerprint accuracy achieved under these three configurations. The single-timescale model achieves approximately 30% accuracy, which indicates limited discriminative power when ignoring the multi-timescale nature of brain activity. Incorporating the full two-timescale dynamics significantly improves performance to around 80% (presented in our prior work [1]), which demonstrates the value of separating fast and slow modes. Surprisingly, the best results are achieved using only the reduced model $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$(A, B_2)$$\end{document}$ , which exceeds 94% accuracy across various resting-state permutations. This result suggests that while accounting for both fast and slow dynamics is necessary for enhancing performance, the most distinctive subject-specific information resides in the slower portion of these dynamics. A potential reason might be that fMRI data are typically dominated by hemodynamic responses spanning seconds, the faster sub-interval interactions and the simultaneous input effects tend to be noisier and harder to resolve.

Table 3. Task fingerprinting accuracy using multiple approachesMethodCM+GNNCM+RFCM+SVMFC+GNNR+GNNR+DNNR+DBRNNR+BAnDR+TCN–BiLSTMRest**100%**100%**99.232%97.307%96.629%95.164%94.355%**100%99.532%Emot99.872%99.488%85.677%89.160%69.238%72.912%84.757%98.493%93.445%Gamb96.122%74.168%93.094%75.553%81.683%68.535%69.023%94.176%89.395%Lang99.159%96.675%89.514%85.755%89.177%81.517%82.204%97.491%91.304%Moto96.733%80.562%96.675%77.982%76.858%66.374%77.773%95.754%88.992%Rela98.051%81.419%81.841%90.609%86.554%87.700%79.628%93.121%89.663%Soci97.544%75.703%77.749%69.811%73.706%71.455%69.950%96.385%92.118%WMem99.523%90.767%95.396%78.760%80.202%76.620%82.446%98.299%94.327%Average98.376%**87.350%89.897%83.117%81.756%78.929%80.017%96.710%92.347%Bold siginficance indicate the best value within each rowCM causal dynamics model, GNN graph neural network, DNN deep neural network [42]FC functional connectivity, SVM support vector machines, BAnD brain attend and decode [44]R raw data, RF random forest, DBRNN deep bidirectional recurrent neural network [43] TCN–BiLSTM temporal convolutional network–bidirectional long short–term memory [45]

fMRI task causal fingerprinting

We validate the effectiveness of the proposed Causal Modeling (CM) combined with Graph Neural Network (GNN) method for task fingerprinting, as presented in Table 3. Using causal signatures from the HCP dataset, we split the data evenly, with 50% allocated for training and 50% for testing. The model achieves perfect accuracy (100%) for Rest and near-perfect accuracy for Emot, Lang, WMem. However, tasks such Gamb, Moto, Rela, and Soci exhibit slightly lower performance, with most misclassifications occurring among these tasks.

To illustrate the strengths of our CM+GNN method, we compared it to several state-of-the-art approaches across eight tasks [43, 44, 67].2 Other columns in Table 3 report the average accuracy of these methods. Our CM+GNN method consistently outperforms others, achieving the highest average accuracy (98.376%), which highlights its robustness in task fingerprinting. For Emot and Lang, CM+GNN effectively identifies unique neural patterns, surpassing CM+RF and Raw+DBRNN. Performance decreases slightly for tasks like Gamb, Moto, and Soci, possibly due to overlapping neural activity. Similar declines are observed in other methods, such as Raw+DNN and Raw+DBRNN, for these challenging tasks.

We also evaluated the R + TCN–BiLSTM baseline, a Temporal Convolutional Network followed by bidirectional LSTM layers trained end to end on raw fMRI series [45]. Its mean accuracy of $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$92.3\%$$\end{document}$ is higher than the other raw-data deep nets (R + GNN, R + DNN, R + DBRNN) yet remains about 6 percentage points below CM + GNN. While the convolutional blocks capture local temporal motifs, the model lacks directed and multi-timescale information, limiting its discriminative power relative to our causal signatures.

To further assess the reliability of our causality-based approach, we tested it on an independent dataset [68]. The method achieved nearly 95% accuracy in task fingerprinting, confirming its consistency and wide applicability.

Effect of sampling interval \documentclass[12pt]{minimal}

			\usepackage{amsmath}
			\usepackage{wasysym} 
			\usepackage{amsfonts} 
			\usepackage{amssymb} 
			\usepackage{amsbsy}
			\usepackage{mathrsfs}
			\usepackage{upgreek}
			\setlength{\oddsidemargin}{-69pt}
			\begin{document}$$\Delta t$$\end{document}Δt on model fingerprinting accuracy

Note that the sampling interval $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\Delta t$$\end{document}$ is critical to our discrete-time causal model. In this experiment, we adopt the native sampling interval $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\Delta t = 0.72$$\end{document}$ s because it is the scanner’s repetition time (TR), as specified in the data description [61]. Repetition time is the time interval between the beginning of one MRI pulse sequence and the beginning of the next for the same slice. Staying at this original resolution avoids any resampling artifacts and preserves the rapid hemodynamic changes our two-timescale model is meant to capture. Table 4. Subject fingerprinting accuracy (%) under different sampling intervals $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\Delta t$$\end{document}$ (seconds) $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\Delta t$$\end{document}$ (s) $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${\Delta t = 0.72}$$\end{document}$ (%) $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${\Delta t = 1.44}$$\end{document}$ (%) $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${\Delta t = 2.16}$$\end{document}$ (%)Rest1 LR95.39690.14380.636Rest2 LR96.33492.00281.452Rest1 RL94.97088.74778.890Rest2 RL94.20389.56377.938

Nevertheless, it is still important to discuss the sensitivity of our algorithm to the sampling interval $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\Delta t$$\end{document}$ . Table 4 summarizes how resampling affects subject fingerprinting. Accuracy is computed for each of the four resting runs at three resolutions: the native 0.72 s, and down-sampled versions at 1.44 s (every second frame) and 2.16 s (every third frame).

The results in Table 4 confirm that accuracy peaks at the native interval ( $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\Delta t = 0.72$$\end{document}$ s), the scanner’s inherent repetition time in the HCP protocol and therefore the finest temporal resolution available. Accuracy drops by roughly five percentage points when the data are down-sampled to 1.44 s and falls a further ten points at 2.16 s. The two-timescale model evidently relies on sub-second variations that are smoothed out at coarser resolutions, degrading the estimated matrices $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$Q \& A$$\end{document}$ blocks. We therefore retain $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\Delta t = 0.72$$\end{document}$ s for all reported experiments to preserve temporal fidelity while avoiding unnecessary information loss.

Task fingerprinting is less sensitive: accuracy falls from $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$98.376\%$$\end{document}$ at $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\Delta t = 0.72$$\end{document}$ s to $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$96.442\%$$\end{document}$ at 1.44 s and $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$92.127\%$$\end{document}$ at 2.16 s, suggesting that large-scale task-evoked activation patterns remain highly discriminative even when the temporal resolution is moderately reduced.

Visualization: directed causal interaction and reachability landscape

Fig. 7. Visualization of Brain Regions with Strongest Connection Weights, Showing Association Strengths Across Multiple Cognitive Tasks (Importance Scores Indicated by Color-Bar). LH refers to the left hemisphere, and RH refers to the right hemisphere of the brain. Arrows indicate directed relationships, illustrating which regions exert causal influence on others. For clarity of presentation, we employ a two-color scheme and apply a threshold to highlight only the most active brain areas and their directional interactions

Directed causal interaction

We use Fig. 7 to present the causal signatures extracted for the eight HCP tasks (Rest, Emot, Gamb, Lang, Moto, Rela, Soci, and WMem) and serve as the foundation for our functional interpretation. Before plotting, the matrices Q and A are rescaled to a same range, which enables cross-task comparison on a unified metric. Each entry of Q or A conveys both magnitude and sign, which reveals the direction and excitation/inhibition relations of causal drive between pairs of regions. We use arrows in the figures to indicate the directions of these causal influences, which are discussed in detail for each task in the subsequent analysis. By separating matrix Q and matrix A, we can observe how brain activity evolves over different timescales. Matrix Q captures fast synchronous interactions among brain regions, reflecting real-time activation patterns during specific tasks. In contrast, Matrix A illustrates slower dynamic evolutions, showing how current states are influenced by previous states.

For the resting state data, brain interactions concentrate within the default-mode network (DMN), especially the medial prefrontal cortex (mPFC) and posterior cingulate cortex (PCC), which support introspective thinking, self-related processing, memory integration and rest-task dichotomy [7]. The mPFC exerts influence over the PCC, modulating introspective and self-related processing. The emotion task shows strong connections in the amygdala, prefrontal cortex, and anterior insula. This is due to their roles in emotion generation and regulation. The gambling task primarily shows strong connections in the ventral striatum, prefrontal cortex, and insula, which are crucial for reward prediction and decision-making. The ventral striatum influences the prefrontal cortex and insula, affecting decision-making and reward processing. The language task primarily shows strong activations in Broca’s area (Brodmann areas 44 and 45 [69]) and Wernicke’s area (Brodmann area 22) in the left hemisphere. Broca’s area is crucial for language production, while Wernicke’s area is essential for language comprehension. Broca’s area influences Wernicke’s area, facilitating coordinated language processing. Motor execution engages the primary motor cortex, supplementary motor area (SMA), and basal ganglia. This reflects their active roles in movement planning and control. Causal arrows show the primary motor cortex directing the SMA as actions unfold. For the relational reasoning task, the dorsolateral prefrontal cortex (DLPFC) and posterior parietal cortex provide the principal scaffold for working memory, abstraction, and relational encoding. The DLPFC leads the parietal cortex to facilitate complex cognitive manipulation. Social cognition involves strong coupling between the mPFC and superior temporal sulcus (STS). These are regions implicated in theory of mind and socio-emotional appraisal. The mPFC drives the STS to modulate social evaluative processing. During working memory maintenance, robust edges connect the DLPFC, parietal cortex, and anterior cingulate cortex (ACC). This aligns with their critical roles in information storage and monitoring. Here, the DLPFC influences parietal regions to sustain mnemonic content.

It can be observed that certain tasks share overlapping strong connections in specific regions. For example, the gambling, social, and emotional tasks all rely on pathways linking the prefrontal cortex and insula, caused by the collaborative role of these regions’ in risk-related decision making, emotion processing, and social cognition. Language and relational tasks converge within left temporal structures, notably Wernicke’s area and the hippocampus, which highlights their joint involvement in linguistic analysis and declarative memory. Motor and working memory conditions depend on coordinated activity between the DLPFC and parietal cortex, underscoring these regions shared contribution to motor sequencing and the active maintenance of information. This might be one of the reasons for the performance degradation (commonly shared by most methods) for task fingerprinting.Fig. 8. Reachability-based heatmaps for one subject across eight cognitive tasks (Rest, Emot, Gamb, Lang, Moto, Rela, Soci, and WMem). Brighter colors (yellow) indicate stronger maximum neural activations, while darker tones (blue) reflect lower excitability. Red dashed circles highlight representative activation hotspots corresponding to critical brain regions known for each cognitive task. The black boundary outlines the edge of the flattened cortical map. The correspondence between this reachability representation and anatomical brain regions is illustrated in Fig. 4. These patterns highlight functional specialization differences across diverse cognitive tasks

Reachability landscape

Fig. 8 illustrates reachability-based heatmaps describing the potentially highest activation level for a single subject across eight cognitive modes: Rest, Emot, Gamb, Lang, Moto, Rela, Soci, and WMem. Each heatmap represents normalized activation levels computed from a 90-region cortical parcellation arranged on a 12 $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\times$$\end{document}$ 12 spatial grid. Brighter areas (yellow) indicate higher neural excitability under bounded input conditions, whereas darker regions (blue) denote lower activation potentials. During the resting state (Rest), the default-mode network (DMN) shows the strongest activation. The medial prefrontal cortex (mPFC) and posterior cingulate cortex (PCC) dominate this pattern [70]. During emotional processing (Emot), the limbic system becomes highly active. The ventromedial prefrontal cortex (vmPFC), amygdala, and insula lead this response [71]. In the gambling task (Gamb), the orbitofrontal cortex and ventral striatum show clear clusters of activation [72]. These clusters signal heightened responsiveness linked to reward anticipation and decision-making. Language-related activations (Lang) appear primarily within classical left-lateralized perisylvian areas, prominently involving Broca’s area (inferior frontal gyrus) and Wernicke’s area (superior temporal gyrus) [73]. Motor execution (Moto) triggers the motor network. The primary motor cortex, supplementary motor area, and nearby premotor regions show strong excitation [74]. Relational reasoning (Rela) engages the frontoparietal control network. The dorsolateral prefrontal cortex (dlPFC) and posterior parietal cortex display distinct activation foci [75]. Social cognition (Soci) relies on theory-of-mind regions. The medial prefrontal cortex and temporoparietal junction (TPJ) dominate the pattern [76]. Working memory (WMem) activates a frontoparietal network. The dorsolateral prefrontal cortex and superior parietal cortex show synchronized engagement [77]. These reachability-based visualizations provide individualized activation maps. They validate established neurofunctional models across cognitive domains. They also demonstrate the capability of reachability analysis for revealing concentrated brain regions under different modes.

Conclusion

To the best of our knowledge, we are among the first to investigate and quantify large-scale brain fingerprints in the context of causal dynamics. This paper proposed new methods that employ an implicit-explicit discretization scheme to derive a new state-space model, which captures causal signatures as directed interactions with two-timescale temporal resolution. The causal dynamics model also enabled a novel visualization tool: the reachability landscape of brain states, which quantitatively represents the possible activation levels that the brain regions can reach under various fMRI tasks. The effectiveness of the proposed methods and visualization tools was validated using real-world datasets and through comparisons with other state-of-the-art techniques.

Despite the novelty of the proposed approach, we remark on the following limitations: (i) In our approach, system inputs were manually specifically selected based on different functional brain regions, while other ways of choosing system state input regions may potentially further improve the performance. (ii) While our subject fingerprinting method is model-based and generalizable without training, the task fingerprinting method relies on training a Graph Neural Network (GNN), which may limit its generalization across datasets. (iii) Subject fingerprinting shows measurable sensitivity to the sampling interval $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\Delta t$$\end{document}$ (coarser sampling degrades accuracy).

Regarding clinical applications, our proposed method has the potential for the development of diagnostic and prognostic tools for neurodegenerative diseases such as Alzheimer’s Disease (AD). By analyzing functional and directional influences between brain regions, the proposed approach can associate changes in brain network dynamics with the emergence of clinical symptoms.

In future work, we plan to: (i) Explore causal fingerprints and the reachability landscape concept in both healthy controls (e.g., through other open neuroimaging datasets) and in cases of neurodegenerative diseases such as Alzheimer’s disease [11, 55, 78]; (ii) Enhance the generalizability of our task fingerprinting approach, potentially through transfer learning techniques to accelerate adaptation to new datasets. (iii) Systematically assess robustness to choices of sampling interval $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\Delta t$$\end{document}$ , parcellation, and preprocessing, and report performance in a manner that is independent of the acquisition and processing pipeline; iv) Extend the two-timescale linear model to piecewise linear or mildly nonlinear dynamics with uncertainty quantification, and replace manual selection of input regions with a data-driven procedure to reduce design bias.

Bibliography7

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1Duong-Tran D, Nguyen N, Mu S, Chen J, Bao J, Xu F, Garai S, Cadena-Pico J, Kaplan AD, Chen T, et al. A principled framework to assess information theoretical fitness of brain functional sub-circuits. ar Xiv preprint ar Xiv:2406.18531
2Nguyen N, Hou T, Amico E, Zheng J, Huang H, Kaplan AD, Petri G, Goñi J, Kaufmann R, Zhao Y, et al. Volume-optimal persistence homological scaffolds of hemodynamic networks covary with meg theta-alpha aperiodic dynamics. In: International conference on medical image computing and computer-assisted intervention; 2024. pp. 519–529. Springer.
3Wang X, Cortes J. Efficient reconstruction of neural mass dynamics modeled by linear-threshold networks. ar Xiv preprint ar Xiv:2308.14231. 2023.
4Tipnis U, Abbas K, Tran E, Amico E, Shen L, Kaplan AD, Go IJ. Derived products from hcp-ya f MRI. Lawrence Livermore National Laboratory (LLNL) Open Data Initiative. 2022. https://library.ucsd.edu/dc/object/bb 59818382
5Tipnis U, Abbas K, Tran E, Amico E, Shen L, Kaplan AD, Goñi J. Functional connectome fingerprint gradients in young adults. ar Xiv preprint ar Xiv:2011.05212 2020.
6Ugurbil K, Van Essen D. 1200 Subjects Data Release. https://www.humanconnectome.org/storage/app/media/documentation/s 1200/HCP_S 1200_Release_Reference_Manual.pdf
7Foret P, Kleiner A, Mobahi H, Neyshabur B. Sharpness-aware minimization for efficiently improving generalization. ar Xiv preprint ar Xiv:2010.01412 2020.