Importance of localized dilatation and distensibility in identifying determinants of thoracic aortic aneurysm with neural operators

David S. Li; Somdatta Goswami; Qianying Cao; Vivek Oommen; Roland Assi; Jay D. Humphrey; George E. Karniadakis

PMC · DOI:10.1371/journal.pcbi.1013550·October 9, 2025

Importance of localized dilatation and distensibility in identifying determinants of thoracic aortic aneurysm with neural operators

David S. Li, Somdatta Goswami, Qianying Cao, Vivek Oommen, Roland Assi, Jay D. Humphrey, George E. Karniadakis

PDF

Open Access

TL;DR

This study shows that combining shape and mechanical data improves the identification of causes behind thoracic aortic aneurysms, potentially leading to better personalized treatments.

Contribution

The study introduces a novel approach using neural operators to integrate dilatation and distensibility data for predicting TAA causes.

Findings

01

Prediction errors are significantly lower when using both dilatation and distensibility data compared to dilatation alone.

02

UNet is identified as the best-performing neural network architecture for this task.

03

Full-field measurements of dilatation and distensibility are crucial for identifying the underlying pathologic mechanisms of TAAs.

Abstract

Thoracic aortic aneurysms (TAAs) stem from diverse mechanical and mechanobiological disruptions to the aortic wall that can also increase the risk of dissection or rupture. There is increasing evidence that dysfunctions along the aortic mechanotransduction axis, including reduced integrity of elastic fibers and loss of cell-matrix connections, are particularly capable of causing thoracic aortopathy. Because different insults can produce distinct mechanical vulnerabilities, there is a pressing need to identify interacting factors that drive progression. In this work, we employ a finite element framework to generate synthetic TAAs arising from hundreds of heterogeneous insults that span a range of compromised elastic fiber integrity and cellular mechanosensing. From these simulations, we construct localized dilatation and distensibility maps throughout the aortic domain to serve as…

Linked entities

Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.

Diseases4

rupture thoracic aortopathy aorta TAA

Figures8

Click any figure to enlarge with its caption.

Fig 1 — Synthetic data generation pipeline.(a) Insult distributions along the circumferential (θ) and axial (z) directions are randomly generated with Gaussian random field (GRF) methods to define normalized insult profiles (ϑ*(θ,z)∈[0,1]) that are mapped to the initial loaded aortic geometry to define the insult region (see S1 Supporting information for further details). (b) For each profile, ϑ* delineates multiple cases of mechanobiological insults defined by combinations of compromised integrity of elastic fibers (ϑce(θ,z)∈[0,0.48]) and dysfunctional mechanosensing (ϑδ(θ,z)∈[0,0.28]) as inputs to (c) the nonlinear FE simulation to compute the long-term evolved state of the TAA. (d) Maps for dilatation (d) and distensibility (𝒟) are obtained from the final geometry under multiple in vivo loading conditions (diastolic and systolic pressures), either processed as heat maps or converted to 8-bit grayscale maps, to serve as training data for the neural networks. ϑce+ϑδ combinations are constrained to produce closely matched maximum dilatations (dmax≈1.5) for each case to focus attention on detecting the underlying pathologic mechanism at the time when a dilatation first reaches aneurysmal status.

Fig 2 — Schematic representation of two DeepONet architectures.Each branch net is a (a) CNN or (b) FNN that embeds the dilatation (d) and distensibility (𝒟) maps as either grayscale or heat maps. Each trunk net takes the coordinates {θ^,z^} to define the output domains of the corresponding insult contributor. The solution operators for each insult (𝔾θi (i=ce,δ)) are formed from element-wise dot products of the outputs of the branch and trunk networks, with shared learnable parameters (θ). Minimization of the loss function (Lθ), defined as the combination of both operator outputs, determines the optimal parameters that enable estimation of the insult profiles and contributors (ϑi^).

Fig 3 — Schematic representation of the UNet architecture.Maps of dilatation (d) and distensibility (𝒟) are encoded via successive layers of two-dimensional convolution (Conv2D), group normalization, and Gaussian Error Linear Unit (GELU) activation. Down- and up-sampling the input by factors of 2 is achieved through two-dimensional max-pooling operations (Maxpool2D) and two-dimensional transpose convolutional operations (Conv2DTranspose), respectively. Finally, skip connections are implemented to propagate information from earlier layers.

Fig 4 — Schematic diagram of the LNO architecture.The dilatation (d) and distensibility (𝒟) are lifted to a higher dimension via shallow neural network 𝒫, to which Laplace layers are applied, each yielding the output u(t)=f[Kϕ(s)V(s)], where Kϕ(s) is the Laplace transform of the kernel integral transformation and V(s) is the Laplace transform of the lifted input function. Each layer contains pole-residue methods to obtain the transient and steady-state responses (utr(t) and ust(t), respectively), with residues βn, coefficients αℓ, and response poles γn and λℓ. Finally, the outputs (insult profiles) are projected back into the target dimension using the shallow network 𝒬.

Fig 5 — Performance of four different neural networks in predicting insult contributors to aneurysmal dilatation.Relative ℒ2 errors are reported separately for compromised (a) elastic (eln) fiber integrity and (b) mechanosensing over all testing cases considered: dilatation (d) only in grayscale and heat map formats, and dilatation and distensibility (d & 𝒟) in grayscale and heat map formats.

Fig 6 — Predictions from all four architectures for an elastic fiber integrity-dominated combined insult.(a) The ground truth combined insult field consisted of contributions of both compromised elastic fiber integrity (ϑce) and dysfunctional mechanosensing (ϑδ) superimposed in the FE simulation to generate the TAA with dilatation and distensibility profiles shown in (b–c). Predictions (ϑ^i) and absolute errors (ϑ^i−ϑi (i=ce,δ)) are shown for CNN-DeepONet, FNN-DeepONet, UNet, and LNO trained on (b) dilatation grayscale maps only and (c) dilatation and distensibility grayscale maps.

Fig 7 — Predictions from all four architectures for a mechanosensing-dominated combined insult.Similar to Fig 6 except for heat map inputs. (a) The ground truth combined insult field consisted of contributions of both compromised elastic fiber integrity (ϑce) and dysfunctional mechanosensing (ϑδ) superimposed in the FE simulation to generate the TAA with dilatation and distensibility profiles shown in (b–c). Predictions (ϑ^i) and absolute errors (ϑ^i−ϑi (i=ce,δ)) are shown for CNN-DeepONet, FNN-DeepONet, UNet, and LNO trained on (b) dilatation heat maps only and (c) dilatation and distensibility heat maps.

Fig 8 — Effects of grayscale versus heat map data inputs.All networks were trained on dilatation (d) maps only. (a) Ground truths for elastic fiber integrity and mechanosensing contributions. Predictions (ϑ^i) and computation of absolute errors (ϑ^i−ϑi (i=ce,δ)) are shown for (b) CNN-DeepONet, (c) FNN-DeepONet, (d) UNet, and (e) LNO.

Funding2

—http://dx.doi.org/10.13039/100000002National Institutes of Health
—http://dx.doi.org/10.13039/100000002National Institutes of Health

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAortic aneurysm repair treatments · Aortic Disease and Treatment Approaches · Elasticity and Material Modeling

Full text

Introduction

Treatment planning for individuals with thoracic aortic aneurysm (TAA), a localized dilatation resulting from underlying microstructural damage, continues to be based primarily on aortic size and growth rate [1], with maximum size being the de facto predictor. Nevertheless, life-threatening aortic events occur below established thresholds [2,3]. Importantly, there is increasing evidence that dysfunction can occur at several points along the mechanotransduction axis [4–8] that informs aortic cells responsible for maintaining healthy mural structure and function. These may include reduced or dysfunctional fibrillin-1 that stabilizes elastic fibers in the aortic wall [9,10], loss of cell-matrix connections (microfibrils and integrin binding sites) required for accurate assessment of wall stress and assembly of extracellular matrix [11,12], disrupted deposition or organization of fibrillar collagens [13], aberrant transforming growth factor-beta signaling [14], and altered cellular contractility [15,16]. Many of these effects have been shown to result in aneurysmal dilatation; yet, because different insults may lead to lesions of similar size but vastly different mechanical strength and vulnerability, there is a critical need to understand the underlying biomechanical and mechanobiological mechanisms that drive disease progression in a patient-specific manner. Moreover, since mechanobiological function (and its compromise) cannot be directly evaluated in vivo, such insight must be gained using often-limited, minimally-invasive clinical information following diagnosis.

Toward this end, we recently assessed the performance of neural network models in identifying factors that could contribute to TAA in a synthetic dataset resulting from finite element (FE) simulations, using metrics that can be derived simply from three-dimensional medical image reconstructions [17]. Briefly, a simulation platform for aortic growth and remodeling (G&R) [18] was used to generate aneurysms from randomly distributed, localized insults to structural constituents and mechanobiological mechanisms in a healthy aortic model. Local dilatation (normalized inner radius) and distensibility (normalized difference in radius between systole and diastole) fields from the resulting TAAs were used as training data for a deep operator network (DeepONet) [19] in the form of two-dimensional maps, which revealed the efficacy of a convolutional neural network (CNN)-based DeepONet architecture in predicting the associated initiating insult profile for a given dilatation-distensibility pair with a high degree of accuracy, thereby establishing the feasibility of evaluating mechanobiological compromise using image-based information.

Yet, experimental observations from disease models usually cannot be captured with a computational framework without consideration of multi-contributor insults [20,23]. For instance, recent work elucidating mechanobiological drivers of TAA in the genetic condition Marfan syndrome [21,22] identified several interconnected effects in its natural history, including both structural (e.g., elastic and collagen fibers) and biological (e.g., cellular mechanosensing and mechanoregulation of matrix) impairments in severe disease [10,23]. It is clear that synthetic data for training surrogate models must similarly consist of combined effects to yield clinically useful predictions. To more closely represent the scope of in vivo pathologies leading to TAA, we now build on this established framework by introducing additional factors to our synthetic data generation pipeline, including parameterization of the model to ascending aortic biomechanical properties of a mouse model of TAA (prone to dissection and rupture), as well as, importantly, focusing on aneurysms resulting from multiple superimposed initiators.

Furthermore, looking beyond the DeepONet framework, we wish to evaluate a range of candidate neural operators varying in type and architecture (including UNets and Laplace Neural Operators) that may be best suited for these applications. We compare the performance of several architectures in identifying these multi-contributor insult profiles from maps of local dilatation and distensibility. Importantly, to bracket the range of predictive capability in the lower limit of available clinical information, each network is also trained with knowledge of dilatation alone. We show that both dilatation and distensibility information are necessary for accurate estimation of combined insult profiles, especially for TAAs that exhibit similar maximum dilatations. When assessing the predictive accuracy of these models when trained on both dilatation and distensibility data versus dilation data only, we find that the prediction accuracy is significantly lower when relying solely on dilatation across all networks. Additionally, models based on convolutional neural networks, particularly UNets, exhibit the best performance in determining mechanobiological insult magnitude and distribution, serving as a promising tool for predicting TAA determinants and gaining insight into patient risk.

Materials and methods

Generation of synthetic data

Computational studies on TAA have investigated the propensity of multiple localized perturbations in aortic mechanics and mechanobiology to initiate and propagate dilatations in the aorta, including loss of elastic fiber integrity [20,24,25], reduced collagen cross-linking [26,27], dysfunctional mechanosensing [4], and impaired mechanoregulation of collagen fibers [28]. While these effects have largely been studied individually, with some recent effort devoted toward combining them to capture experimental observations [23,29], neural network models have yet to be trained on data comprised of these interacting contributors. We leverage our existing pipeline described elsewhere [18,30–33], built upon the notion of mechanobiological homeostasis and its loss in the aorta, with key extensions detailed below. Further information may be found in S1 Supporting information.

Contributing insults.

We focus on two primary classes of contributing factors to TAA [17], namely reduced integrity of elastic fibers in the aortic wall and compromised mechanosensing of intramural cells responsible for modulating the mechano-response of the aorta. Both effects have been shown to play critical roles in aneurysmal progression, leading to localized dilatation, abberant elastic energy storage, and increased circumferential stiffness [18,23,34]. As in previous work, insults to elastic fibers take the form of prescribed reductions in the stiffness of the elastin-dominated extracellular matrix (Table A in S1 Supporting information), particularly impacting the energy storage capability of the aorta required for the windkessel effect in normal function. Mechanosensing insults are characterized by scaling the deviation in the intramural stress $[eqn]$ that is “sensed” by aortic cells, where σ is a scalar measure and $[eqn]$ is the homeostatic “set-point” that the cells work to maintain by modulating cell and matrix turnover in the aortic wall. Specifically, a coefficient $[eqn]$ is introduced in the expression to give $[eqn]$ (with $[eqn]$ being perfect mechanosensing in the normal aorta), capturing impairment of the ability of the aorta to effectively modify the surrounding matrix constituents in response to elevated intramural stress [12,18,23].

FE simulation and dilatation/distensibility maps.

To generate synthetic training data, we employed a well-established computationally efficient FE model for determining the long-term, mechanobiologically equilibrated evolution of TAAs [18], with baseline parameters estimated to reproduce the in vivo behavior of a non-dilated ascending aorta of a mouse model of Marfan syndrome under normotensive conditions [23]. We used our recently developed method for mapping spatially heterogeneous perturbations generated via Gaussian random fields to produce randomly distributed insult profiles $[eqn]$ (Fig 1a), which indicate the local normalized severity of mechanobiological compromise throughout the initial aortic domain. Each profile defines spatially varying multi-contributor insults by superimposing defects in elastic fiber integrity $[eqn]$ and mechanosensing $[eqn]$ (Fig 1b). Each contributor pair was provided as input to the FE TAA simulation (Fig 1c) to compute the steady-state evolved geometry, at which point the G&R evolution was halted, and we simulated changes in luminal pressure over a cardiac cycle. Dilatation d was defined as the local inner radius r from the aortic centerline normalized by the average inner radius at the vessel ends, and distensibility $[eqn]$ was defined as the normalized change in inner radius between systolic (S) and diastolic (D) loading (Fig 1d, cf. [17]), both of which can be computed from a cardiac gated medical image. The magnitudes of the insult contributors $[eqn]$ were constrained to produce maximum dilatations of approximately 1.5 (defined as aneurysmal), with closely matched dilatation values across all insult combinations to focus attention on detecting the underlying pathologic mechanism at the time that a dilatation first reaches aneurysmal status, a critical time in clinical treatment planning (Fig A in S1 Supporting information displays the differential effects of a wider range of combinations of $[eqn]$ on dilatation and distensibility). We note that, although TAA growth was simulated under systolic loading, we selected maps of dilatation at diastolic pressure for training data, with the aim to emulate what can easily be computed from a standard clinical image (tending to capture anatomy around end-diastole). We generated 100 unique spatial distributions (Fig B in S1 Supporting information), assigning 5 combinations of compromised elastic fiber integrity and mechanosensing to each profile, yielding 500 final dilatation-distensibility map pairs.

Synthetic data generation pipeline.(a) Insult distributions along the circumferential (θ) and axial (z) directions are randomly generated with Gaussian random field (GRF) methods to define normalized insult profiles (ϑ(θ,z)∈[0,1]) that are mapped to the initial loaded aortic geometry to define the insult region (see S1 Supporting information for further details). (b) For each profile, ϑ* delineates multiple cases of mechanobiological insults defined by combinations of compromised integrity of elastic fibers (ϑce(θ,z)∈[0,0.48]) and dysfunctional mechanosensing (ϑδ(θ,z)∈[0,0.28]) as inputs to (c) the nonlinear FE simulation to compute the long-term evolved state of the TAA. (d) Maps for dilatation (d) and distensibility (𝒟) are obtained from the final geometry under multiple in vivo loading conditions (diastolic and systolic pressures), either processed as heat maps or converted to 8-bit grayscale maps, to serve as training data for the neural networks. ϑce+ϑδ combinations are constrained to produce closely matched maximum dilatations (dmax≈1.5) for each case to focus attention on detecting the underlying pathologic mechanism at the time when a dilatation first reaches aneurysmal status.*

FE simulation and dilatation/distensibility maps.

Numerical experiments

Knowledge of the underlying mechanical and mechanobiological insults for a given TAA could provide valuable insight into aortic vulnerability and optimal interventions. With this goal of predicting the relative contributions of both initiating insults based only on imaging-derived quantities, we compared several candidate neural operator architectures (two forms of a Deep Operator Network, UNet, and Laplace Neural Operator, described below) and input data formats to establish a standard for handling subject-specific dilatation and distensibility information.

Input data formats.

Dilatation only vs. dilatation and distensibility.

In order to minimize radiation dose or imaging time, information is typically not acquired at multiple phases over the cardiac cycle; thus, only one image per clinical time point is available. Analogous to the current standard for surgical evaluation, we compared the performance of each network in predicting elastic fiber integrity and mechanosensing insults when trained only on dilatation maps (clinical standard) versus both dilatation and distensibility maps (requiring data over a cardiac cycle).

Heat maps vs. grayscale maps.

We recently developed competing data preprocessing methods for constructing dilatation and distensibility maps. We interpolated d and $[eqn]$ , evaluated on a nodal basis from the FE simulations in the θ–z domain, to $[eqn]$ uniform grids either as the physical quantities (here named “heat maps”) or after normalizing and converting to 8-bit integer ([0 255]) intensity fields (here named “grayscale maps”), with dilatation maps sharing a preset range across all cases and distensibility maps each sharing a separate range across all cases. This approach was taken to allow straightforward integration with diverse neural operator architectures (discussed below), where heat maps are well-suited for feed-forward neural networks and grayscale maps for convolutional networks, as done previously [17]. Importantly, we also sought to evaluate our former best-performing architecture, based on grayscale map inputs, against alternative network designs.

Network architectures.

Deep Operator Network (DeepONet).

DeepONet consists of branch networks to encode the input data and a trunk network to define the output domain, allowing a resolution-independent representation of input and output functions [19]. Building on our previous framework, we compared the performance of two DeepONets that differ in branch net architecture, with $[eqn]$ dilatation and distensibility maps serving as inputs, encoded via either fully connected convolutional neural networks (CNNs) (Fig 2a) or feed-forward neural networks (FNNs) (Fig 2b). In both DeepONets, the trunk net received initial positions of the aortic domain $[eqn]$ as input (as cylindrical coordinates), using an FNN. We defined separate solution operators for each insult contributor that served as inputs to the combined loss function with a single set of trainable parameters (mean squared error), minimization of which allowed estimation of the insult profiles for elastic fiber integrity and mechanosensing.

Schematic representation of two DeepONet architectures.Each branch net is a (a) CNN or (b) FNN that embeds the dilatation (d) and distensibility (𝒟) maps as either grayscale or heat maps. Each trunk net takes the coordinates {θ^,z^} to define the output domains of the corresponding insult contributor. The solution operators for each insult (𝔾θi (i=ce,δ)) are formed from element-wise dot products of the outputs of the branch and trunk networks, with shared learnable parameters (θ). Minimization of the loss function (Lθ), defined as the combination of both operator outputs, determines the optimal parameters that enable estimation of the insult profiles and contributors (ϑi^).

UNet.

UNets are U-shaped fully convolutional neural networks, originally developed for biomedical image segmentation applications [35]. UNets are also the building blocks of the diffusion models [36] used in generative AI packages like DALL.E [37]. The extensive applications of UNets are rooted in their ability to learn the mapping from input to output signals through latent representations at varying degrees of coarseness, with coarser representations responsible for learning low-frequency components of the solution and finer representations responsible for high-frequency components. The multigrid representation learning makes the UNet effective in extracting spatiotemporal correlations entangled in the solutions of PDEs [39,40], subsequently leading to widespread applications from materials science [41] to turbulence modeling [42]. We provided the dilatation and distensibility information as $[eqn]$ inputs to a UNet that learned the mapping to the corresponding insult profiles (Fig 3).

Schematic representation of the UNet architecture.Maps of dilatation (d) and distensibility (𝒟) are encoded via successive layers of two-dimensional convolution (Conv2D), group normalization, and Gaussian Error Linear Unit (GELU) activation. Down- and up-sampling the input by factors of 2 is achieved through two-dimensional max-pooling operations (Maxpool2D) and two-dimensional transpose convolutional operations (Conv2DTranspose), respectively. Finally, skip connections are implemented to propagate information from earlier layers.

Laplace Neural Operator (LNO).

The Laplace Neural Operator is an architecture that performs operator learning in the Laplace domain for solving ordinary and partial differential equations [43]. This architecture leverages the solution of PDEs represented by the integral of Green’s function and the kernel integral is transformed and calculated in the Laplace domain. A key innovation in LNO is its Laplace layer, which employs an analytical pole-residue operation to establish a physically interpretable and meaningful mapping between the input and output functions in the Laplace domain. By independently learning the steady-state response, transient response with zero initial conditions, and transient response under nonzero initial conditions, LNO effectively captures true system dynamics. This enables it to achieve better approximation accuracy compared to other neural operators for extrapolation circumstances and dynamical systems. In this study, four Laplace layers were chosen, with width and modes for each layer being 32 and 8, respectively (Fig 4).

Schematic diagram of the LNO architecture.The dilatation (d) and distensibility (𝒟) are lifted to a higher dimension via shallow neural network 𝒫, to which Laplace layers are applied, each yielding the output u(t)=f[Kϕ(s)V(s)], where Kϕ(s) is the Laplace transform of the kernel integral transformation and V(s) is the Laplace transform of the lifted input function. Each layer contains pole-residue methods to obtain the transient and steady-state responses (utr(t) and ust(t), respectively), with residues βn, coefficients αℓ, and response poles γn and λℓ. Finally, the outputs (insult profiles) are projected back into the target dimension using the shallow network 𝒬.

Training

The 500 FE simulations were randomly categorized into 450 training and 50 testing samples. We then evaluated the prediction accuracy of each architecture-input data combination with a relative $[eqn]$ error computed over all testing samples, with separate errors for compromised elastic fiber integrity and mechanosensing. We also computed point-wise absolute errors in each predicted insult profile ( $[eqn]$ , $[eqn]$ , where $[eqn]$ is the predicted insult profile and $[eqn]$ is the ground truth) to assess the ability of each network to reproduce the insult spatial distributions.

For the DeepONets, the initial architecture, learning rate, and number of training epochs were selected based on commonly used configurations reported in prior work [17]. Building on this baseline, we conducted a grid search over activation functions, network depth, and width, while monitoring loss curves to ensure convergence and mitigate overfitting. The final architecture was chosen based on the configuration that yielded the lowest validation loss within a predefined number of epochs (200,000), resulting in an Adam optimizer with the learning rate set at 0.001. In the cases of dilatation-only training, the branch net encoding distensibility was omitted from both architectures. For the UNet, the batch size was set to 250 and trained for 100,000 epochs. Following [39], we used GELU activations [38] and a cosine-annealing-based learning rate scheduler with an initial learning rate of 0.0001. For the LNO, the dilatation and distensibility information were provided as the inputs with dimensions $[eqn]$ , and the two insults were the outputs with dimensions $[eqn]$ , where 20 is the batch size, 41 is the 2D resolution of the map, and 2 is the number of channels. Both the inputs and outputs were normalized by min-max normalization during training. The relative $[eqn]$ error between the predicted insults and the true insults was used as the loss function. We first adopted commonly used settings from similar works [43] and performed a series of experiments to ensure convergence without overfitting, yielding 10,000 epochs and an exponential decay learning rate schedule with an initial learning rate of 0.001. The best model during these 10,000 epochs was chosen as the final surrogate model. Relevant network parameters are summarized in Table 1.

Table 1: Parameters for all network architectures.Note that the number of weight updates does not correspond to the number of epochs.

Results

Effects of insult contributors on dilatation and distensibility

FE simulations of TAAs initiated from combinations of compromised elastic fiber integrity and mechanosensing equilibrated with normalized inner radius ranging from 0.973 (at the vessel ends) to 1.657 (at the apex of the insult region), with a mean maximum dilatation of 1.496 ± 0.0476. Insult profiles having a greater span in the circumferential direction generated greater levels of dilatation, consistent with previous investigations [17]. The initial non-aneurysmal aorta exhibited a distensibility of 0.05442 (i.e., 5.4%), while aneurysmal dilatation decreased distensibility within the insult region, with a mean minimum distensibility of 0.0344 ± 0.0038 (i.e., 3.4%), corresponding to a ∼37% reduction associated with the ∼50% dilatation.

We further observed differential effects on the maximum dilatation and minimum distensibility depending on the dominating contributor to the overall insult. Although all combinations of compromised elastic fiber integrity and mechanosensing resulted in significant dilatation and decreased distensibility, mechanosensing-dominated insults tended to have consistently high dilatation, while elastic fiber integrity-dominated insults corresponded to lower distensibility within the insult region. Regions opposite the location of minimum distensibility also exhibited reduced distensibility despite not directly experiencing a prescribed insult, which occurred for all combined insults considered. Representative dilatation and distensibility profiles corresponding to the full range of combined insult magnitudes can be seen in Fig A in S1 Supporting information.

Overall prediction accuracy

Following training with the same 450 simulations, the four neural operator architectures were used to predict the remaining 50 insult fields of compromised elastic fiber integrity and dysfunctional mechanosensing based on inputs of dilatation only or dilatation and distensibility, each provided as either grayscale or heat maps. Most networks were able to predict the initiating insults within 10% relative $[eqn]$ error for both elastic fiber integrity (Fig 5a) and mechanosensing (Fig 5b). In particular, CNN-DeepONet and UNet architectures consistently predicted both insult profiles with approximately 5% error, even when trained on dilatation only. However, the relative $[eqn]$ error for the LNO in dilatation only cases exceeded 15%, and the FNN-DeepONet in the dilatation only heat map case rose above 10% for both insult contributors. Table 2 shows the corresponding prediction errors.

Performance of four different neural networks in predicting insult contributors to aneurysmal dilatation.Relative ℒ2 errors are reported separately for compromised (a) elastic (eln) fiber integrity and (b) mechanosensing over all testing cases considered: dilatation (d) only in grayscale and heat map formats, and dilatation and distensibility (d & 𝒟) in grayscale and heat map formats.

Table 2: Relative ℒ2 errors for all network-input data combinations predicting combined insult contributors shown in Fig 5, evaluated over all testing cases.The best results (lowest ℒ2 error) are highlighted by boldface in both cases.

Figs 6 and 7 show representative examples of insult profile predictions for each network. Between the DeepONet designs, the FNN-DeepONet exhibited worse overall performance than the CNN-DeepONet, with the greatest difference in the dilatation-only heat map case. The CNN-DeepONet was able to achieve a similar prediction accuracy as the UNet when both were trained with dilatation only, whereas its performance appeared similarly competitive with the LNO when trained with both dilatation and distensibility. LNO, while better than both DeepONets at predicting the insult profiles when trained on both dilatation and distensibility, showed the worst prediction error in dilatation-only cases. Overall, UNet achieved the greatest prediction accuracy, particularly with dilatation and distensibility input data (Figs C and D in S1 Supporting information show additional comparisons).

Predictions from all four architectures for an elastic fiber integrity-dominated combined insult.(a) The ground truth combined insult field consisted of contributions of both compromised elastic fiber integrity (ϑce) and dysfunctional mechanosensing (ϑδ) superimposed in the FE simulation to generate the TAA with dilatation and distensibility profiles shown in (b–c). Predictions (ϑ^i) and absolute errors (ϑ^i−ϑi (i=ce,δ)) are shown for CNN-DeepONet, FNN-DeepONet, UNet, and LNO trained on (b) dilatation grayscale maps only and (c) dilatation and distensibility grayscale maps.

Predictions from all four architectures for a mechanosensing-dominated combined insult.Similar to Fig 6 except for heat map inputs. (a) The ground truth combined insult field consisted of contributions of both compromised elastic fiber integrity (ϑce) and dysfunctional mechanosensing (ϑδ) superimposed in the FE simulation to generate the TAA with dilatation and distensibility profiles shown in (b–c). Predictions (ϑ^i) and absolute errors (ϑ^i−ϑi (i=ce,δ)) are shown for CNN-DeepONet, FNN-DeepONet, UNet, and LNO trained on (b) dilatation heat maps only and (c) dilatation and distensibility heat maps.

The greatest prediction error concentrated in the insult region with dilatation-only training; however, both DeepONet frameworks further yielded noticeable prediction errors away from the insult region, while UNet and LNO frameworks represented the non-insult region with high accuracy. Testing samples in which contributions of elastic fiber integrity and mechanosensing insults were more evenly balanced tended to be predicted with greater accuracy, while combined insults dominated by one factor were predicted less well.

Effects of distensibility maps

The inclusion of distensibility data in training reduced the prediction errors for all four networks by roughly a factor of 2 (Fig 5). As can be seen in Fig 6 and Fig 7, the prediction errors for both insult contributors effectively vanished upon the use of distensibility maps, especially for the high prediction errors of the FNN-DeepONet and LNO, and the network performances became comparable to one another. Nevertheless, there remained persistent small prediction errors with similar patterning in the dilatation-only cases, with DeepONet errors throughout the model domain and UNet and LNO errors clustered around the insult region.

Effects of input data format

Interestingly, use of grayscale versus heat maps for training did not significantly impact the predictions of the networks based on CNNs (Fig 8a–8b) or the LNO; for most networks, we observed only modest gains in prediction accuracy when using heat maps, and UNet performance was virtually the same regardless of data format (Fig 8e–8f). Yet, heat maps slightly improved the performance of the FNN-DeepONet in dilatation-distensibility cases (Fig 5), but their use in dilatation-only cases resulted in worse prediction errors (Fig 8c–8d).

Effects of grayscale versus heat map data inputs.All networks were trained on dilatation (d) maps only. (a) Ground truths for elastic fiber integrity and mechanosensing contributions. Predictions (ϑ^i) and computation of absolute errors (ϑ^i−ϑi (i=ce,δ)) are shown for (b) CNN-DeepONet, (c) FNN-DeepONet, (d) UNet, and (e) LNO.

Discussion

In previous work, we established the use of DeepONets comprised of CNN-based branch networks to estimate insults to the aortic wall resulting from either compromised elastic fiber integrity or dysfunctional mechanosensing based on dilatation and distensibility information. We found that, for single-contributor insults with arbitrary spatial distributions, full-field measurements of dilatation and distensibility in grayscale map format led to more robust predictions compared to sparse sensor point grids, demonstrating the gain in prediction accuracy with two-dimensional knowledge of the geometry and mechanical properties of the aneurysmal aorta rather than measuring only at sparse locations around the point of maximum dilatation [17]. Herein, we extended this prior approach by generating new synthetic data that superimposes multiple insults to both mechanical properties and mechanobiological processes, while contrasting multiple neural network architectures and formats for data input. Combined with parameterization of the initial model to a diseased murine ascending aorta, this allowed the synthetic data to further close the gap between hypothetical in silico and in vivo cases. Moreover, to bracket the potential loss in predictive accuracy when the gold standard of input data cannot be reached, we investigated an intermediate case in which only dilatation measurements were available to train the model. Finally, with recent advancements in neural network applications to biomedical image analysis, we assessed the performance of UNets and LNOs as alternative approaches in performing the same tasks.

UNet as the preferred architecture for estimating combined insults

Reasonable predictions of combined insults were achieved with all networks considered in this investigation. Our previous DeepONet-based architecture continued to perform well for multi-contributor TAAs, including an additional framework comprised of FNNs, confirming our initial choice of network. Nevertheless, these architectures were still prone to modest prediction errors throughout the model domain, even far from the actual insult region (this can be seen in Fig E in S1 Supporting information, where despite the good predictions achieved by all networks, spatial distributions of error varied widely by network type). Alternatively, both UNet and LNO predictions consistently captured the boundaries of the insult region with much greater accuracy than the DeepONets, owing to their ability to capture low-order and high-order features in the input data (in contrast, fully connected CNNs are considered better with texture mapping than with feature detection in encoding images). Even significant UNet and LNO errors remained concentrated at the insult region. This difference is especially prominent in dilatation-only testing samples in which all networks yielded poor predictions (Fig 6). LNO performed the worst in these scenarios, though this is due to incorrectly estimating the magnitude of insult contributors and not because of failing to correctly identify the spatial distribution of the insult. We emphasize that, for assessing the functional state of the aorta, correct prediction of the insult magnitudes is paramount, especially if the clinical focus is the region of greatest dilatation.

Because predictions away from the aneurysm apex may be unrealistic to validate in clinical cases, we also reevaluated the error estimates on only the regions where the normalized insult was 50% or above ( $[eqn]$ ), concentrating on the most affected regions in each testing sample. Even after this filtering, the overall performances of each network exhibited the same trends as when evaluated on the whole domain, with one exception that UNets trained on heat maps rather than grayscale maps became the best performing architecture (Table B in S1 Supporting information). Overall, these results suggest for dilatation-only scenarios, both CNN-DeepONet and UNet could be appropriate choices, with the caveats of small, diffuse prediction errors arising from the DeepONet and the requirement that input data be interpolated onto square domains. In dilatation-distensibility cases, UNet and LNO outperformed the DeepONet-based frameworks in predicting the combined insult profiles, with UNet exhibiting the best prediction accuracy overall regardless of the input data format. Therefore, for predictions based on two-dimensional full-field maps, UNet is the preferred choice due to its accuracy and multi-scale processing capabilities, even if distensibility is not available.

Importance of distensibility in mixed insult data

We observed correlations between maximum dilatation and minimum distensibility depending on the relative contributions of elastic fiber integrity and mechanosensing insults. All combined insults decreased distensibility, though arising from different causes. In the case of compromised elastic fiber integrity, the prescribed reduction in mechanical properties of the elastin-dominated extracellular matrix resulted in both a transferal of load from the more compliant elastin to the stiffer collagen as well as further stimulation of production of collagen fibers; this double-hit contributed to the decreased distensibility. On the other hand, dysfunctional mechanosensing left the properties of the elastic fibers intact while governing the turnover of smooth muscle cells and especially collagen fibers to stiffen the wall. Being that elastic fibers play a dominant role in the ability of the aorta to distend with blood pressure changes, it is not surprising that the most dramatic effects are observed in elastic fiber integrity-dominated insults. This can be seen in Fig F in S1 Supporting information, in which synthetic TAAs that exhibit mostly the same average dilatation distributions fall into distinct distensibility patterns as a function of elastic fiber loss. We hypothesize that this differentiation in distensibility profile is what allows the networks to achieve accurate predictions when provided distensibility maps in either grayscale or heat map formats, as well as why dilatation-only cases proved more difficult to predict well.

Finally, examining the computed stress distributions within the synthetic data reveals dramatically different profiles depending on the dominating contributor (Fig G in S1 Supporting information). In particular, mechanosensing-dominated combined insults associated with significantly higher circumferential ( $[eqn]$ ) and axial ( $[eqn]$ ) stresses within the aneurysmal region. Estimating the intramural shear stress as $[eqn]$ , which could serve as an analog for understanding vulnerability to dissection, is shown to be sensitive to mechanosensing dysfunction. This underscores that the mechanisms driving aortic dilatation may have a profound impact on the mechanical stability of the aneurysm while presenting nearly identically based on geometry alone. We thus reiterate that the ability to distinguish underlying contributors to TAA may provide critical insight into the potential mechanical strength of the aortic wall within the aneurysmal region.

Limitations

This study focused on combined insults of compromised elastic fiber integrity and dysfunctional mechanosensing in a model of a single initial geometry and material parameter set. Other studies suggest that additional effects could be considered, including altered mechanoregulation of matrix and compromised collagen cross-linking, in the synthetic data. We submit nonetheless that this work constitutes a viable potential pipeline demonstrating that dilatation and distensibility maps are essential to accurately estimate superimposed insults. Additionally, future work will need to incorporate increased variability in vessel dimensions, material properties, and loading conditions from diverse experimental investigations.

Next, the FE simulation platform used a computationally efficient implementation of equilibrated aortic growth and remodeling, yielding TAA geometries considered to be mechanobiologically stable, meaning no further growth of the TAA would be expected. Although this may be appropriate for a portion of TAA patients, these synthetic datasets do not address a TAA that would undergo unstable growth or develop a dissection or rupture. Furthermore, uniform cylindrical initial geometries were used to generate dilatation and distensibility maps; while effort was made to simulate TAAs of appropriate size for the initial vessel, the curvature of the ascending aorta was not considered. This simplification in original geometry allowed us to assess effects of combined insults, the utility of possible down-sampling, and so on, while targeting a level of dilatation (1.5-fold increase) that is clinically important independent of effects of curvature of the ascending aorta. Therefore, future work will need to consider both stable and unstable growth of TAAs based on combined insults applied to image-based aortic geometries, in which the UNet and LNO will be expected to have particular relevance. Looking to the future, we note that, if provided synthetic data of a longitudinal, time-evolving nature, different performances for these networks may reveal an alternative preferred choice of network, given that LNO is better suited for temporal applications and DeepONet is not as well suited.

Finally, there remains a compelling need to adapt this synthetic data pipeline to incorporate experimental data from actual human TAAs. With recent advancements in rapid three-dimensional reconstruction of the aorta from medical images [44], obtaining biomechanical properties from human samples [45], and using machine learning surrogate models to augment or complement high-cost biomechanical simulations [46], the potential to generate robust training data from clinically relevant growth and remodeling models may soon be a reality, at which point these pre-trained neural operators can help improve clinical decision-making.

Conclusion

In summary, using machine learning approaches for accurate prediction of mechanobiological insults that give rise to similarly sized TAAs promises to better discriminate mechanical vulnerability, thereby helping to inform subject-specific clinical treatment planning. Toward this end, we performed a systematic comparison of multiple neural networks in predicting the spatial distribution of synthetic multi-contributor TAA initiators when trained on dilatation information alone versus both dilatation and distensibility. Our findings highlight that:

Full-field measurement of the ascending aortic domain facilitates localized assessment of the underlying aortic insult, advancing beyond the current clinical standard of evaluating a singular maximum aortic size.Localized dilatation fields have substantial predictive value in estimating mechanisms of TAA progression; however, corresponding knowledge of local distensibility is essential to accurately determine the relative contributions of multiple disease drivers.UNet-based neural networks are an efficient and promising tool for performing these predictions based on image-derived aortic quantities.

Ultimately, as progress continues in machine learning-assisted automatic segmentation of medical images and generation of biomechanical FE models, the prospect of obtaining these scalar-field measurements in a rapid, patient-specific manner could yield a minimally invasive, image-based approach for improved TAA risk assessment and management.

Supporting information

S1Supporting informationSupplemental Tables A & B and Figs A–G.(PDF)

Bibliography44

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1Senser EM, Misra S, Henkin S. Thoracic aortic aneurysm: a clinical review. Cardiol Clin. 2021;39(4):505–15. doi: 10.1016/j.ccl.2021.06.003 34686263 · doi ↗ · pubmed ↗
2Kim JB, Spotnitz M, Lindsay ME, Mac Gillivray TE, Isselbacher EM, Sundt TM. Risk of aortic dissection in the moderately dilated ascending aorta. Journal of the American College of Cardiology. 2016;68(11):1209–19. doi: 10.1016/j.jacc.2016.06.02527609684 · doi ↗ · pubmed ↗
3Mansour AM, Peterss S, Zafar MA, Rizzo JA, Fang H, Charilaou P, et al. Prevention of aortic dissection suggests a diameter shift to a lower aortic size threshold for intervention. Cardiology. 2018;139(3):139–46. doi: 10.1159/000481930 29346780 · doi ↗ · pubmed ↗
4Humphrey JD, Milewicz DM, Tellides G, Schwartz MA. Cell biology. Dysfunctional mechanosensing in aneurysms. Science. 2014;344(6183):477–9. doi: 10.1126/science.1253026 24786066 PMC 4360903 · doi ↗ · pubmed ↗
5Humphrey JD, Schwartz MA, Tellides G, Milewicz DM. Role of mechanotransduction in vascular biology: focus on thoracic aortic aneurysms and dissections. Circ Res. 2015;116(8):1448–61. doi: 10.1161/CIRCRESAHA.114.304936 25858068 PMC 4420625 · doi ↗ · pubmed ↗
6Karimi A, Milewicz DM. Structure of the Elastin-contractile units in the thoracic aorta and how genes that cause thoracic aortic aneurysms and dissections disrupt this structure. Can J Cardiol. 2016;32(1):26–34. doi: 10.1016/j.cjca.2015.11.004 26724508 PMC 4839280 · doi ↗ · pubmed ↗
7Yamashiro Y, Yanagisawa H. The molecular mechanism of mechanotransduction in vascular homeostasis and disease. Clin Sci (Lond). 2020;134(17):2399–418. doi: 10.1042/CS 20190488 32936305 · doi ↗ · pubmed ↗
8Creamer TJ, Bramel EE, Mac Farlane EG. Insights on the pathogenesis of aneurysm through the study of hereditary aortopathies. Genes (Basel). 2021;12(2):183. doi: 10.3390/genes 12020183 33514025 PMC 7912671 · doi ↗ · pubmed ↗