Stability-preserving model order reduction for linear stochastic Galerkin systems
Roland Pulch

TL;DR
This paper investigates methods to preserve stability in model order reduction of linear stochastic Galerkin systems, ensuring reliable reduced models for high-dimensional uncertainty quantification in engineering applications.
Contribution
It compares two stability-preserving transformations for Galerkin systems, analyzing their properties and demonstrating their effectiveness through numerical examples.
Findings
Both transformation techniques effectively preserve stability.
Numerical results show accurate reduced models for mechanical and electrical systems.
Different properties of the two methods influence their numerical performance.
Abstract
Mathematical modeling often yields linear dynamical systems in science and engineering. We change physical parameters of the system into random variables to perform an uncertainty quantification. The stochastic Galerkin method yields a larger linear dynamical system, whose solution represents an approximation of random processes. A model order reduction (MOR) of the Galerkin system is advantageous due to the high dimensionality. However, asymptotic stability may be lost in some MOR techniques. In Galerkin-type MOR methods, the stability can be guaranteed by a transformation to a dissipative form. Either the original dynamical system or the stochastic Galerkin system can be transformed. We investigate the two variants of this stability-preserving approach. Both techniques are feasible, while featuring different properties in numerical methods. Results of numerical computations are…
| dimension | 11400 |
|---|---|
| number of outputs | 1140 |
| # non-zeros in | 13110 |
| # non-zeros in | 50958 |
| spectral abscissa of | |
| spectral abscissa of | 42.69 |
| # nodes in quadrature | 10 | 20 | 30 | 40 | untransf. | |||||
|---|---|---|---|---|---|---|---|---|---|---|
| # stable ROMs | 79 | 91 | 96 | 100 | 38 | |||||
| computation time (s) | 19.0 | 38.0 | 56.8 | 75.6 | – |
| nnz in | nnz in | |||
|---|---|---|---|---|
| untransformed Galerkin system | (0.01%) | (0.04%) | ||
| Galerkin system in (ii) | (23.1%) | (23.1%) |
| computation time (s) | |
|---|---|
| Arnoldi method for original system | 2.4 |
| Arnoldi method for system in (ii) | 65.4 |
| technique (i) | 75.6 |
| technique (ii) | 2591.9 |
| technique (iii) | 0.1 |
| dimension | 6900 |
| number of outputs | 300 |
| # non-zeros in | 21912 |
| # non-zeros in | 21912 |
| spectral abscissa of | |
| spectral abscissa of |
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
**Stability-preserving model order reduction
for linear stochastic Galerkin systems**
Roland Pulch
Institute of Mathematics and Computer Science, University of Greifswald,
Walther-Rathenau-Str. 47, D-17489 Greifswald, Germany.
Email: [email protected]
Abstract
[TABLE]
1 Introduction
Numerical simulation of mathematical models represents the main issue in scientific computing. We consider linear dynamical systems, which play an important role in mechanics and electrical engineering, for example. Furthermore, uncertainty quantification becomes more and more relevant in many fields of applications, see [17], for instance. A common approach is to replace uncertain parameters by random variables, see [37, 39]. Statistics of the stochastic model can be computed by sampling methods or quadrature rules. Alternatively, the stochastic Galerkin method changes the random-dependent linear dynamical system into a larger deterministic linear dynamical system.
The dimension of the stochastic Galerkin system becomes huge in the case of large numbers of random variables. Methods of model order reduction (MOR) are able to decrease the complexity. Transient solutions of a reduced system allow for an efficient numerical simulation. Several MOR methods are available for general linear dynamical systems, see [1, 2, 4, 33]. MOR of linear stochastic Galerkin systems was also examined in several previous works [9, 19, 32, 26, 27, 40]. MOR of nonlinear stochastic Galerkin systems was considered in [30].
However, even though the linear stochastic Galerkin system is asymptotically stable, the reduced Galerkin system often looses this stability in some MOR techniques. We investigate stability-preserving strategies in the case of Galerkin-type projection-based MOR like the Arnoldi method or proper orthogonal decomposition, for example. Galerkin-type MOR can be applied to any linear dynamical system (not only stochastic Galerkin systems). A dissipativity property guarantees the preservation of stability. If a general linear dynamical system does not satisfy the dissipativity property, then it can be transformed into a dissipative structure, see [7, 24, 29]. The crucial part to identify a transformation consists in the solution of a Lyapunov equation. Direct methods, see [11], or approximate methods, see [22, 23, 35, 38], yield the numerical solutions of Lyapunov equations.
We examine the stability-preserving approach in the case of linear stochastic Galerkin systems consisting of ordinary differential equations. Several variants are feasible. The high-dimensional Galerkin-projected system is transformed or, vice versa, the original systems are transformed followed by a Galerkin projection. We analyze the two strategies and another variant.
In addition, network approaches produce models consisting of differential-algebraic equations in industrial applications. Thus we extend the stability-preserving techniques to this class of problems. The Lyapunov equations have no solution now. Therefore, we use a regularization technique, which was also employed in [20].
We apply the analyzed techniques to mathematical models of two test examples: a mass-spring-damper system and an electric circuit of a band-pass filter.
2 Stability preservation in reduction
We review a concept for stability preservation in Galerkin-type projection-based MOR for general linear dynamical systems.
2.1 Linear dynamical systems
We consider linear dynamical systems in the form
[TABLE]
with constant matrices , , and . The state variables or inner variables are . Inputs are supplied to the system. The outputs are defined as quantities of interest (QoI). Initial value problems are given by .
If the mass matrix is non-singular, then the system (1) consists of ordinary differential equations (ODEs). If the mass matrix is singular, then the system (1) represents differential-algebraic equations (DAEs). Furthermore, we assume that the system satisfies the following stability condition.
We specify some common notions.
Definition 1**.**
Given a matrix pencil with , the set of eigenvalues is and the spectral abscissa reads as . The spectral abscissa of a single matrix is the spectral abscissa of the matrix pencil with the identity matrix .
Definition 2**.**
A matrix pencil with is called regular, if there is (at least one) such that .
Definition 3**.**
A linear dynamical system (1) is called asymptotically stable, if the involved matrix pencil has a spectral abscissa satisfying .
The asymptotic stability implies that the generalized eigenvalue problem has only a finite number of eigenvalues, which all exhibit a negative real part. We assume that the system (1) satisfies this stability condition. In the case of ODEs, the matrix pencil is always regular, even if the system is unstable. In the case of DAEs, the asymptotic stability implies that the matrix pencil is regular.
2.2 Transfer functions and Hardy norms
The input-output behavior of a linear dynamical system (1) can be described in the frequency domain, see [1] for ODEs and [6] for DAEs.
Definition 4**.**
The transfer function of a linear dynamical system (1) is the mapping with
[TABLE]
where represents the set of eigenvalues from Definition 1.
The transfer function is always a rational function, whose poles are the eigenvalues. The regularity of the matrix pencil guarantees a finite set of poles. The magnitude of a transfer function can be measured by Hardy norms, see [1].
Definition 5**.**
The -norm of a transfer function reads as
[TABLE]
including the Frobenius matrix norm , the angular frequency , and .
Since we assume asymptotically stable linear dynamical systems (1), the transfer function is defined on the complete imaginary axis. In the case of ODEs, the -norm is always finite. In the case of DAEs, the existence of the -norm is not guaranteed. Nevertheless, the -norm is quite often finite for DAEs of index 1 or 2. The norm (3) may also exist for unstable systems, if there is no pole on the imaginary axis.
2.3 Projection-based model order reduction
Projection matrices of full rank are specified with . Concerning the full-order model (FOM) in (1), the reduced-order model (ROM) reads as
[TABLE]
with state variables or inner variables . Initial values are supposed. We obtain the matrices via
[TABLE]
which is also called a Petrov-Galerkin-type MOR. A Galerkin-type projection-based MOR is characterized by , where just one projection matrix has to be determined. Important examples are the one-sided Arnoldi method and the proper orthogonal decomposition (POD), see [1].
Each linear dynamical system is described by a transfer function (2) in the frequency domain. The difference between the transfer functions of FOM and of ROM quantifies the error of the MOR. Hardy norms like the -norm from Definition 5, for example, can be applied to their transfer functions. However, a small error in the frequency domain implies a small error in the time domain only if both systems are asymptotically stable. It holds that
[TABLE]
with the maximum vector norm and the -norm in time provided that all initial values are zero, see [3]. The -norm of Definition 5 is a strong measure. Hence there are neither a priori error bounds nor cheap a posteriori error bounds available in MOR techniques. Just an approximation of the -norm in (6) can be computed posterior, where the computational effort is dominated by evaluations of the transfer function in the FOM. The balanced truncation method yields an a priori error bound in the -norm, see [1, p. 212].
2.4 Dissipative systems
In balanced truncation, see [1], the ROM (4) is always asymptotically stable provided that the system (1) is asymptotically stable. Yet the asymptotic stability may be lost in the ROM (4) within other MOR methods like Krylov subspace techniques, see [10], and POD, for example.
Using Galerkin-type MOR, the stability is guaranteed for some classes of linear dynamical systems. In the case of ODEs, we define the following type of system.
Definition 6**.**
A linear dynamical system (1) is called dissipative, if
* is symmetric as well as positive definite, and* 2. 2.
* is negative definite.*
The above condition represents a dissipativity of the matrix as shown in [21]. Other definitions of dissipative systems are used in the literature. We prove a property, which was shown for an explicit system of ODEs in [24].
Theorem 1**.**
If the linear dynamical system (1) is dissipative with respect to Definition 6, then it is also asymptotically stable as in Definition 3.
Proof:
Let be the Cholesky decomposition of the mass matrix. The system (1) is equivalent to the explicit system of ODEs
[TABLE]
with and . The symmetric part of the involved system matrix reads as
[TABLE]
It holds that for any . Thus is negative definite. Let be an eigenvalue of and be an associated eigenvector satisfying . It follows that
[TABLE]
Thus the systems (7) and (1) are asymptotically stable.
In contrast, the asymptotic stability does not imply the dissipativity of Definition 6, even if the mass matrix is symmetric and positive definite. An example will be presented in Section 6.1. Now we consider Galerkin-type MOR.
Theorem 2**.**
If the linear dynamical system (1) is dissipative, then a Galerkin-type MOR yields a dissipative reduced system (4). Hence the reduced system is asymptotically stable.
The proof can be found in [29], for example.
2.5 Transformations
The asymptotic stability of a linear dynamical system is invariant with respect to basis transformations. In contrast, if a system of ODEs is not dissipative, then it can be converted to an equivalent dissipative form by a basis transformation in the state space, see [24]. Alternatively, a basis transformation is feasible in the image space only, see [7]. We require a symmetric positive definite solution of the Lyapunov inequality
[TABLE]
which means that the matrix on the left-hand side of (8) is negative definite. If the mass matrix is non-singular and the matrix pencil satisfies the Definition 3 of asymptotic stability, then an infinite set of solutions exists.
We change the system of ODEs (1) into the equivalent system
[TABLE]
The transformed system exhibits the desired dissipativity property.
Theorem 3**.**
If the asymptotically stable linear dynamical system (1) has a non-singular mass matrix , then the transformed system (9) with satisfying (8) is dissipative in view of Definition 6.
We use a Galerkin-type MOR with a projection matrix to the transformed system (9). This approach can be written as a Petrov-Galerkin-type MOR applied to the original system (1) with matrices (5) and the projection matrix
[TABLE]
Thus we do not require to calculate the transformed system (9) explicitly. Alternatively, we compute the projection matrix (10). However, the original Galerkin-type MOR of the system (1) is not equivalent to the MOR of the system (9).
2.6 Numerical solution of Lyapunov inequality
We solve the Lyapunov inequality (8) using a Lyapunov equation
[TABLE]
including a predetermined symmetric positive definite matrix . This matrix represents a degree of freedom, because any choice yields a symmetric positive definite solution of the Lyapunov inequality. A simple admissible choice is the identity matrix . Moreover, we do not need to solve the Lyapunov equation (11) with a high accuracy, because a rough approximation often still satisfies the Lyapunov inequality (8). In [28], it was shown that any approximation of the exact solution of the Lyapunov equation (11) for with the property
[TABLE]
in some subordinate matrix norm also solves the Lyapunov inequality (8). The condition (12) is just sufficient and not necessary. However, approximations satisfying (12) may have a relative error of up to 100%, see [28], which motivates that rough estimates can solve the problem.
There are direct methods to compute a solution of (11) or a symmetric decomposition , see [11, 22]. Their computational effort is typically . In the high-dimensional case, we have to use approximate methods to decrease the computation work. The following techniques are available:
- i)
projection methods (Krylov subspace techniques, POD, etc.), see [13, 38],
- ii)
alternating direction implicit (ADI) iteration, see [15, 23],
- iii)
frequency domain integrals, see [5, 28],
and others. In the cases (i) and (ii), the methods yield an approximation with a low-rank factor (). Thus the transformation is given by a singular matrix . It follows that the mass matrix of the reduced system (4) may become singular or ill-conditioned, as shown in [29]. In contrast, the method (iii) from [28] computes the projection matrix (10), where the underlying approximation is always non-singular. However, the matrix is never computed but a matrix-matrix product with this approximation. In the frequency domain integral approach, the projection matrix has to be determined by the original linear dynamical system (1).
3 Stochastic Galerkin systems
We illustrate the concept of the stochastic Galerkin method and define our problem under investigation.
3.1 Random linear dynamical systems
We include parameters in the linear dynamical systems and obtain
[TABLE]
The matrices , , and depend on parameters . We assume that the dimension is independent of the number of parameters . The values may represent physical parameters or artificial parameters.
Thus the state variables or inner variables depend on time as well as the parameters. The inputs are independent of the parameters, whereas the outputs vary with respect to the parameters. We consider a single output () without loss of generality.
We assume that the system (13) is either an ODE for all or a DAE for all . Let the system be asymptotically stable for each parameter with respect to Definition 3. Furthermore, initial value problems
[TABLE]
are considered including a function . The initial values may be independent of the parameters. The initial values have to be consistent in the case of DAEs.
We suppose that the parameters in the linear dynamical system (13) are affected by uncertainties. A common approach is to substitute the parameters by independent random variables on a probability space with event space , sigma-algebra and probability measure , see [37, 39]. We apply traditional probability distributions like uniform, Gaussian, beta, etc. Hence a joint probability density function is available. This approach yields a stochastic model.
A measurable function depending on the random variables exhibits the expected value
[TABLE]
provided that the integral is finite. The expected value (15) implies the inner product
[TABLE]
for two functions in the Hilbert space
[TABLE]
The associated norm is as usual.
3.2 Polynomial chaos expansions
In most cases, a complete orthogonal basis of polynomials exists. These multivariate polynomials are the products
[TABLE]
where is the family of univariate orthogonal polynomials with respect to the th random variable. The degree of is exactly . Let this basis also be normalized. Each traditional probability distribution implies its own family of orthogonal basis polynomials, see [39]. The basis is complete in the case of uniform, beta, and Gaussian distribution, for example. Yet the polynomials do not span the complete Hilbert space (17) in the case of a log-normal distribution, see [8].
We assume that the QoI of the system (13) is in the space (17) for each time point. If the parameter domain is compact, then the continuity of the QoI is sufficient for belonging to (17) pointwise in time. If the parameter domain is unbounded, then integrability conditions have to be satisfied with respect to the probability distribution. Consequently, the QoI can be expanded into the series
[TABLE]
with coefficient functions , which is called a (generalized) polynomial chaos expansion (PCE). The series (19) converges in the norm of (17) pointwise in time.
Likewise, the state variables exhibit the PCE
[TABLE]
with coefficient functions , provided that each state variable is in the Hilbert space (17).
We assume that the first basis polynomial is the unique constant polynomial . The orthonormality of the basis functions imply the formulas
[TABLE]
for the expected value and the variance of the QoI in each time point.
3.3 Stochastic Galerkin method
We truncate the series (19),(20) to a finite sum. Typically, all basis polynomials up to some total degree are included. There is a bijective mapping between the integers and the multi-indices with the number of random parameters. In view of (18), we obtain the finite index set . The cardinality of this index set is
[TABLE]
If the number of random variables is large, then the number of basis polynomials becomes huge even for moderate degrees, say . Figure 1 illustrates the growth of the number of basis polynomials.
Without loss of generality, the truncated series read as
[TABLE]
Inserting the approximations (21) into the linear dynamical system (13) yields a residual. The Galerkin approach requires that the residual is orthogonal to the space spanned by the basis polynomials with respect to the inner product (16). Basic calculations produce a larger coupled linear dynamical system
[TABLE]
for and . Hence we obtain a linear dynamical system of dimension with outputs. The matrices exhibit the sizes , , and . To define the matrices, we introduce the auxiliary arrays
[TABLE]
It follows that
[TABLE]
using Kronecker products. Therein, the probabilistic integration (15) operates separately in each component of the matrices. Often it holds that . More details on the stochastic Galerkin method for linear dynamical systems are given in [25, 32].
The mass matrix may be singular even though all matrices are non-singular due to properties of the spectrum, see [36]. However, this loss of invertibility hardly occurs. Yet the mass matrix may become ill-conditioned. We assume that is always non-singular in the case of ODEs (13). Likewise, the stochastic Galerkin system (22) may be unstable even though the systems (13) are asymptotically stable for all parameters. Academic examples are given in [31]. Again this loss of stability is hardly observed in practice. Furthermore, the passivity of stochastic Galerkin systems was investigated for models of electric circuits in [16].
Initial conditions are derived from the initial values (14) of the original dynamical system (13) by an own truncated PCE. If the initial values (14) are identical to zero, then the choice is obvious. The approximation of the QoI reads as
[TABLE]
where the outputs of the stochastic Galerkin system (22) yield the coefficients.
If the entries of the matrices are polynomials depending on in the system (13), then the matrices (24) of the stochastic Galerkin system can be calculated analytically. Consequently, we obtain the matrices exactly (except for round-off errors) independent of the number of random variables. In contrast, stochastic collocation techniques, which are non-intrusive methods, cf. [37, 39], induce a quadrature error or sampling error. This error typically grows for fixed numbers of collocation points and increasing dimensions . Although the stochastic Galerkin approach represents an intrusive method, the effort of coding the algorithms is not extensive, because just constant matrices have to be specified for linear time-invariant systems.
We prove a property, which will be used later.
Lemma 1**.**
If the matrices are symmetric and positive definite for almost all , then the stochastic Galerkin projection is also symmetric and positive definite.
Proof:
The Galerkin-projected matrix consists of the blocks for , see (24). Hence the symmetry is obvious. Let . We obtain
[TABLE]
because the integrand is almost everywhere non-negative in the probabilistic integration (15). The basis functions are linearly independent. Thus implies that the above finite sum is non-zero on a subset satisfying for the probability measure . It follows that for .
Likewise, this relation applies to the case of negative definite matrices.
4 Stability preservation
We investigate three strategies to preserve the asymptotic stability in an MOR of a stochastic Galerkin system. Figure 2 illustrates different possibilities.
4.1 Transformation of stochastic Galerkin system
This approach follows the steps (c),(b),(f) in Figure 2. If the linear dynamical systems (13) are asymptotically stable for all parameters, then the stochastic Galerkin system (22) is usually asymptotically stable. However, if the system (22) is not dissipative, then stability may be lost in some MOR methods by step (e). In this case, we can transform the system to a dissipative form as demonstrated in Section 2.5 and Section 2.6. Consequently, the reduced system is stable due to Theorem 2. Remark that we do not have to calculate the transformed matrices of the high-dimensional system explicitly. Just an appropriate projection matrix has to be determined via (10).
In this approach, the critical part is the solution of the high-dimensional Lyapunov equation (11). A direct method of linear algebra would require operations. Thus we are restricted to approximate methods or iteration schemes. The possibilities are listed in Section 2.6.
4.2 Transformation of parameter-dependent system
Now the succession (a),(d),(f) is performed with respect to Figure 2. Consequently, we transform the linear dynamical systems (13) first.
4.2.1 Transformation
The stochastic Galerkin system is not always asymptotically stable, even if all parameter-dependent systems (13) are asymptotically stable. However, we obtain a positive result in the case of dissipativity.
Theorem 4**.**
If the linear dynamical systems (13) are dissipative for almost all , then the stochastic Galerkin system (22) is also dissipative. Consequently, a Galerkin-type reduction of (22) yields an asymptotically stable system.
Proof:
Lemma 1 implies that the mass matrix is symmetric and positive definite. The matrix is the Galerkin projection of due to
[TABLE]
see (24). Since is negative definite for almost all , Lemma 1 shows that is negative definite. Hence both conditions of Definition 6 are satisfied. Theorem 2 guarantees the stability of a reduced system.
If the original systems (13) are not dissipative for almost all , then we transform the systems appropriately. The following transformation has to be applied for almost all simultaneously (even if some system is already dissipative) to preserve continuity and smoothness of the functions.
We obtain the parameter-dependent Lyapunov equation
[TABLE]
for . Although the matrix may depend on the parameters, we choose a constant matrix, because no parameter-aware strategy with benefits is known yet. The Lyapunov equations (26) yield a unique family of symmetric positive definite matrices.
Furthermore, the stochastic Galerkin system (22) obtained by step (c) is not equivalent to a stochastic Galerkin system obtained by steps (a),(d). On the one hand, the stochastic Galerkin method is invariant with respect to transformations by constant non-singular matrices. On the other hand, we use the parameter-dependent matrix in the transformation (a) to the dissipative form (9).
4.2.2 Polynomial system matrices
Now we assume that the matrices of the system (13) involve only polynomials in the variable , which is often given in practice. Thus the matrices of the stochastic Galerkin system (22) can be calculated analytically in the case of traditional probability distributions. We do not consider the output matrix , because it is not transformed. However, the matrix-valued function satisfying (26) consists of rational functions in the variable . In the case of low dimensions , the function can be calculated explicitly by a computer algebra software. Alternatively, the solution can be evaluated for a finite set of parameters .
We require the Galerkin projection of the transformed matrices
[TABLE]
in the dissipative system (9). The entries of these transformed matrices are rational functions of . A quadrature rule yields numerical approximations of the matrices . Let be the nodes and be the weights. The approximation of the blocks in reads as
[TABLE]
for . Using the auxiliary array (23), the approximation becomes
[TABLE]
Likewise, the quadrature yields and . The computational effort is dominated by determining numerical solutions of the Lyapunov equations (26).
Lemma 2**.**
If all weights are positive, then a quadrature of kind (27) yields approximations , where is symmetric positive semi-definite and is negative semi-definite.
Proof:
The symmetry of is obvious. Let with and
[TABLE]
We obtain using the notation (28)
[TABLE]
Since is positive definite and the weights are positive for each , all terms of the sum are non-negative. Hence is positive semi-definite. The negative semi-definiteness of is concluded by the same treatment.
In addition, it is very likely that one or more terms are positive in the above sum. Thus we assume that these matrices are definite. It follows that the approximate Galerkin system has the dissipativity condition of Definition 6.
Concerning the positivity of the weights, we address three classes of multivariate quadrature schemes or sampling methods, which can all be found in [37]:
- i)
tensor-product formulas: Univariate quadrature rules often exhibit exclusively positive weights like Gaussian quadrature, for example. The tensor-product formulas inherit the positivity, because their weights are the products of the weights in the univariate case.
- ii)
sparse grid quadrature: Often both positive and negative weights occur. Sparse grids with purely positive weights typically require more nodes for the same accuracy.
- iii)
(quasi) Monte-Carlo methods: In these sampling techniques, the weights are for all and thus positive.
4.2.3 Properties
The strategy of this section looks appealing in the case of low dimensions , because the Lyapunov equations can be solved cheap. However, the number of random parameters is typically large in this case, because otherwise the stochastic Galerkin system is not high-dimensional and an MOR is obsolete. A large number of random variables implies a quadrature in a high-dimensional space. Using a computationally feasible number of nodes, the quadrature error may still be too large such that the exact solution of the approximate Galerkin system yields poor approximations (25) of the random-dependent QoI. Therefore, the above approach is critical.
Furthermore, there is a major loss of sparsity in this technique. Let the entries in the matrices be polynomials of low degrees depending on in the system (13). Even if the matrices are dense, then the stochastic Galerkin system (22) exhibits sparse matrices due to the orthogonality of the basis polynomials. However, the dense matrices of the transformed systems (9) are rational functions depending on . Inner products (16) of their entries and orthogonal polynomials are non-zero and thus the matrices of (22) become dense in the Galerkin projection. In contrast to the approach of Section 4.1, we have to calculate the matrices of the alternative Galerkin system explicitly to determine a projection matrix of the MOR.
4.3 Transformation using reference parameter
Again the steps (c),(b),(f) are considered in Figure 2, where the transformation is done different from Section 4.1. We derive an additional technique to decrease the computational effort and to omit quadrature errors. A single reference parameter is selected like the mean value , for example. We directly solve the Lyapunov equation (26) only for using some matrix . The solution is symmetric and positive definite. We define the larger transformation matrix
[TABLE]
using the identity matrix and the Kronecker product. Obviously, the matrix (29) is symmetric and positive definite again. We employ this matrix to transform the stochastic Galerkin system (22) into the form (9). Since the matrix (29) is block-diagonal, matrix-matrix multiplications with are cheap. We do not need to compute the transformed system matrices. Alternatively, we directly compute the projection matrix (10), where a projection matrix is determined by the original Galerkin system.
Any random variables can be decomposed into the form . We consider the random variables
[TABLE]
with the real parameter .
Theorem 5**.**
Let the random variables in a linear system of ODEs (13) be of the form (30). There is a positive constant such that the transformation of a stochastic Galerkin system (22) using the matrix (29) yields a dissipative system of type (9) for all .
Proof:
The transformed mass matrix is symmetric and positive definite for all , since just the non-singularity of the original mass matrix is required. In the limit, we obtain
[TABLE]
due to the orthogonality of the basis polynomials. Hence the matrix becomes block-diagonal with identical blocks. The factor satisfies the Lyapunov equation (26) for the chosen positive definite matrix . The symmetric part of the matrix (31) is negative definite, because it holds that
[TABLE]
The two conditions of Definition 6 are satisfied and thus the system is dissipative in the limit. Since the limit (31) is reached continuously with respect to the parameter , it follows that a sufficiently small perturbation of still yields matrices with the required properties.
Theorem 5 shows that the stochastic Galerkin system becomes dissipative for all sufficiently small . However, this implication does not guarantee that the system is dissipative for , which reproduces our desired choice of random variables in (30). Nevertheless, the computational effort is low such that it is worth to try this approach. Even if this transformed stochastic Galerkin system is not dissipative, a loss of stability may happen less often in an MOR.
4.4 Nonlinear dynamical systems
We outline a stabilization concept for nonlinear dynamical systems. Let an autonomous system of ODEs be given in the form
[TABLE]
with a parameter-dependent mass matrix and a nonlinear smooth right-hand side . The QoI is defined by the linear or nonlinear function . Let . We assume that a family of asymptotically stable stationary solutions exists, i.e., for all . The definition of stable stationary solutions can be found in [34, p. 22]. As in [31], the system (32) is replaced by the equivalent system
[TABLE]
which exhibits the constant asymptotically stable stationary solution . Let be the Jacobian matrix of evaluated at and . The stochastic Galerkin method yields a larger nonlinear system of ODEs
[TABLE]
including the mass matrix from (24). The definition of the nonlinear function is given in [30, 31]. This function inherits the smoothness of . Now is a stationary solution of (34). Although this stationary solution may be unstable, this loss of stability hardly occurs in practice. Thus we assume that the stationary solution is asymptotically stable again. Let be the Jacobian matrix of evaluated at . It follows that the spectral abscissa of the matrix pencil is negative, see Definition 1.
A Galekin-type projection-based MOR yields a small dynamical system
[TABLE]
with a projection matrix and the mass matrix . The reduced system owns the stationary solution , which may be unstable.
Stability-preserving techniques can be derived. We consider the method of Section 4.1, where the high-dimensional Lyapunov equation (11) is solved including the mass matrix from (34) and . The Galerkin system (34) is transformed and reduced to (35), while the asymptotic stability of the stationary solution is guaranteed as shown in [29, p. 35] for general nonlinear implicit systems of ODEs. Concerning the stability-preserving approach of Section 4.2, parameter-dependent Lyapunov equations (26) are solved including from (32) and with the Jacobian matrix associated to the stationary solution of (33). This technique was used for explicit ODEs in [31]. The application of the stability-preserving method from Section 4.3 is straightforward now.
5 Differential-algebraic equations
If the linear dynamical system (1) features a singular mass matrix , then a system of DAEs is given. MOR methods are also available for DAEs, see [6]. Yet the Lyapunov equations (11) do not have a solution, which is crucial in our stability-preserving technique. Let . It follows that
[TABLE]
Choosing an in the kernel of the matrix implies and thus a contradiction to the definiteness of the matrix appears. We require an alternative strategy now.
5.1 Regularization
We apply a regularization of an asymptotically stable DAE system (1), which was also used in [20]. The asymptotic stability guarantees a regular matrix pencil as specified by Definition 2. The regularized system matrices read as
[TABLE]
introducing parameters . The matrix is non-singular for all . Choosing , it follows that the linear dynamical system (1) with is asymptotically stable for all sufficiently small parameters , see [20]. An advantage of this regularization technique is that the sparsity pattern of coincides with the sparsity pattern of for each . Hence no loss of sparsity happens in the relevant operations.
We also choose a small parameter (and ) to ensure that the difference between the original DAE system and the regularized ODE system is small. Assuming that the DAE system together with the defined outputs has a transfer function with finite -norm (3), error bounds were derived with respect to this norm in [28].
5.2 Stochastic Galerkin projection
The steps (a)-(d) refer to the flowchart in Figure 2. The following two approaches are equivalent provided that the same parameters are chosen in a regularization (36):
- i)
Regularize the parameter-dependent system (13) and then project the systems of ODEs in the stochastic Galerkin method.
- ii)
Project the parameter-dependent DAE system (13) in the stochastic Galerkin method and then regularize the Galerkin system (22).
In both cases, the stochastic Galerkin projection (c) yields the same matrices and for the system (22). The mass matrix is non-singular due to the regularization.
The transformations (a) and (b) to a dissipative representation are done only for a regularized system, since a system of ODEs is required for each Lyapunov equation. The transformation (b) involves the matrices from (i) or (ii). However, in the transformation (a), we have to regularize the systems of DAEs (13) immediately. Furthermore, the drawbacks of the succession (a),(d), see Section 4.2.3, also apply in this case.
In the transformation (b), the approximate solution of the high-dimensional Lyapunov equations (11) may become critical. Small regularization parameters nearly coincide with the singular case. This problem is less pronounced in direct methods to solve the Lyapunov equations.
We can also employ the technique from Section 4.3 in this context. The linear dynamical system (13) is regularized for a reference parameter and some . Solving the Lyapunov equation (26) including yields the high-dimensional transformation matrix (29). Now the regularized Galerkin system from strategy (ii), where the same values are used, is transformed based on (29). Again the dissipativity of the transformed representation cannot be guaranteed a priori.
6 Illustrative examples
We investigate a system of ODEs as well as a system of DAEs in this section. We computed on a FUJITSU Esprimo P920 Intel(R) Core(TM) i5-4570 CPU with 3.20 GHz (4 cores) and operation system Microsoft Windows 7. The software package MATLAB [18] (version R2018a) was used for all computations.
6.1 Mass-spring-damper system
Figure 3 depicts a mechanical configuration, which consists of 5 masses, 7 springs and 5 dampers. The single input is the excitation at the bottom spring, whereas the single output is the position of the top mass. A mathematical modeling yields a linear dynamical system (13) of ODEs of first order including parameters. The system is asymptotically stable for all positive parameters. The system matrices are affine-linear functions (polynomials of degree one) of the parameters. The Bode plot of the system is shown for a specific choice of the parameters in Figure 4. This system represents an extension of a test example used in [14], where 4 masses were included.
In the stochastic modeling, we replace the parameters by independent uniformly distributed random variables, which vary 10% around their mean values. The mean values are the constant parameters from above. In the truncated orthogonal expansion (21), we include all multivariate Legendre polynomials up to degree three. We obtain basis functions. The stochastic Galerkin system (22) is calculated exactly except for round-off errors. Table 1 illustrates the properties of this example. The system is asymptotically stable. The mass matrix is symmetric and positive definite. Yet the system is not dissipative.
We employ the one-sided Arnoldi method, see [1], to perform an MOR of the stochastic Galerkin system (22). The single real expansion point is used in this Krylov subspace technique. The reduced systems are arranged for dimensions . It follows that just 38 ROMs are stable.
Now we investigate the stabilization techniques from Section 4. The following cases are discussed:
- i)
transformation of stochastic Galerkin system (Section 4.1),
- ii)
transformation of original systems (Section 4.2), and
- iii)
transformation based on reference parameter (Section 4.3).
In (i) and (iii), we reuse the projection matrix determined for the original stochastic Galerkin system, because the Arnoldi method is invariant with respect to a basis transformation in the image space. In (ii), we repeat the Arnoldi algorithm for the alternative stochastic Galerkin system, because a smaller reduction error is expected.
We choose the identity matrix () as degree of freedom in each Lyapunov equation. In all three approaches, the stability is achieved for each ROM. Figure 5 illustrates the relative errors with respect to the -norm (3), i.e.,
[TABLE]
with the transfer functions of FOM and ROM, respectively.
In the technique (i), we use the method from [28], where an integral is discretized by a quadrature rule in the frequency domain. Therein, the solution of the high-dimensional Lyapunov equation is not computed but the associated projection matrix (10). The computation work is characterized by an -decomposition of the system matrix in each node for the angular frequency . We employ the (univariate) Gauss-Legendre quadrature. Table 2 shows the number of stable ROMs for different numbers of nodes. We observe that 40 nodes are sufficient to stabilize all reduced systems. Figure 5 (i) depicts the relative -errors (37) in this case. The error of the MOR exhibits the same magnitude as in the original stochastic Galerkin system.
In the technique (ii), we use a sparse grid quadrature of Smolyak-type with level 3 based on the Clenshaw-Curtis rule. The scheme exhibits 7209 nodes in the 17-dimensional space and negative weights arise. Table 3 illustrates the loss of sparsity in this approach. Again all ROMs become asymptotically stable. The relative errors (37) of the MOR are demonstrated in Figure 5 (ii). The errors between the ROMs and the stochastic Galerkin projection of the transformed systems, which we call the inherent errors, decay for increasing dimensions. However, the errors between these ROMs and the stochastic Galerkin system of the untransformed systems (13), which we call the comparison, are large and stagnate. This property indicates that the quadrature error is too large, even though a high number of nodes is used, because the original Galerkin system can be considered sufficiently accurate.
In the technique (iii), we define the mean value of the random variables as reference parameter. We calculate the matrices of the transformed Galerkin system to analyze the definiteness. Figure 6 depicts the maximum eigenvalue of the symmetric part of with from (29) in dependence on the parameter from (30). We observe that the negative definiteness is lost for . Although our relevant case is not dissipative, the stability is preserved in all ROMs. Moreover, the error of the MOR is not compromised.
We also measured computation times illustrated by Table 4. The ROMs of dimension are determined using the Arnoldi method and the three stability-preserving techniques. The computation time of a stabilization technique consists of all steps to calculate the reduced matrices (5) except for the determination of the projection matrix , which is done by the Arnoldi method. In the technique (i), nodes are used in the Gaussian quadrature. On the one hand, the technique (ii) causes a relatively large computational effort due to the high number of nodes in the sparse grid quadrature. Also the Arnoldi algorithm is more expensive due to the loss of sparsity in the matrices. On the other hand, the technique (iii) requires a negligible computation work, which shows that it is worth trying this approach.
6.2 Band-pass filter
We examine the electric circuit of a band-pass filter shown in Figure 7. A single input voltage is supplied and a single output voltage drops at a load conductance. Modified nodal analysis [12] yields a linear system of DAEs of dimension . This DAE system exhibits the index one. The physical parameters are 7 capacitances, 7 inductances and 9 conductances (). Figure 8 depicts the Bode plot of the system for a constant selection of the parameters. Furthermore, the system matrices are affine-linear functions of the parameters.
In the stochastic modeling, we introduce independent uniformly distributed random variables varying 20% around their mean values. The truncated series (21) include all basis polynomials up to degree two, i.e., . The stochastic Galerkin system (22) consists of linear DAEs, which have a finite -norm (3) for the defined outputs.
In the regularization (36), we choose the parameters and . Table 5 shows the properties of the regularized system. The system is asymptotically stable. Both and are not symmetric. Thus the spectral abscissa, see Definition 1, of the symmetric part of is irrelevant. Multiplication by from the left yields a system with symmetric positive definite mass matrix, which is still not dissipative.
The one-sided Arnoldi method yields the projection matrices of the MOR for dimensions . Therein, we choose the real number as expansion point. A loss of stability occurs in the reduction of both the DAE system and the regularized system, as shown in Table 6. We apply the stabilization techniques of Section 4.1 and Section 4.3 to the regularized system. The Lyapunov equations are solved using the identity matrix as input matrix. We solve the Lyapunov equations by a direct method of linear algebra. Thus a critical behavior for small regularization parameters, as mentioned in Section 5.2, is avoided. The matrices of a transformed Galerkin system are never computed explicitly. All ROMs become asymptotically stable in both approaches.
Finally, we compute approximations of the -norms for the difference between the DAE system and the reduced systems. Figure 9 illustrates the relative errors (37) of the reductions. We recognize that the errors are nearly identical in the two stabilization techniques. The errors stagnate for reduced dimensions in (ii)-(iv), because the total error is dominated by the error of the regularization in this part. Most important, the stabilization approaches do not compromise the error of the MOR.
7 Conclusions
We examined stability-preserving model order reduction of linear stochastic Galerkin systems using transformations to dissipative forms. Three approaches were analyzed. The transformation of the conventional Galerkin system represents an adequate method. The Galerkin projection of the transformed original systems features severe drawbacks in the case of large numbers of random parameters. The approach using a cheap transformation matrix based on a reference parameter is promising. Although the dissipativity property cannot be guaranteed in the transformed system, the numerical results of test examples demonstrate that preservation of stability is achieved. Moreover, the error of the model order reduction does not increase in this stabilization.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] Antoulas, A.: Approximation of Large-Scale Dynamical Systems. SIAM Publications, 2005.
- 2[2] Benner, P.; Hinze, M.; ter Maten, E.J.W. (eds.): Model Reduction for Circuit Simulation. Lect. Notes in Electr. Engng. Vol. 74, Springer, 2011.
- 3[3] Benner, P.; Gugercin, S.; Willcox, K.: A survey of projection-based model order reduction methods for parametric dynamical systems. SIAM Review 57 (2015) 483–531.
- 4[4] Benner, P.; Mehrmann, V.; Sorensen, D.C. (eds.): Dimension Reduction of Large-Scale Systems. Lect. Notes in Comput. Sci. Eng. Vol. 45, Springer, 2005.
- 5[5] Benner, P.; Schneider, A.: Balanced truncation model order reduction for LTI systems with many inputs or outputs. In: Edelmayer, A. (eds.), Proceedings 19th International Symposium on Mathematical Theory of Networks and Systems, 2010, pp. 1971–1974.
- 6[6] Benner, P.; Stykel, T.: Model order reduction of differential-algebraic equations: a survey. In: Ilchmann, A.; Reis, T. (eds.), Surveys in Differential-Algebraic Equations IV, Differential-Algebraic Equations Forum, Springer, 2017, pp. 107–160.
- 7[7] Castañé Selga, R.; Lohmann, B.; Eid, R.: Stability preservation in projection-based model order reduction of large scale systems. Eur. J. Control 18 (2012) 122–132.
- 8[8] Ernst, O.G; Mugler, A.; Starkloff, H.J., Ullmann, E.: On the convergence of generalized polynomial chaos expansions. ESAIM: M 2AN 46 (2012) 317–339.
