Computing the stochastic $H^\infty$-norm
Tobias Damm, Peter Benner, Jan Hauth

TL;DR
This paper introduces a method to compute the stochastic $H^inity$-norm of linear systems using a Riccati-type matrix equation, providing a new approach without frequency domain or Hamiltonian conditions.
Contribution
It develops a novel computational technique for the stochastic $H^inity$-norm based on a parametrized algebraic Riccati equation, extending deterministic methods.
Findings
Provides a Riccati-based algorithm for stochastic $H^inity$-norm computation.
Establishes theoretical characterization without frequency domain or Hamiltonian conditions.
Enables practical computation for stochastic control systems.
Abstract
The stochastic -norm is defined as the -induced norm of the input-output operator of a stochastic linear system. Like the deterministic -norm it is characterised by a version of the bounded real lemma, but without a frequency domain description or a Hamiltonian condition. Therefore, we base its computation on a parametrised algebraic Riccati-type matrix equation.
| 10 | 20 | 40 | 80 | 160 | |
|---|---|---|---|---|---|
| LMI | 0.11s | 0.99s | 35.72s | 2030s | - |
| Alg1 | 4.43s | 7.98s | 24.43s | 156.6s | 1156s |
| 25 | 36 | 49 | 64 | 81 | 100 | |
|---|---|---|---|---|---|---|
| LMI | 3.83s | 36.92s | 306.3s | 1810s | 8631s | - |
| Alg1 | 9.56s | 13.25s | 26.29s | 73.38s | 129.1s | 177.3s |
| 0.4724 | 0.4694 | 0.4669 | 0.4647 | 0.4628 | 0.4611 |
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMatrix Theory and Algorithms · Mathematical Analysis and Transform Methods · Stability and Control of Uncertain Systems
Computing the stochastic -norm
Tobias Damm, Peter Benner, and Jan Hauth T. Damm is with University of Kaiserslautern, Department of Mathematics, 67663 Kaiserslautern, Germany, email: [email protected]. Benner is with the Max Planck Institute for Dynamics of Complex Technical Systems, Sandtorstr. 1, 39106 Magdeburg, Germany e-mail: [email protected]. Hauth is with the Fraunhofer Institute for Industrial Mathematics (ITWM), 67663 Kaiserslautern, Germany, email: [email protected]
(March 2017)
Abstract
The stochastic -norm is defined as the -induced norm of the input-output operator of a stochastic linear system. Like the deterministic -norm it is characterised by a version of the bounded real lemma, but without a frequency domain description or a Hamiltonian condition. Therefore, we base its computation on a parametrised algebraic Riccati-type matrix equation.
1 Introduction
The -norm is a fundamental concept for asymptotically stable deterministic linear time invariant systems. It is equal to the input/output norm of a system both in the frequency and the time domain. It is used in robustness analysis and serves as a performance index in control. In model order reduction, it is an important measure for the quality of the approximation. There are very efficient algorithms for the computation of the -norm, which are based on a Hamiltonian characterization. The most widely used among these was described in [1, 2], but recent progress has been made e.g. in [3, 4, 5, 6].
A stochastic version of the -norm was introduced by Hinrichsen and Pritchard in [7]. It has a similar range of applications as its deterministic counterpart, but its numerical computation has hardly been considered in the literature. A major obstacle in transferring ideas and algorithms from the deterministic case is the lack of a suitable frequency domain interpretation or a Hamiltonian characterization in the stochastic setup.
In this note we present an algorithm to compute the stochastic -norm, based on a Riccati characterization. According to the stochastic bounded real lemma, [7], the norm is given as the infimum of all for which a given parametrized Riccati equation has a stabilizing solution. We check the solvability of the Riccati equation by a Newton iteration.
The paper is structured as follows. In Section 2 we introduce stochastic systems, define the stochastic -norm and state the stochastic bounded real lemma. We also provide a new version of the non-strict bounded real lemma and give some new bounds for the stabilizing solution, which are proven in appendix A.2. In Section 3 we describe our basic algorithm and discuss ways to make all the steps fast. In Section 4 we report on numerical experiments. In particular, we compare our algorithm with an LMI solver. To keep the notational burden low, we confine ourselves to the case, where only one multiplicative noise term affects the state vector. Our results can easily be extended to more general situations which we hint at in appendix A.1.
2 The stochastic -norm
We consider stochastic linear systems of the form
[TABLE]
where , , , , and is a zero mean real Wiener process on a probability space with respect to an increasing family of -algebras (e.g. [8, 9]).
Let denote the corresponding space of non-anticipating stochastic processes with values in and norm
[TABLE]
where denotes expectation. For initial data and input we denote the solution and the output of (1) by and , respectively.
Definition 2.1
System (1) is called asymptotically mean-square-stable, if
[TABLE]
for all initial conditions . In this case, for simplicity, we also call the pair asymptotically mean-square stable.
If is asymptotically mean-square stable, then (1) defines an input-output operator from to via , see [7]. By we denote the induced operator norm,
[TABLE]
which is an analogue of the deterministic -norm. We therefore call it the stochastic -norm of system (1).
2.1 The stochastic bounded real lemma
The norm (2) can be characterized by the stochastic bounded real lemma. To this end, we define the quadratic (Riccati-type) mapping , which depends on the parameter , by
[TABLE]
Its Fréchet derivative at some is the linear mapping given by
[TABLE]
where .
Writing and , we have
[TABLE]
The pair is asymptotically mean-square stable if and only if \sigma(\mathcal{L}_{A}+\Pi_{N})\subset\mathbb{C}_{-}=\{\lambda\in\mathbb{C}\;\big{|}\;\operatorname{Re}\lambda<0\}, e.g. [10].
Theorem 2.2
[7]** Assume that is asymptotically mean-square stable. For , the following are equivalent.
- (i)
.
- (ii)
There exists a negative definite solution to the linear matrix inequality
[TABLE]
- (iii)
There exists a negative definite solution to the strict Riccati inequality .
- (iv)
There exists a solution to the Riccati equation , such that .
Remark 2.3
A solution of the Riccati equation , with is called a stabilizing solution. If it exists, then it is uniquely defined, and is the largest solution of the inequality , see [10]. We will write for this solution. By Theorem 2.2, the norm is the infimum of all such that possesses a stabilizing solution, i.e.
[TABLE]
Under a controllability assumption we can also give a nonstrict version of Theorem 2.2 for asymptotically mean-square stable systems. We define the controllability Gramian of system (1) as the solution of
[TABLE]
If the system is stable, then is nonnegative definite, .
Corollary 2.4
Assume that is asymptotically mean-square stable and in (8). For , the following are equivalent.
- (i)
.
- (ii)
There exists a solution to the linear matrix inequality
[TABLE]
- (iii)
There exists a solution to the Riccati equation .
Moreover, if , then has a largest solution , for which .
This result is slightly stronger than [11, Proposition 9.6] or [10, Corollary 5.3.14], where it was shown that (i) implies (iii) if is controllable. In the appendix we give a new simplified proof, which can also be modified to obtain lower bounds for solutions of (11) as follows.
2.2 Inequalities for solutions of the Riccati equation
Lemma 2.5
Assume that is asymptotically mean-square stable, and . Let be the Moore-Penrose inverse of given by (8). If satisfies (11), then
[TABLE]
Note that is monotonically decreasing. Hence, if (12) is violated for some and , then cannot be a solution of (11). This bound is particularly easy to check.
Alternatively, we may compare with solutions of Riccati equations from deterministic control. Let denote the counterpart of with , i.e.
[TABLE]
Lemma 2.6
*Assume that is asymptotically mean-square stable, and .
Then the Riccati equation from the deterministic case*
[TABLE]
possesses a smallest solution , and for all solutions of (11).
3 Computation of the stochastic -norm
To exploit the characterization (7), we need a method to check, whether the Riccati equation possesses a stabilizing solution. Given the Fréchet derivative of displayed in (3), it is natural to apply Newton’s method to solve the stochastic algebraic Riccati equation from part (iv) of Theorem 2.2. The following result was proven in [11].
Theorem 3.1
Let be mean-square stable and assume that . Consider the Newton iteration
[TABLE]
where we assume . Then the sequence converges to , and for all it holds that
[TABLE]
Moreover, under the given assumptions is a suitable initial guess (see appendix for a proof).
Lemma 3.2
Let be mean-square stable and assume that . Then .
For a given , we can check whether by running the Newton iteration (14) starting from . If all iterates are stabilizing, and the sequence converges with a given level of tolerance, then we conclude that .
Conversely, if , then either for some , or the sequence is monotonically decreasing and unbounded.
If for some the condition is violated or the iteration takes more than a fixed number of steps, then we conclude that . Additionally we might test the conditions of Lemma 2.5 or Lemma 2.6 in each step and conclude that if one of them is not fulfilled. However, in all our examples only the stability condition was relevant.
Using bisection, we can thus compute up to a given precision.
3.1 The basic algorithm
We summarize this approach as our basic algorithm.
The stability test in line 5 and the solution of the linear system in line 6 are central issues. Both concern the generalized Lyapunov mapping . A naive implementation with general purpose eigenvalue and linear system solvers, respectively, would result in an overall complexity of about . About the same complexity is required for LMI-solvers. It is, however, well known that standard Lyapunov equations of the form can be solved in operations, using e.g. the Bartels-Stewart algorithm, [12]. Exploiting this in iterative approaches, we can bring down the complexity of Algorithm 1 at least to . This will be explained briefly in the following two subsections. Moreover, we suggest a way to choose and in line 1.
In the numerical experiments, we will show that our algorithm outperforms general purpose LMI methods.
3.2 The stability test
The condition in line 5 holds if and only if and , where denotes the spectral radius, [10, Theorem 3.6.1]. Hence, we can first check, whether and then apply the power method to compute the spectral radius of . Note that the mapping is nonnegative, in the sense that it maps the cone of nonnegative definite matrices to itself, see [10]. Hence, the iterative scheme
[TABLE]
produces a sequence of nonnegative definite matrices which generically converge to the dominant eigenvector. In the limit we have , i.e. .
3.3 The generalized Lyapunov equation
In the Newton step in line 6, the generalized Lyapunov equation
[TABLE]
has to be solved for to obtain . Equations of this type have been studied e.g. in [13].
Note that satisfies the fixed point equation
[TABLE]
The condition implies , where denotes the spectral radius. Hence the fixed point iteration
[TABLE]
is convergent. In each step this iteration only requires the solution of a standard Lyapunov equation. at a cost at most in . The speed of convergence can be improved by using a Krylov subspace approach like gmres or bicgstab. For details see [13]. More recently, also low-rank techniques have been considered in [14, 15, 16].
3.4 Choosing and
For the bisection it is useful to find suitable upper and lower bounds for . Let be the transfer function of the deterministic system obtained from (1) by replacing with zero. The -norm equals the input-output norm of this deterministic system. Then from Theorem 2.2 we conclude , because the inequality (6) for a given matrix implies that the corresponding linear matrix inequality with holds for the same . Hence, if , then . Therefore, we choose and try . If the Newton iteration does not converge for , then we replace by and repeat the previous step, until we have .
4 Numerical experiments
The following experiments were carried out on a 2011 MacBook Air with a 1.4 GHz Intel Core 2 Duo processor and 4 GB Memory running OS X 10.11.6 using MATLAB® version R2016b.
4.1 Random systems
We first consider random data produced by randn. The matrix is made stable by mirroring the unstable eigenvalues at . Then the spectral radius of is estimated as described in subsection 3.2 and an update of is obtained by multiplication with . Thus is guaranteed to be mean-square stable. We compute the stochastic -norm by our algorithm and compare it with the result obtained by the MATLAB®-function mincx, see appendix A.3. In all our tests, the relative difference of the computed norms lies within the chosen tolerance level. The computing times, however, differ significantly, see Table 1. While for small dimensions the implementation of the LMI-solver seems to be superior to our implementation, for larger the algorithmic complexity becomes relevant. For the LMI-solver is impractical.
4.2 A heat transfer problem
This stochastic modification of a heat transfer problem described in [17] was also discussed in [18]. On the unit square , the heat equation for is given with Dirichlet condition , , on three of the boundary edges and a stochastic Robin condition on the fourth edge (where stands for white noise). We measure the average value
A standard 5-point finite difference discretization on a grid leads to a modified Poisson matrix with and corresponding matrices , . The -norm of this discretization approximates the induced input/output norm of the partial differential equation. Table 2 shows the computing times (in seconds) and the computed norms (which coincide) for the two methods. For the LMI-solver took to long to be considered.
Again, we observe that Algorithm 1 allows to treat larger dimensions than the LMI-solver. However, the computing times for our algorithm also grow fairly fast. As an alternative to bisection one might consider extrapolating the spectral radii which are computed in the course of the process for , or perhaps the spectral abscissae . Then the norm is given as the value of , where , or . Unfortunately, the slopes of and are very steep as approaches . Thus an extrapolation does not seem promising. The behaviour is visualized for the heat equation system with in Figure 1.
5 Conclusions
We have suggested an algorithm to compute the stochastic -norm. It builds upon several ideas developed in the literature, and is the first algorithm, whose complexity is considerably smaller than that of a general purpose LMI-solver. We chose to present the algorithm for the simplest case of just one multiplicative noise term, which however can easily be generalized to the class described in appendix A.1. Already in the simple case, the stochastic -norm is much harder to compute than the -norm of a deterministic system, and the computing times are still very high. We see it as a challenge to come up with a faster method.
The note also contains some extensions of known results with new proofs, like the nonstrict stochastic bounded real lemma and lower bounds for Riccati solutions.
Appendix A Appendix
A.1 Generalization
System (1) can be generalized in a straight-forward manner to the case of multiple noise terms at the state and the input (see e.g. [10]). Then our system takes the form
[TABLE]
where , and the are independent Wiener processes. The Riccati operator then takes the form
[TABLE]
[TABLE]
Our basic algorithm and all our considerations carry over to this case literally. Only the expressions for and become more technical.
A.2 Proofs
As above, we write
[TABLE]
On the space of symmetric matrices we consider the scalar product , and note that the corresponding adjoint operators are
[TABLE]
Further facts on Riccati- and Lyapunov-type operators are cited from [10].
Proof of Corollary 2.4: (iii)(ii) follows from the definiteness criterion via the Schur-complement.
(ii)(iii): If (ii) holds, then , and by [11] there exists a solution to the equation .
(ii)(i): If (6) holds and we replace and by and , then we get
[TABLE]
This implies for the corresponding modified input-output operator. By as , we obtain .
(i)(ii): If (i) holds, then for all , . Hence there exist stabilizing solutions of . Moreover, is the largest solution of (11) with replaced by . Hence it follows that for all . If the are bounded below, then the sequence converges and the limit satisfies the nonstrict linear matrix inequality in (iii). Thus it suffices to show boundedness. We assume that the sequence is not bounded, i.e. for . Consider the normalized sequence , which – by Bolzano-Weierstrass – has a convergent subsequence with limit . Then
[TABLE]
implying and . Since, by assumption , we obtain
[TABLE]
which is a contradiction.
Thus, has a solution , which is the limit of the largest and stabilizing solutions of . Thus is the largest solution of and . If then and [10, Theorem 3.2.3] yields that .
Proof of Lemma 2.5:
The controllability Gramian is given by
[TABLE]
In the following consider an arbitrary matrix , , satisfying . Then
[TABLE]
There exists a vector with and Moreover
[TABLE]
for . To see this, note that the image of is contained in the image of . Hence there exists a unitary , such that
[TABLE]
The largest zero of is
[TABLE]
For , we have which proves (19).
We set . Let now satisfy (11). With the given data and this implies
[TABLE]
If we assume , then the right hand is negative for , which is a contradiction.
Hence, we have .
Proof of Lemma 2.6: Note that if and and thus every solution of also satisfies . Hence possesses a stabilizing solution, and, consequently, also an anti-stabilizing solution , which is the smallest solution of . Thus also for every solution of .
Proof of Lemma 3.2: We exploit the concavity of and the resolvent positivity of , see [10]. If , then there exists such that, by concavity,
[TABLE]
Assume that . Then by [10, Theorem 3.2.3] there exists , , such that . Taking the scalar product of inequality (20) with , we get
[TABLE]
It follows that , which implies and thus . But then in contradiction to the stability of .
A.3 Usage of LMI-solver
The LMI-solver was used as in the following MATLAB® listing.
1lmis([])
2X = lmivar(1,[n,1]);g = lmivar(1,[1,1]);
3lmiterm([1 1 1 X],N',N);
4lmiterm([1 1 1 X],A',1,'s');
5lmiterm([1 1 1 0],C'*C);
6lmiterm([1 1 2 X],1,B,'s');
7lmiterm([1 2 2 g],-1,1);
8lmisys = getlmis;
9c = mat2dec(lmisys,zeros(n),1);
10options = [tol,0,0,0,1];
11copt = mincx(lmisys,c,options);
12gamma = sqrt(copt)
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] S. Boyd and V. Balakrishnan, “A regularity result for the singular values of a transfer matrix and a quadratically convergent algorithm for computing its L ∞ superscript 𝐿 L^{\infty} -norm,” Syst. Control Lett. , vol. 15, no. 1, pp. 1–7, 1990.
- 2[2] N. A. Bruinsma and M. Steinbuch, “A fast algorithm to compute the H ∞ superscript 𝐻 H^{\infty} -norm of a transfer function matrix,” Syst. Control Lett. , vol. 14, pp. 287–293, 1990.
- 3[3] N. Guglielmi, M. Gürbüzbalan, and M. L. Overton, “Fast approximation of the H ∞ subscript 𝐻 H_{\infty} -norm via optimization over spectral value sets,” SIAM J. Matrix Anal. Appl. , vol. 34, no. 2, pp. 709–737, 2016.
- 4[4] P. Benner and M. Voigt, “A structured pseudospectral method for ℋ ∞ subscript ℋ \mathcal{H}_{\infty} -norm computation of large-scale descriptor systems,” Math. Control Signals Syst. , vol. 26, no. 2, pp. 303–338, 2014.
- 5[5] M. A. Freitag, A. Spence, and P. Van Dooren, “Calculating the H ∞ subscript 𝐻 H_{\infty} -norm using the implicit determinant method,” Linear Algebra Appl. , vol. 35, no. 2, pp. 619–635, 2014.
- 6[6] N. Aliyev, P. Benner, E. Mengi, and M. Voigt, “Large-scale computation of ℋ ∞ subscript ℋ \mathcal{H}_{\infty} -norms by a greedy subspace method,” submitted to SIAM J. Matrix Anal. Appl. , 2016.
- 7[7] D. Hinrichsen and A. J. Pritchard, “Stochastic H ∞ subscript 𝐻 {H}_{\infty} ,” SIAM J. Control Optim. , vol. 36, no. 5, pp. 1504–1538, 1998.
- 8[8] L. Arnold, Stochastic Differential Equations: Theory and Applications. Translation. New York etc.: John Wiley and Sons Inc., 1974.
