Acceleration of the Power Method with Dynamic Mode Decomposition

Jeremy A. Roberts; Leidong Xu; Rabab Elzohery; Mohammad Abdo

arXiv:1904.09493·physics.comp-ph·April 23, 2019

Acceleration of the Power Method with Dynamic Mode Decomposition

Jeremy A. Roberts, Leidong Xu, Rabab Elzohery, Mohammad Abdo

PDF

TL;DR

This paper introduces a DMD-accelerated power method that significantly reduces iterations needed to find dominant eigenmodes, especially useful as a post-processing step when traditional eigensolvers are unsuitable.

Contribution

The paper presents a novel DMD-based acceleration technique for the power method, enabling faster convergence and applicability as a post-process in existing power-method workflows.

Findings

01

DMD-PM(n) reduces power iterations by about 5 times.

02

Arnoldi's method outperforms DMD-PM(n) in efficiency.

03

DMD-PM(n) is useful as a post-processing tool for power methods.

Abstract

Presented is an algorithm based on dynamic mode decomposition (DMD) for acceleration of the power method (PM). The power method is a simple technique for determining the dominant eigenmode of an operator $A$ , and variants of the power method are widely used in reactor analysis. Dynamic mode decomposition is an algorithm for decomposing a time-series of spatially-dependent data and producing an explicit-in-time reconstruction for that data. By viewing successive power-method iterates as snapshots of a time-varying system tending toward a steady state, DMD can be used to predict that steady state using (a sometime surprisingly small) $n$ iterates. The process of generating snapshots with the power method and extrapolating forward with DMD can be repeated. The resulting restarted, DMD-accelerated power method (or DMD-PM( $n$ )) was applied to the two-dimensional IAEA diffusion…

Figures8

Click any figure to enlarge with its caption.

Equations30

Fx = k Lx,

Fx = k Lx,

Ax = λ x .

Ax = λ x .

x^{(0)} = c_{0}^{'} x_{0} + c_{1}^{'} x_{1} + c_{2}^{'} x_{2} \dots = c_{0}^{'} (x_{0} + \frac{c _{1}^{'}}{c _{0}^{'}} x_{1} + \frac{c _{2}^{'}}{c _{0}^{'}} x_{2} \dots) = c_{0}^{'} (x_{0} + c_{1} x_{1} + c_{2} x_{2} \dots) .

x^{(0)} = c_{0}^{'} x_{0} + c_{1}^{'} x_{1} + c_{2}^{'} x_{2} \dots = c_{0}^{'} (x_{0} + \frac{c _{1}^{'}}{c _{0}^{'}} x_{1} + \frac{c _{2}^{'}}{c _{0}^{'}} x_{2} \dots) = c_{0}^{'} (x_{0} + c_{1} x_{1} + c_{2} x_{2} \dots) .

A x^{(0)} = A x_{0} + c_{1} A x_{1} + c_{2} A x_{2} + \dots = λ_{0} x_{0} + c_{1} λ_{1} x_{1} + c_{2} λ_{2} x_{2} + \dots = λ_{0} (x_{0} + c_{1} \frac{λ _{1}}{λ _{0}} x_{1} + c_{2} \frac{λ _{2}}{λ _{0}} x_{2} + \dots) .

A x^{(0)} = A x_{0} + c_{1} A x_{1} + c_{2} A x_{2} + \dots = λ_{0} x_{0} + c_{1} λ_{1} x_{1} + c_{2} λ_{2} x_{2} + \dots = λ_{0} (x_{0} + c_{1} \frac{λ _{1}}{λ _{0}} x_{1} + c_{2} \frac{λ _{2}}{λ _{0}} x_{2} + \dots) .

A^{n} x^{(0)} = λ_{0}^{n} (x_{0} + c_{1} (\frac{λ _{1}}{λ _{0}})^{n} x_{1} + c_{2} (\frac{λ _{2}}{λ _{0}})^{n} x_{2} + \dots),

A^{n} x^{(0)} = λ_{0}^{n} (x_{0} + c_{1} (\frac{λ _{1}}{λ _{0}})^{n} x_{1} + c_{2} (\frac{λ _{2}}{λ _{0}})^{n} x_{2} + \dots),

\frac{d x}{d t} = f (x, t),

\frac{d x}{d t} = f (x, t),

\frac{d x ( t )}{d t} = A x,

\frac{d x ( t )}{d t} = A x,

X_{0} = [x_{0}, x_{1}, \dots, x_{m - 1}] and X_{1} = [x_{1}, x_{2}, \dots, x_{m}],

X_{0} = [x_{0}, x_{1}, \dots, x_{m - 1}] and X_{1} = [x_{1}, x_{2}, \dots, x_{m}],

X_{1} \approx A X_{0},

X_{1} \approx A X_{0},

ω_{i} = \frac{ln ( λ _{i} )}{Δ} .

ω_{i} = \frac{ln ( λ _{i} )}{Δ} .

x (t) \approx i = 1 \sum m z_{i} e^{ω_{i} t} b_{i} = Z e^{Ω t} b,

x (t) \approx i = 1 \sum m z_{i} e^{ω_{i} t} b_{i} = Z e^{Ω t} b,

A = X_{1} X_{0}^{†},

A = X_{1} X_{0}^{†},

X_{0} = U Σ V^{*} \to X_{0}^{†} = V Σ^{- 1} U^{*},

X_{0} = U Σ V^{*} \to X_{0}^{†} = V Σ^{- 1} U^{*},

x^{(i + 1)} = f (x^{(i)}) = \frac{A x ^{(i)}}{∣∣ A x ^{(i)} ∣∣}, i = 0, 1, \dots

x^{(i + 1)} = f (x^{(i)}) = \frac{A x ^{(i)}}{∣∣ A x ^{(i)} ∣∣}, i = 0, 1, \dots

- \nabla D (r)_{1} \nabla ϕ_{1} (r) + Σ_{r 1} (r) ϕ_{1} (r) - \nabla D (r)_{2} \nabla ϕ_{2} (r) + Σ_{a 2} (r) ϕ_{2} (r) = \frac{1}{k} (ν Σ_{f 1} ϕ_{1} (r) + ν Σ_{f 2} ϕ_{2} (r)) = Σ_{s 1 \to 2} ϕ_{1} (r),

- \nabla D (r)_{1} \nabla ϕ_{1} (r) + Σ_{r 1} (r) ϕ_{1} (r) - \nabla D (r)_{2} \nabla ϕ_{2} (r) + Σ_{a 2} (r) ϕ_{2} (r) = \frac{1}{k} (ν Σ_{f 1} ϕ_{1} (r) + ν Σ_{f 2} ϕ_{2} (r)) = Σ_{s 1 \to 2} ϕ_{1} (r),

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Full text

\newsiamremark

remarkRemark \newsiamremarkhypothesisHypothesis

\newsiamthmclaimClaim \headersAcceleration of the Power Method with DMDJ. Roberts, L. Xu, R. Elzohery, M. Abdo

Acceleration of the Power Method with Dynamic Mode Decomposition

Jeremy A. Roberts Department of Mechanical and Nuclear Engineering, Kansas State University, Manhattan, KS 66506 () [email protected]

Leidong Xu

Rabab Elzohery

Mohammad Abdo

Abstract

Presented is an algorithm based on dynamic mode decomposition (DMD) for acceleration of the power method (PM). The power method is a simple technique for determining the dominant eigenmode of an operator $\mathbf{A}$ , and variants of the power method are widely used in reactor analysis. Dynamic mode decomposition is an algorithm for decomposing a time-series of spatially-dependent data and producing an explicit-in-time reconstruction for that data. By viewing successive power-method iterates as snapshots of a time-varying system tending toward a steady state, DMD can be used to predict that steady state using (a sometime surprisingly small) $n$ iterates. The process of generating snapshots with the power method and extrapolating forward with DMD can be repeated. The resulting restarted, DMD-accelerated power method (or DMD-PM( $n$ )) was applied to the two-dimensional IAEA diffusion benchmark and compared to the unaccelerated power method and Arnoldi’s method. Results indicate that DMD-PM( $n$ ) can reduce the number of power iterations required by approximately a factor of 5. However, Arnoldi’s method always outperformed DMD-PM( $n$ ) for an equivalent number of matrix-vector products $\mathbf{Av}$ . In other words, DMD-PM( $n$ ) cannot compete with leading eigensolvers if one is not limited to snapshots produced by the power method. Contrarily, DMD-PM( $n$ ) can be readily applied as a post process to existing power-method applications for which Arnoldi and similar methods are not directly applicable. A slight variation of the method was also found to produce reasonable approximations to the first and second harmonics without substantially affecting convergence of the dominant mode.

keywords:

power method, dynamic mode decomposition, acceleration

{AMS}

65F15

1 Introduction

Eigenvalue problems often arise from equations that describe steady-state systems. In some applications, the fundamental (or dominant) eigenmode (i.e., the eigenvector corresponding to the eigenvalue of largest magnitude) represents the state of the system that would be observed in true, steady-state conditions. One such application arises in the analysis of nuclear reactors, for which the balance of neutrons is modeled using a linearized Boltzmann equation or a related diffusion approximation; either can be represented by the generalized eigenvalue problem

[TABLE]

where $\mathbf{x}$ describes the neutron population, $\mathbf{L}$ represents system losses, $\mathbf{F}$ represents system gains, and the eigenvalue $k=||\mathbf{F}\mathbf{x}||/||\mathbf{L}\mathbf{x}||$ represents the ratio of gains to losses.

To determine the fundamental mode, a variety of algorithms exist, but perhaps the simplest is the power method. The application of the power method to a steady-state system (e.g., a nuclear reactor model) can be interpreted as follows. The system is initialized in an unsteady state corresponding to some linear combination of the system modes (i.e., the eigenvectors of the operator $\mathbf{L}^{-1}\mathbf{F}$ ). The dominant mode has a corresponding frequency $\omega_{0}=0$ , i.e., its temporal evolution is governed by $e^{\omega_{0}t}=1$ . All higher-order modes have frequencies $\omega_{i}<0$ for $i>0$ and, therefore, decay in time. Each iteration of the power method moves the system forward one characteristic step in time, and, therefore, the higher-order modes decay at each step. The method continues until these higher-order modes are sufficiently decayed.

These frequencies and the spatial shapes to which they correspond are precisely what dynamic mode decomposition (DMD) reveals. First proposed to infer system dynamics from a time series of fluid flow-field observations [15, 16], the basic DMD algorithm has been rigorously analyzed mathematically [17], and extended to a variety of applications thoroughly summarized in a recent monograph [12]. In some applications, DMD has been used to recover approximations to physical parameters from the output of existing (possibly black-box) models that would otherwise require significant changes be made to those models. For example, DMD was used to recover $\alpha$ eigenvalues from time-dependent, neutron transport calculations [13]. Others have used DMD as a direct, explicit-in-time surrogate for such black-box models, e.g., to model the evolution of nuclear reactor isotopics over long time periods [2, 9], the nonlinear response of reactor power during short transients [1], and nuclear-fuel, decay-heat generation [3].

Of particular relevance here are two past applications of DMD to accelerate existing, iterative methods. In perhaps the first such application, a modified DMD was used to accelerate the convergence of time-dependent, computational fluid dynamics models to their steady state by starting from (non-steady) initial conditions and marching forward in time until converged [4]. In that method, a system is integrated over a time step to produce a sequence of flow-field snapshots of the form $\mathbf{x}^{(n+1)}=\mathbf{A}\mathbf{x}^{(n)}+\mathbf{b}$ . With use of $\mathbf{X}_{n}=[\mathbf{x}^{(2)}-\mathbf{x}^{(1)},\ldots,\mathbf{x}^{(n+1)}-\mathbf{x}^{(n)}]$ , a low-rank approximation $\tilde{\mathbf{A}}$ can be formed using the basic idea of DMD to be discussed Section 2.2 and used to approximate the correction $\Delta\mathbf{x}$ that satisfies the steady-state condition $(\mathbf{I}-\mathbf{A})\Delta\mathbf{x}=(\mathbf{A}-\mathbf{I})\mathbf{x}^{(n+1)}+\mathbf{b}$ . The corrected steady-state solution is $\mathbf{x}=\mathbf{x}^{(n+1)}+\Delta\mathbf{x}$ . By performing several rounds of time stepping followed by a correction, both the number of iterations and the average fluctuation in successive iterates was reduced for the problems studied[4].

In a similar fashion, DMD was applied to source-driven neutronics problems through acceleration of Richardson (or “source”) iteration for the inhomogeneous equation $\mathbf{L}\mathbf{x}=\mathbf{b}$ (where $\mathbf{L}$ can be the same as in Eq. (1)) [14]. Richardson iteration leads to the sequence $\mathbf{x}^{(n+1)}=(\mathbf{I}-\mathbf{L})\mathbf{x}^{(n)}+\mathbf{b}$ . As in Ref. [4], a set of successive differences $\mathbf{x}^{(n)}-\mathbf{x}^{(n-1)}$ can be used to produce an approximate operator $\tilde{\mathbf{A}}$ , and a correction $\Delta\mathbf{x}$ can be used to provide an improved estimate for $\mathbf{x}=\mathbf{x}^{(n)}+\Delta\mathbf{x}$ . Results from several test cases suggest that a sequence of Richardson iterations followed by corrections reduces the number of iterations required by about one order of magnitude.

Building on these past successes, it is shown here that DMD can be used with the power method to estimate the frequencies and spatial modes of a reactor system as it converges to its fundamental, steady-state mode. The system can then be projected forward in “time” to recover a solution that would be observed after (possibly many) additional iterations of the power method. This process can be repeated, i.e., the power method can provide snapshots of the system in time, and DMD can be used to extrapolate forward in time based on those snapshots to produce a starting point for an addition series of restarted power-method/DMD iterations.

2 Methodology

2.1 The Power Method

The power method is a simple, iterative scheme used to determine the dominant eigenpair of the system

[TABLE]

Let $\lambda_{i}$ represent the $i$ th eigenvalue of $\mathbf{A}$ . Further, let these eigenvalues be numbered in decreasing order based magnitude, i.e., $|\lambda_{0}|>|\lambda_{1}|\geq|\lambda_{2}|\ldots$ . The dominant eigenpair are those $\lambda_{0}$ and $\mathbf{x}_{0}$ that satisfy $\mathbf{Ax}_{0}=\lambda_{0}\mathbf{x}_{0}$ .

The power method proceeds as follows:

Guess $\mathbf{x}^{(0)}$ and normalize it such that $||\mathbf{x}^{(0)}||=1$ . 2. 2.

Update $\mathbf{x}^{(i)}=\mathbf{A}\mathbf{x}^{(i-1)}$ and $\lambda^{(i)}=||\mathbf{x}^{(i-1)}||$ , where $(i)$ represents the iteration. 3. 3.

Set $\mathbf{x}^{(i)}=\mathbf{x}^{(i)}/\lambda^{(i)}$ . 4. 4.

Repeat steps 2 and 3 for $i=1,2,\ldots$ until $||\mathbf{x}^{(i)}-\mathbf{x}^{(i-1)}||<\tau$ for some tolerance $\tau$ .

As long as the fundamental mode (and eigenvalue) are real and the initial guess $\mathbf{x}^{(0)}$ is not perpendicular to the fundamental mode $\mathbf{x}_{0}$ (i.e., $\mathbf{x}^{T}_{0}\mathbf{x}^{(0)}\neq 0$ ), the power method will converge to the dominant eigenpair $(\mathbf{x}_{0},\lambda_{0})$ .

The rate of convergence of the power method depends on the relative magnitudes of the leading eigenvalues. Let the initial guess be represented as a weighted sum of the eigenvectors of $\mathbf{A}$ , i.e.,

[TABLE]

Because normalization of an eigenvector is arbitrary, let $c_{0}^{\prime}=1$ . Then, multiplication of $\mathbf{A}$ by this initial guess leads to

[TABLE]

Repeated application of $\mathbf{A}$ leads to

[TABLE]

which shows that if $|\lambda_{0}|>|\lambda_{1}|$ , then $\mathbf{A}^{n}\mathbf{x}^{(0)}$ will tend toward the direction $\mathbf{x}_{0}$ at a rate governed by the dominance ratio $|\lambda_{1}|/|\lambda_{0}|$ . Because $\lambda^{n}_{0}$ may grow without bound (or vanish), normalization is required during the iteration as is included in the algorithm above.

2.2 Dynamic Mode Decomposition

The dynamic mode decomposition has been widely described in the literature, and the concise overview given here adapts the lucid presentation given in Ref. [12]. To start, consider the generic, dynamic problem defined by

[TABLE]

where $\mathbf{x}\in\mathbb{R}^{n}$ is the $n$ -dimensional state vector at time $t$ . Suppose that the evolution of $\mathbf{x}$ can be well approximated by a relationship of the form

[TABLE]

where the mapping operator $\mathcal{A}$ may not be known explicitly. However, if one has a sequence of samples $\mathbf{x}(t_{i})$ for $t_{i}=\Delta i$ , $i=0,1,\ldots,m$ and defines

[TABLE]

then DMD yields the operator $\mathbf{A}$ (or, as discussed below, lower-rank approximations to it) that best satisfies

[TABLE]

in a least-squares sense. Because the solution to Eq. (7) is $\mathbf{x}(t)=e^{\mathcal{A}t}\mathbf{x}(0)$ , $\mathbf{A}$ is the discrete-time approximation to $e^{\mathcal{A}\Delta}$ . Moreover, the discrete eigenvalues $\lambda_{i}$ of $\mathbf{A}$ are related to the continuous-time eigenvalues $\omega_{i}$ of $\mathcal{A}$ by

[TABLE]

Corresponding to $\lambda_{i}$ is the eigenvector $\mathbf{z}_{i}$ of $\mathbf{A}$ , and the complete set of eigenpairs of $\mathbf{A}$ yields

[TABLE]

and, to satisfy $\mathbf{x}(0)=\mathbf{Z}\mathbf{b}$ in a least-squares sense, let $\mathbf{b}=\mathbf{Z}^{\dagger}\mathbf{x}(0)$ , where $\dagger$ indicates the Moore-Penrose pseudoinverse.

The best-fit operator $\mathbf{A}$ is formally given by

[TABLE]

where $\mathbf{X}^{\dagger}_{0}$ can be defined from its (thin) singular-value decomposition, i.e.,

[TABLE]

where $\mathbf{U}\in\mathbb{C}^{m\times n}$ , $\mathbf{V}\in\mathbb{C}^{n\times n}$ , $\bm{\Sigma}\in\mathbb{C}^{n\times n}$ , and $*$ indicates the conjugate transpose. It follows that $\mathbf{A}=\mathbf{X}_{1}\mathbf{V}\bm{\Sigma}^{-1}\mathbf{U}^{*}$ . However, this matrix is large, and, in practice, the low-rank approximation ${\mathbf{\tilde{A}}=\mathbf{U}^{*}_{r}\mathbf{A}\mathbf{U}_{r}}$ can be used. Here, $\mathbf{U}_{r}$ contains the first $r<n$ columns of the left singular matrix, usually arranged to correspond to the largest $r$ singular values (but alternative selection schemes have been explored [12]) Finally, the leading $r$ eigenvalues of $\mathbf{A}$ are inferred from the eigenvalues of $\mathbf{\tilde{A}}$ , while the corresponding eigenvectors $\mathbf{Z}$ of $\mathbf{A}$ are recovered from the eigenvectors $\mathbf{W}$ of $\tilde{\mathbf{A}}$ through $\mathbf{Z}=\mathbf{X}_{1}\mathbf{V}_{r}\bm{\Sigma}_{r}^{-1}\mathbf{W}$ .

2.3 An Accelerated Power Method using DMD

The power method for the system $\mathbf{A}\mathbf{x}=\lambda\mathbf{x}$ can be represented by the sequence

[TABLE]

Suppose that $m$ power iterations have been performed to produce $\mathbf{X}_{0}$ and $\mathbf{X}_{1}$ , where each iterate is separated by a (fictional) $\Delta$ in time. Application of DMD leads to a set of approximate eigenvectors for $\mathbf{A}$ and a set of eigenvalues $e_{j}\approx\lambda_{j}/\lambda_{0}$ . The leading eigenvalue $e_{0}$ tends toward unity, indicating a stationary process (and the convergence of the power method to the fundamental mode).

The normalization in Eq. (14) may be important for reducing numerical round-off errors introduced by growing (or decaying) iterates. However, the normalization introduces nonlinearity to $\mathbf{f}$ . This nonlinearity has not led to numerical difficulties, and this may be explained by the strong connection between DMD and the Koopman operator [12, 5], which is the infinite-dimensional, linear operator $\mathcal{K}$ that maps $\mathbf{g}(\mathbf{x}(t_{n+1}))=\mathcal{K}\mathbf{g}(\mathbf{x}(t_{n}))$ for some (possibly nonlinear) observation function $\mathbf{g}(\mathbf{x})$ and iterates $\mathbf{x}(t_{i})$ of Eq. (6).

In order to accelerate the power method with DMD, the following DMD-PM( $n$ ) algorithm is proposed:

Guess $\mathbf{x}^{(0)}$ and normalize. 2. 2.

Perform $n$ power iterations to produce $\mathbf{X}_{0}$ and $\mathbf{X}_{1}$ 3. 3.

Compute the DMD modes and frequencies using a rank- $r$ , truncated SVD (i.e., $r<n$ ) 4. 4.

Apply Eq. (11) to estimate $\mathbf{x}^{(\infty)}=\mathbf{x}(\infty)$ , i.e., estimate the steady-state, dominant mode after an equivalent of $\infty$ power iterations. 5. 5.

Set $\mathbf{x}^{(0)}=\Re(\mathbf{x}(\infty))/||\mathbf{x}(\infty)||$ . 6. 6.

Repeat Steps 1 through 5 until converged.

In practice, a large value can be used in place for $\infty$ . Alternatively, because the power method must reveal a single, dominant mode if the assumptions described in Section 2.1 are satisfied, then only one mode recovered by DMD should remain at $t=\infty$ . In other words, one can directly set $\mathbf{x}^{(0)}=b_{0}\mathbf{z}_{0}/||b_{0}\mathbf{z}_{0}||$ (where $b_{0}$ ensures a consistent sign).

One numerical challenge faced by the algorithm described is that after several passes, the iterates produced in Step 2 are increasingly ill conditioned. Evaluation of the complete (reduced) SVD of rank ( $n-1$ ) may lead to numerical errors, and to mitigate these effects, an optimal rank may be selected [10]. In addition, it was found that exclusion of the DMD-corrected iterate generated in Step 4 from $\mathbf{X}_{0}$ improves the numerical stability; a further study is warranted to understand the impact of the initial vector on DMD’s performance.

As described, the DMD-accelerated power method constructs a sequence of $n$ vectors that form a basis for an $n$ -dimensional Krylov subspace and attempts to extract spectral properties about $\mathbf{A}$ from that basis. This process bears obvious similarity to the Arnoldi method, with a major difference being that the DMD acceleration post-processes the existing basis while the Arnoldi method continually refines the basis through orthonormalization. Based on scoping studies, it would appear that given $n$ possible applications of $\mathbf{A}$ to a starting vector $\mathbf{x}^{(0)}$ , the Arnoldi method is almost certain to provide a better estimate for $\mathbf{x}_{0}$ than DMD-PM.

3 Numerical Results

To illustrate the DMD-PM algorithm proposed in Section 2.3, it was used to determine the fundamental eigenmode for the well-known, 2-D IAEA diffusion benchmark [6]. Shown in Figure 1 is the basic core layout.

The governing equations are

[TABLE]

where the notation is standard [8], and all parameters are defined in the benchmark documentation [6]. A mesh-centered, finite-volume approximation was used with a uniform, $45\times 45$ spatial mesh. Upon discretization, the entire set of equations was cast in terms of the fission source density, i.e., $\mathbf{f}=\nu\Sigma_{f1}\phi_{1}(\mathbf{r})+\nu\Sigma_{f2}\phi_{2}(\mathbf{r})$ , which results in a $2025\times 2025$ operator. Hence, the problem is by no means a large one, but it proved to be a valuable test case for the method.

All calculations were initialized with a starting vector of which each element was drawn from the uniform distribution $U[0,1]$ . This randomized starting vector helps to ensure that all eigenmodes can be present. A formal sensitivity study was not performed to understand how this initial guess impacts the algorithm performance, but scoping studies suggest there is little impact on the numbers of iterations required for any particular algorithm. In all cases, a reference solution was computed using the implicitly-restarted Arnoldi method as implemented in SciPy [11]. All DMD calculations were performed using PyDMD [7].

3.1 Skipping Ahead with DMD-PM( $n$ )

As a first test of the method, a series of $n$ power iterations were performed, from which a DMD surrogate following Eq. (11) was defined. The dominant eigenmode was reconstructed as a function of iteration, and the error with respect to the reference eigenmode was computed. Shown in Figure 2 are the results for several values of $n$ . Errors in the eigenmode as computed from standard power iterations are also shown. All errors are defined as $||\mathbf{x}^{\text{reference}}-\mathbf{x}^{\text{approximate}}||_{2}$ .

The DMD surrogates reproduce the eigenmodes very accurately for $n$ or fewer iterations. The error approaches an asymptotic, lower bound as predictions are made beyond the number of power iterations used to generate the DMD surrogate. Shown in parentheses are the number of equivalent power iterations to which the final, saturated error in the DMD prediction corresponds. For example, application of 30 power iterations leads to a DMD surrogate that can predict an eigenmode with an accuracy equal to 149 power iterations, a substantial skip ahead in the number of iterations. This skipping ahead is precisely what the algorithm described in Section 2.3 facilitates.

3.2 Application of Restarted DMD-PM $(n)$

To test the iterative application of the DMD-accelerated power method, the same problem was solved for a variety of restart values $n$ . The results are shown in Figure 3. Also included are errors for the power method (PM) and Arnoldi’s method. Here, the Arnoldi method was used without restarts. The results shown for Arnoldi are as a function of the size of the subspace used.

The restart values for DMD-PM( $n$ ) were selected so that the same number of matrix-vector multiplications $\mathbf{A}\mathbf{v}$ were used to construct the underlying DMD surrogate. Here, the extrapolation is counted as an iteration (along the horizontal axis), so the final end points do not match exactly. Although all four applications of DMD-PM( $n$ ) significantly accelerate the power method, use of a larger restart value appears to produce slightly better acceleration (at the cost, of course, of storing a larger number of snapshots). For reference, approximately 800 power iterations are required for similar, final errors (i.e., $10^{-14}$ ).

A stark difference can be observed between the DMD-PM( $n$ ) performance and that of Arnoldi’s method. Whereas DMD-PM( $n$ ) requires more than 150 iterations to converge to within, say, $10^{-13}$ , Arnoldi’s method requires fewer than 40. This difference is not altogether surprising. Arnoldi is based on a subspace that undergoes continuous orthonormalization, which produces a better-conditioned and, likely, a richer basis than can be produced by successive application of $\mathbf{A}$ to a single vector.

3.3 Restarted DMD-PM( $n$ ) for Higher Modes

Like Arnoldi’s method (and others based on construction of Krylov or other subspaces), DMD-PM( $n$ ) can recover higher-order modes, at least approximately. However, an unrestarted DMD-PM approximation leads to an ill-conditioned basis and, hence, cannot produce approximations for higher-order modes with reliable accuracy. Moreover, the iterative DMD-PM( $n$ ) is, by design, ill suited for recovering higher-order modes because it essentially throws away all higher-order modes upon the restart.

Consequently, a slight variation of the iterative, restarted DMD-PM( $n$ ) algorithm was tested. Rather than keeping only the DMD mode corresponding to the largest eigenvalue, the dominant mode was kept with a small contribution from the next two modes in order to capture the three modes with the largest eigenvalues. Specifically, the initial guess between restarts was chosen to be the $\mathbf{z}_{0}+\epsilon(\mathbf{z}_{1}+\mathbf{z}_{2})$ , where $\epsilon$ is a small value (here, $10^{-4}$ ) that guarantees that the next iteration is started with at least some contribution from the higher-order space. The largest three eigenvalues and their corresponding modes were recovered from the DMD calculation after the second, third, and fourth iterations of DMD-PM( $30$ ) as approximations for the first three eigenpairs of the original system. Shown in Figure 4 are the reference modes. The corresponding, absolute errors in the DMD-PM( $n$ ) approximations are shown in Figure 5. All computed eigenvectors were normalized to unity. Errors in higher-order mode estimates were found to depend somewhat on the randomized initial guess, but those shown are representative values.

As can be expected, the errors in the two higher-order modes (and their eigenvalues) are much larger than the error for the dominant mode. Moreover, these errors decrease somewhat with each iteration. The error in the dominant mode after two iterations ( $1.77\times 10^{-7}$ ) is nearly unchanged from the case in which higher modes are not kept ( $1.54\times 10^{-7})$ ; see Figure 3. However, the performance does degrade somewhat thereafter, with errors after three and four iterations of approximately $5.06\times 10^{-9}$ and $1.65\times 10^{-10}$ , respectively, compared to $9.90\times 10^{-11}$ and $3.00\times 10^{-12}$ in Figure 3.

4 Conclusions

The DMD-PM( $n$ ) method was found to provide reasonable ( $5\times$ ) speedup compared to unaccelerated power iterations. Although not competitive with Arnoldi for the test problem studied, there do exist applications for which access to iterates is only available in a post-processing sense. In reactor analysis, the use of the power method in Monte Carlo simulations is widespread. Application of DMD-PM( $n$ ) in that setting may be an obvious solution for convergence acceleration (a common problem) and, potentially, a tool for analysis of source convergence and tally uncertainties. Also in that setting, the continual perturbation of higher modes due to uncertainties may make their computation easier and more robust (relative to the deterministic examples shown above).

Acknowledgments

The material presented is based on work supported in part by a Faculty Development Grant awarded to Kansas State University by the U.S. Nuclear Regulatory Commission under award number NRC-HQ-60-17-G-0010.

Bibliography17

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1[1] M. Abdo, R. Elzohery, and J. Roberts , Analysis of the lra reactor benchmark using dynamic mode decomposition , Transactions of the American Nuclear Society, 119 (2018), pp. 683–685.
2[2] M. Abdo, R. Elzohery, and J. A. Roberts , Modeling isotopic evolution with surrogates based on dynamic mode decomposition , Annals of Nuclear Energy, 129 (2019), pp. 280–288.
3[3] A. Alfonsi, A. Hummel, S. Sen, G. Strydom, and H. Gougar , Decay Heat Curve Generation for High Temperature Reactors Using Exponentials, Support Vector Machines and Dynamic Mode Decomposition Within the RAVEN Framework , Trans. Am. Nucl. Soc., 118 (2018).
4[4] N. Andersson and L.-E. Eriksson , A novel solver acceleration technique based on dynamic mode decomposition , in 6th European Conference on Computational Fluid Dynamics (ECFD VI), 20-25 July 2014, Barcelona, Spain, 2014, pp. 4832–4851.
5[5] M. Budišić, R. Mohr, and I. Mezić , Applied koopmanism , Chaos: An Interdisciplinary Journal of Nonlinear Science, 22 (2012), p. 047510.
6[6] A. C. Center , Benchmark problem book, report anl-7416 (suppl. 2) , tech. report, Argonne National Laboratory, Argonne, IL, 1977.
7[7] N. Demo, M. Tezzele, and G. Rozza , Py DMD: Python Dynamic Mode Decomposition , The Journal of Open Source Software, 3 (2018), p. 530, https://doi.org/https://doi.org/10.21105/joss.00530 . · doi ↗
8[8] J. J. Duderstadt and L. J. Hamilton , Nuclear reactor analysis , Wiley-Interscience, 1976.

TL;DR

Contribution

Findings

Abstract

Peer Reviews

Videos

Acceleration of the Power Method with Dynamic Mode Decomposition

Abstract

keywords:

1 Introduction

2 Methodology

2.1 The Power Method

2.2 Dynamic Mode Decomposition

2.3 An Accelerated Power Method using DMD

3 Numerical Results

3.1 Skipping Ahead with DMD-PM(nnn)

3.2 Application of Restarted DMD-PM(n)(n)(n)

3.3 Restarted DMD-PM(nnn) for Higher Modes

4 Conclusions

Acknowledgments

3.1 Skipping Ahead with DMD-PM( $n$ )

3.2 Application of Restarted DMD-PM $(n)$

3.3 Restarted DMD-PM( $n$ ) for Higher Modes