A non-intrusive reduced order modeling framework for quasi-geostrophic turbulence
Sk. Mashfiqur Rahman, Suraj Pawar, Omer San, Adil Rasheed, Traian, Iliescu

TL;DR
This paper introduces a non-intrusive reduced order modeling framework using LSTM neural networks for quasi-geostrophic turbulence, enabling stable, accurate predictions with fewer modes compared to traditional methods.
Contribution
The study develops a novel ROM framework that leverages LSTM for efficient, stable, and accurate prediction of complex fluid flows with fewer modes than conventional projection-based ROMs.
Findings
ROM-LSTM captures intermittent bursts accurately
ROM-LSTM achieves stable solutions with fewer POD modes
Outperforms traditional Galerkin projection ROMs in stability and efficiency
Abstract
In this study, we present a non-intrusive reduced order modeling (ROM) framework for large-scale quasi-stationary systems. The framework proposed herein exploits the time series prediction capability of long short-term memory (LSTM) recurrent neural network such that: (i) in the training phase, the LSTM model is trained on the modal coefficients extracted from the high-resolution data using proper orthogonal decomposition (POD) transform, and (ii) in the testing phase, the trained model predicts the modal coefficients for the total time recursively based on the initial time history. To illustrate the predictive performance of the proposed framework, the mean flow fields and time series response of the field values are reconstructed from the predicted modal coefficients by using an inverse POD transform. As a representative benchmark test case, we consider a two-dimensional…
| Parameters | Values |
|---|---|
| Number of hidden layers | 6 |
| Number of neurons in each hidden layer | 40 |
| Batch size | 16 |
| Epochs | 500 |
| Activation functions in the LSTM layers | tanh |
| Training-testing ratio | 4:9 |
| Validation data set | 20 |
| Loss function | MSE |
| Optimizer | ADAM |
| Modal coefficient | Hurst exponent |
|---|---|
| 0.52 | |
| 0.35 | |
| 0.63 | |
| 0.59 | |
| 0.49 | |
| 0.59 | |
| 0.59 | |
| 0.46 | |
| 0.58 | |
| 0.53 |
| Vorticity | Streamfunction | |
|---|---|---|
| Intrusive ROM | ||
| ROM-GP () | ||
| ROM-GP () | ||
| ROM-GP () | ||
| ROM-GP () | ||
| ROM-GP () | ||
| Non-intrusive ROM | ||
| ROM-LSTM () | ||
| ROM-LSTM () | ||
| ROM-LSTM () | ||
| ROM-LSTM () | ||
| ROM-LSTM () | ||
| ROM-LSTM | Training time (s) | Testing time (s) |
|---|---|---|
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
A non-intrusive reduced order modeling framework for quasi-geostrophic turbulence
Sk. M. Rahman
S. Pawar
O. San
School of Mechanical and Aerospace Engineering, Oklahoma State University, Stillwater, OK 74078, USA
A. Rasheed
Department of Engineering Cybernetics, Norwegian University of Science and Technology, N-7465, Trondheim, Norway
T. Iliescu
Department of Mathematics, Virginia Tech, Blacksburg, VA 24061, USA
Abstract
In this study, we present a non-intrusive reduced order modeling (ROM) framework for large-scale quasi-stationary systems. The framework proposed herein exploits the time series prediction capability of long short-term memory (LSTM) recurrent neural network architecture such that: (i) in the training phase, the LSTM model is trained on the modal coefficients extracted from the high-resolution data snapshots using proper orthogonal decomposition (POD) transform, and (ii) in the testing phase, the trained model predicts the modal coefficients for the total time recursively based on the initial time history. Hence, no prior information about the underlying governing equations is required to generate the ROM. To illustrate the predictive performance of the proposed framework, the mean flow fields and time series response of the field values are reconstructed from the predicted modal coefficients by using an inverse POD transform. As a representative benchmark test case, we consider a two-dimensional quasi-geostrophic (QG) ocean circulation model which, in general, displays an enormous range of fluctuating spatial and temporal scales. We first illustrate that the conventional Galerkin projection based reduced order modeling of such systems requires a high number of POD modes to obtain a stable flow physics. In addition, ROM-GP does not seem to capture the intermittent bursts appearing in the dynamics of the first few most energetic modes. However, the proposed non-intrusive ROM framework based on LSTM (ROM-LSTM) yields a stable solution even for a small number of POD modes. We also observe that the ROM-LSTM model is able to capture quasi-periodic intermittent bursts accurately, and yields a stable and accurate mean flow dynamics using the time history of a few previous time states, denoted as the lookback time-window in this paper. Throughout the paper, we demonstrate our findings in terms of time series evolution of the field values and mean flow patterns, which suggest that the proposed fully non-intrusive ROM framework is robust and capable of predicting noisy nonlinear fluid flows in an extremely efficient way compared to the conventional projection based ROM framework.
I Introduction
Large-scale turbulent flows, such as atmospheric and geophysical flows, are nonlinear dynamical systems which exhibit an enormous range of complex, coherent spatio-temporal scales. Over the past half century, computational approaches have made a significant contribution to characterize and understand the behavior of such flow phenomena. To resolve physical problems with high spatio-temporal variabilities through numerical simulation, one needs a high-fidelity modeling technique like direct numerical simulation (DNS). However, a huge amount of computational resources are required to capture the fine details of the flow dynamics which can become inefficient and unmanageable after some level of accuracy. Although there has been a continuous growth in computer power and performance following Moore’s law during the past few decades Mack (2011), the progress has started to stagnate in the recent years Powell (2008); Waldrop (2016); Theis and Wong (2017); Kumar (2018). As a result, one of the most active research areas in modeling of turbulent flow dynamics is the development of efficient and robust algorithms that aim at achieving the maximum attainable quality of numerical simulations with optimal computational costs. Indeed, computational costs can be reduced by using low-fidelity models such as large eddy simulation (LES) Sagaut (2006) and Reynolds-averaged Navier-Stokes (RANS) Tennekes et al. (1972) with additional approximations in the governing equations to neglect some of the physical aspects for closure modeling. Even so, these techniques require parameter calibration to approximate the true solution to any degree of confidence and may thus increase costs related to model validation, benchmark data generation, and efficient analysis of the generated data sets. As an alternative to the existing techniques, the reduced order modeling (ROM) approach has quickly become a promising approach to reduce the computational burden of high-fidelity simulations. In general, ROM works in such a way that the high-dimensional complex dynamical systems will be represented with much lower-dimensional (but dense) systems while keeping the solution quality within the acceptable range Quarteroni et al. (2015); Taira et al. (2017). An introduction to ROM methodologies can be found in recent review articles Benner et al. (2015); Taira et al. (2017); Rowley and Dawson (2017).
There have been a significant number of strategies proposed over the years to obtain ROMs of nonlinear dynamical systems. These ROM techniques have been utilized for a wide variety of applications related to, e.g., flow control Noack et al. (2011); Brunton and Noack (2015); Ito and Ravindran (1998), data assimilation in weather and climate modeling Daescu and Navon (2008); Cao et al. (2007), and uncertainty quantification Ullmann and Lang (2014); Chen and Quarteroni (2015); Haasdonk et al. (2013). Among the different variants of ROM strategies, the Galerkin projection combined with proper orthogonal decomposition (POD) based ROMs (ROM-GP) have been utilized extensively in various areas Carlberg et al. (2011); Lucia et al. (2004); Bistrian and Navon (2015); Brenner et al. (2012); Burkardt et al. (2006); Iollo et al. (2000); Freno and Cizmas (2014). POD, also known as principal component analysis (PCA), is a mathematical technique to extract the dominant statistical characteristics from turbulent flow fields by identifying the most energetic modes Lumley (1967); Antoulas et al. (2001); Chinesta et al. (2011); Benner et al. (2015); Rowley and Dawson (2017). These few POD modes possess the fine-scale details of the system and have the capability of representing the true physics accurately. Over the years, considerable work has been done to improve the regular POD approaches Hesthaven et al. (2015); Lassila et al. (2013); Quarteroni et al. (2015); Berkooz et al. (1993); Taira et al. (2017); Aubry et al. (1992); Benner et al. (2015); Rowley et al. (2004); Noack et al. (2011); Sieber et al. (2016).
In general, POD uses the data obtained from experiments or high-fidelity numerical simulations and generates an orthonormal set of spatial basis vectors describing the main directions (modes) by which the flow is represented optimally, in an sense Berkooz et al. (1993). The most energetic modes are kept to generate the reduced order system while the other modes are truncated. However, it has been observed that the discarded modes often contribute to the evolving dynamics of large-scale complex turbulent flow systems, like the geophysical flows Lassila et al. (2014), resulting in instabilities and modeling errors in the solution Akhtar et al. (2009); Rempfer (2000); Rowley and Williams (2006); Iollo et al. (2000). Thus, several research efforts have been devoted to improve the stability of ROM-GP frameworks by addressing the truncated modes contributions Aubry et al. (1988); Rempfer and Fasel (1994a, b); Cazemier et al. (1998); Xie et al. (2017). Noack et al. Noack et al. (2003) proposed an extra ‘shift-mode’ for accurate representation of the unstable steady solution. Several closure modeling ideas are devised to resolve the weak dissipation associated with POD modes by introducing eddy-viscosity terms (similar to LES eddy-viscosity models) Sirisup and Karniadakis (2004); Cordier et al. (2013); Östh et al. (2014). San and Iliescu San and Iliescu (2014) improved the ROM performance by finding an optimal value for eddy-viscosity parameter with the assumption that the amount of dissipation is not identical for all the POD modes. In our recent work, we proposed an automated approach to find the eddy-viscosity parameter dynamically to stabilize the ROM-GP model Rahman et al. (2019). An alternative approach to find the eddy-viscosity parameter dynamically has been proposed by using an extreme learning machine architecture San and Maulik (2018a). With the growing interest in data-driven modeling of ROMs using machine learning (ML) architectures, there has been another dimension of research introduced to the community for the improvement in ROM performance, referred as hybrid ROM approach. Generally, the hybridization is done by combining an imperfect physics-based model with a data-driven technique to get a hybrid scheme, and it is observed that the hybrid model shows better predictive performance than the component models Rahman et al. (2018); Wan et al. (2018); Xie et al. (2018); Pathak et al. (2018).
In this paper, we develop a fully non-intrusive ROM approach as a potential alternative to already existing ROM methodologies. Indeed, physics-based (intrusive) ROM frameworks require an approximation of stabilization or regularization parameters and depend on underlying governing equation to get the solution. On the other hand, the hybrid approaches require computation of both intrusive and non-intrusive contributions, which can make the overall computation expensive. However, it is well-known that a non-intrusive approach can make the framework greatly efficient when it can be implemented successfully. With the abundance of massive amounts of data resources from high-fidelity simulations, field measurements, and experiments, the data-driven modeling approaches are currently considered some of the most promising methods across different scientific and research communities. In the past few years, artificial neural networks (ANNs) and other ML techniques have started a revolution in turbulence modeling community Raissi et al. (2019); Maulik and San (2017); Raissi et al. (2018); Lee et al. (1997); Maulik et al. (2018); Faller and Schreck (1997); San and Maulik (2018b); Wang et al. (2019); Moosavi et al. (2015); Kani and Elsheikh (2017). Interested readers are directed to Refs. Brunton et al. (2019); Kutz (2017); Durbin (2018); Duraisamy et al. (2018); Gamboa (2017) for more on the influence of ML on fluid mechanics, specifically turbulence modeling.
With a goal to develop an efficient and robust non-intrusive ROM framework for large-scale quasi-stationary systems like geophysical flows, we propose a methodology based on long short-term memory (LSTM) recurrent neural networks. Since reduced order modeling of such noisy large-scale systems is comparatively difficult due to instabilities, which results in using a very large number of POD modes to capture the true physics, our main motivation in this study is to utilize the time series prediction capability of LSTM Gers et al. (2002); Wang et al. (2018); Yeo and Melnyk (2019); Sak et al. (2014); Gamboa (2017) to capture the flow physics with a very few POD modes. As detailed in Ref. Yeo and Melnyk (2019), LSTM is very robust in predicting a very noisy sequential time series. In general, for this type of random time series, LSTM does the prediction using its own internal dynamics, which is found stable and close to the true solution Yeo and Melnyk (2019); Mohan and Gaitonde (2018). For this reason, we choose to utilize LSTM architecture based on our problem of interest, which is the large-scale quasi-stationary turbulence. However, we emphasize that this non-intrusive model can be developed by using other relevant neural network architectures as well. We also mention that the development of ROM using POD and LSTM has been used only in a few other works and proven to be successful in capturing the temporal dynamics of fluid flows. Wang et al. Wang et al. (2018) proposed a non-intrusive ROM (NIROM) based on LSTM and used it to predict laminar flows. In another recent work, Vlachas et al. Vlachas et al. (2018) proposed a data-driven method based on LSTM to predict the state derivative of chaotic systems using the short-term history of the reduced order states. The predicted derivatives are then used for one-step forward prediction of the high-dimensional dynamics. The authors further developed a hybrid framework combining mean stochastic model and LSTM for data-driven to extend the forecasting capability of the proposed approach. To do the dimensionality reduction, the authors utilized discrete Fourier transform, singular value decomposition, and empirical orthogonal functions. Mohan and Gaitonde Mohan and Gaitonde (2018) developed a non-intrusive ROM using LSTM and POD for flow control applications through a detailed analysis on different ROM-LSTM training and testing hyper-parameter tuning parameters. Even though the authors’ idea of developing non-intrusive ROM based LSTM by replacing Galerkin projection is similar to our present work, their work is mostly focused on exploring the capability of LSTM in modeling the flow in reduced order space for data sets with less randomness. Indeed, the data sets with less randomness have more “memory” in it, i.e., there are persistent or anti-persistent trends and thus, are more controllable through LSTM hyper-parameters. On the other hand, our present work is focused on exploring the capability of ROM-LSTM framework in resolving large-scale geophysical flow problem where the data sets mostly do not follow any particular trend. To this end, we develop a modular ROM-LSTM approach in chaotic and quasi-stationary systems to see whether it can overcome the instability issues associated with conventional ROMs for noisy dynamical systems. To assess our proposed framework, we consider the barotropic vorticity equation (BVE) representing the single-layer quasi-geostrophic (QG) model as an example of the quasi-stationary system. We observe a remarkably efficient predictive performance by the proposed framework based on LSTM (ROM-LSTM) through a number of numerical experiments and analyses.
The layout of the paper is as follows: Section II provides an overview of the barotropic vorticity equation describing a single-layer QG ocean model. In Section III, dimension reduction through Galerkin-projection and proper orthogonal decomposition is illustrated briefly. Our proposed non-intrusive ROM-LSTM framework with a brief introduction to LSTM are presented in Section IV. In Section V, we evaluate the predictive performance of the proposed ROM framework with respect to the standard ROM and full order model solutions. Finally, Section VI provides a summary of this study and the conclusions drawn from it.
II Single-layer quasi-geostrophic (QG) Ocean Circulation Model
In the present study, we consider the simple single-layer QG ocean circulation model to develop and assess the performance of different ROM approaches. Following Refs. Holm and Nadiga (2003); San et al. (2011), we consider the single-layer QG problem as a benchmark for wind-driven, large-scale oceanic flow. Wind-driven flows of mid-latitude ocean basins have been studied frequently by modelers using idealized single- and double-gyre wind forcing, which helps in understanding various aspects of ocean dynamics, including the role of mesoscale eddies and their effect on mean circulation. However, modeling the vast range of spatio-temporal scales of the oceanic flows with all the relevant physics has always been challenging. As a result, the numerical simulation of oceanic and atmospheric flows still requires approximations and simplifications of the full model. The barotropic vorticity equation (BVE) describing the single-layer QG equation with dissipative and forcing terms is one of the most commonly used models for the double-gyre wind-driven geophysical flows Majda and Wang (2006).
The BVE model shares many features with the two-dimensional Euler and Navier-Stokes equations and has been extensively used over the years to describe various aspects of the largest scales of turbulent geophysical fluid dynamics Holland and Rhines (1980); Munk and Wunsch (1982); Griffa and Salmon (1989); Cummins (1992); Greatbatch and Nadiga (2000); Nadiga and Margolin (2001). Using plane assumption reasonable for most oceanic flows, the dimensionless vorticity-streamfunction formulation of the forced-dissipative BVE can be written as Rahman et al. (2018):
[TABLE]
where is the standard two-dimensional Laplacian operator. and are the kinematic vorticity and streamfunction, respectively, defined as:
[TABLE]
[TABLE]
where is the two-dimensional velocity field and refers to the unit vector perpendicular to the horizontal plane. The kinematic equation connecting the vorticity and streamfunction can be found by substituting the velocity components in terms of streamfunction in Equation (2), which yields the following Poisson equation:
[TABLE]
Equation (1) contains two dimensionless parameters, Reynolds number (Re) and Rossby number (Ro), which are related to the physical parameters and non-dimensional variables in the following way:
[TABLE]
where is the horizontal eddy viscosity of the BVE model and is the gradient of the Coriolis parameter at the basin center (). is the basin length scale and is the velocity scale, also known as the Sverdrup velocity Sverdrup (1947), and is given by
[TABLE]
where is the maximum amplitude of the double-gyre wind stress, is the mean fluid density, and is the mean depth of the ocean basin. Despite not being explicitly represented in Equation (1), there are two important relevant physical parameters, the Rhines scale, , and the Munk scale, , which are the non-dimensional boundary layer thicknesses for the inertial and viscous (Munk) layer of the basin geometry, respectively. As a physical interpretation of these parameters in BVE model, accounts for the strength of nonlinearity and is a measure of dissipation strength. and can be defined as
[TABLE]
and are related to Ro and Re by the following relations
[TABLE]
Finally, the nonlinear advection term in Equation (1) is given by the Jacobian
[TABLE]
Since ocean circulation models where the Munk and Rhines scales are close to each other, like the QG model, remain time dependent rather being converged to a steady state as time approaches to infinity Medjo (2000), numerical computations of these models are conducted in a statistically steady state, also known as the quasi-stationary state. Hence, in our study, we utilize numerical schemes suited for simulation of such type of ocean models and for long-time integration. Details of the relevant numerical discretization schemes, Poisson solver, and problem definitions for this study can be found in elsewhere Rahman et al. (2018); San et al. (2011); San (2016).
III Intrusive ROM-GP Methodology
The intrusive ROM framework is developed based on a standard Galerkin projection methodology using the method of snapshots, an efficient method for computing the POD basis functions Sirovich (1987). In this section, we give a brief idea on the ROM-GP framework utilized in our work. We obtain number of snapshots for vorticity field, for at pseudo-time from full order model simulation (FOM). Algorithm 1 describes the POD basis construction procedure from the stored snapshots.
We can approximate the field variables, i.e., kinematic vorticity and streamfunction using the most energetic POD modes, where , such that these largest energy containing modes correspond to the largest eigenvalues (). The resulting full expression for the field variables can be written as:
[TABLE]
where accounts for both streamfunction and vorticity based on the kinematic relation given by Eq. (4). It should be mentioned that in our ROM formulations, we use the following angle-parenthesis definition for the inner product of two arbitrary functions and :
[TABLE]
We refer to Rahman et al. (2019) for the details of the integration technique utilized in this study. In conventional projection based intrusive ROM framework, we apply Galerkin Projection to the governing equation, which yields coupled ordinary differential equations (ODEs) for the time evolution of the temporal modes of the system while the spatial modes are kept constant Kunisch and Volkwein (2002); Rowley et al. (2004); Aubry et al. (1988). Any standard time integration technique can be utilized to solve the coupled ODE system, since the basis functions and corresponding modal coefficients will be precomputed in the offline computation stage. The Galerkin projection approach is summarized in Algorithm 2.
IV Non-intrusive ROM-LSTM Methodology
In this section, we discuss our proposed ROM-LSTM methodology. As outlined in Algorithm 1, we obtain the time-dependent modal coefficients by performing a POD transform on stored snapshot data. The modal coefficients are a sequence of data points with respect to time, i.e., a time series representing the underlying dynamical system. In intrusive or physics-based ROM, we do Galerkin projection using governing equation to obtain a coupled system of ODEs for , and then solve the ODE system on the given time interval. However, the limitations of projection based ROMs, such as susceptibility to instability for noisy data set, numerical constraints for solving ODE system, or inefficient reduced order modeling, encourage us to replace the physics-based Galerkin projection phase of ROM-GP methodology with a data-driven approach. Among the variety of ideas to resolve the issues associated with projection based ROM, a number of published works related to ROM based on POD and neural networks have shown signs of future success. The recurrent neural network (RNN) is one of the widely used neural network architectures in ROMs which is designed to operate on input information as well as the previously stored observations to predict the dependencies among the temporal data sequences Jaeger and Haas (2004); LeCun et al. (2015). LSTM is a special variant of RNN which is capable of tracking long-term dependencies among the input data sequences. Hence, we consider LSTM recurrent neural network to develop our non-intrusive ROM-LSTM framework. Before describing the ROM-LSTM procedure, we first briefly review the LSTM architecture.
As the name suggests, RNNs contain recurrent or cyclic connections that enable them to model complex time-varying data sequences with a wide range of temporal dependencies or correlations between them. In general, RNNs map a sequence of data to another sequence through time using cyclic connections, and constrain some of the connections to hold the same weights using back-propagation algorithm Rumelhart et al. (1988). However, the standard RNN architecture suffers from vanishing gradient problem when the gradient of some weights starts to become too small or too large Bengio et al. (1994). This leads to the development of improved RNN architectures which overcome the modeling issues of standard RNNs. One of the most successful forms of improved RNN architectures is the LSTM network, which solves the limitation of vanishing gradients Hochreiter and Schmidhuber (1997). In contrast to most of the ANN architectures, LSTM operates by cell states and gating mechanisms to actively control the dynamics of cyclic connections and thus, resolves the vanishing gradient issues. Similar to the standard RNNs, LSTM can learn and predict the temporal dependencies based on the input information and previously acquired information, i.e., the internal memory of LSTM allows the network to find the relationship between the current input and stored information to make a prediction.
The conventional LSTM architecture contains memory blocks in the recurrent hidden layers, which have memory cells to store the cell states and gates to control the flow of information. Each memory block has an input gate controlling the flow of input activations into the cell, a forget gate to adaptively forgetting and resetting the cell’s memory (to prevent over-fitting by processing continuous inflow of input streams), and the output gate controlling the output flow of cell activations into the next cell. In our LSTM architecture, we consider an input sequential data matrix for training, where is the lookback time-window. The lookback time-window, in our definition, means the time history over which the LSTM model does the training and prediction recursively. Indeed, increasing the value of increases the quality of training the model, but makes the model dependent on an increased number of initial states during prediction. The LSTM model is trained to an output sequential data matrix recursively (here, and ). Considering input gate as , the forget gate as , the output gate as , the cell activation vectors as , and the cell output as , the LSTM model does the mapping from the input states to an output state by using the following equations at any instance Sak et al. (2014); Yeo and Melnyk (2019):
[TABLE]
where represents the weight matrices for each gates, is the LSTM prediction, denotes the bias vectors for each gates, is the element-wise product of vectors, and is the logistic sigmoid function.
Similar to the ROM-GP methodology, the workflow of the ROM-LSTM framework consists of two phases as displayed in Figure 1. In the offline training phase, we first obtain POD basis functions and modal coefficients using Algorithm 1. The known time series of modal coefficients from training snapshots are used to train the LSTM model. Based on the values of , the input of the LSTM model will be the previous time states of the input modal coefficients for retained modes and the output of the model will be the next time state recursively for modes. Training LSTM model is the computationally heavier part of the ROM-LSTM framework, but this is done offline. In online testing phase, we recursively predict the modal coefficients for the total time using the trained model . The input of the trained model will be the initial states based on the preselected value of and the output will be recursive prediction of corresponding future time states. Thus, we bypass the physics-based Galerkin projection part with completely data-driven neural network approach to predict the modal coefficients. Also, the computational cost of prediction through trained LSTM network is significantly lower than the physics-based approach. Finally, we reconstruct the mean vorticity and streamfunction fields using inverse transform to analyze the behavior of the quasi-stationary flow. The key steps of the ROM-LSTM framework are outlined below in Algorithm 3.
To design our LSTM architecture for ROM-LSTM framework, we utilize Keras Chollet et al. (2015), a high level API designed for deep learning, combined with standard Python libraries. The FOM simulation for data snapshots generation and POD basis construction codes are written in FORTRAN programming language. We utilize a deep LSTM network stacking 6 LSTM layers with 40 neurons in each layer. The computational cost is found manageable in this deep architecture setup, which encourages us to perform all the numerical experiments with this same setup. The mean-squared error (MSE) is chosen as the loss function, and a variant of stochastic gradient descent method, called ADAM Kingma and Ba (2014), is used to optimize the mean-squared loss. The other relevant hyper-parameters utilized in our LSTM architecture are documented in Table. 1. The hyper-parameters are kept constant for all simulations to obtain a fair comparison between the results in different numerical experiment runs. It should be noted that the training data is normalized by the minimum and maximum of each time series to be in between the range .
V Numerical Results
The predictive performance of the ROM-LSTM framework is thoroughly examined in this section in terms of time series evolution of the modal coefficients and mean flow fields. It is well documented in literature that the ROM-GP framework is incapable of capturing mean flow dynamics for quasi-stationary flows using lower number of POD modes, and susceptible to instability San and Iliescu (2014, 2015). There have been a number of approaches proposed in previous literature to improve the ROM performance. One way to stabilize the ROM is by adding an empirical stabilization parameter based on the analogy between large eddy simulation and truncated modal projection Aubry et al. (1988); Wang et al. (2012). Later, it is found that the ROM performance further improves taking the optimal value for the stabilization parameter rather than selecting it arbitrarily Rempfer (1991); San and Iliescu (2014); Cazemier (1997); San and Iliescu (2015). In our previous work, we have shown that computing the stabilization parameter dynamically at each time step improve the ROM performance significantly Rahman et al. (2019). However, the proposed ROM-LSTM methodology has several advantages over the physics-based approaches, such as, no dependence on the underlying governing dynamical system to obtain the solution, i.e., the process is free of numerical constraints, no burden of adding stabilization parameter to account for instability issues and so on. To reach a conclusion about the performance of the ROM-LSTM framework, we compare ROM-LSTM predictions with the FOM simulation and the standard ROM-GP results. Moreover, we present the performance of the ROMs based on lookback time-window and LSTM training for different number of POD modes to show the robustness and capability of the proposed framework. Furthermore, we present the L2-norm errors to perform a quantitative assessment on the accuracy of the ROM-LSTM solutions with respect to ROM-GP solutions.
We choose the single-layer QG problem as our test bed to evaluate the performance of ROMs. Because of the complex flow behavior with wide range of scales, QG problem has been utilized as test problem in many notable literature Cushman-Roisin and Beckers (2011); Holm and Nadiga (2003); San et al. (2011); Cummins (1992); Greatbatch and Nadiga (2000). To make the analyses simple and easily understandable, we present simulation results only for Re and Ro flow condition, which can be considered turbulent enough and suitable for reduced order modeling. The FOM simulation is done from to using a constant time step of on a Munk layer resolving grid resolution (i.e., consisting of about four grid points in the Munk scale, i.e., ). To avoid the initial transient time interval, we store data snapshots from to to generate the POD bases and modal coefficients to train the LSTM model. We refer to Ref. Rahman et al. (2019) to get an idea on the POD analysis as well as the instantaneous vorticity field plots for the same flow condition. The computational domain of our test problem is . To understand the nature of the QG data set, we compute the Hurst exponent, , for the modal coefficients. The Hurst exponent is a statistical measure of the presence of long-term trends in a non-stationary time series Hurst (1951). Thus, the Hurst exponent can help in selecting the appropriate model for a given time series prediction. We also note that the Hurst exponent has been utilized in many research fields, e.g., hydrology, finance, and healthcare indsutry Chandra et al. (2013); Koutsoyiannis (2003); Lahmiri and Boukadoum (2014); Bassler et al. (2006); Carbone et al. (2004). can be statistically defined as Qian and Rasheed (2004):
[TABLE]
Here, E is the expected value of the ratio between the range of the first cumulative deviations from the mean and their corresponding standard deviations (SD), is the time span of the observations, and is constant. The range of is in between 0 and 1. means a persistent series (a strong trend in the time series at hand), means an anti-persistent series (a time series with long-term switching between high and low values) and indicates a random series (fewer correlations between current and future observations). Interested readers are directed to Ref. Mohan and Gaitonde (2018) for a detailed description of suitability of LSTM as a predictive modeling approach for different time series data using the measurement of . We calculate the for modal coefficients of QG data set for given flow conditions using the so called rescaled range (R/S) analysis, popularized by Mandelbrot and Wallis Mandelbrot and Wallis (1968); Mandelbrot and Wallis (1969). The details of (R/S) analysis can be found in Ref. Qian and Rasheed (2004). The Hurst exponents for the modal coefficients of QG case are tabulated in Table 2, where we can see that the values of are around 0.5. This indicates the randomness of the QG problem, which can be a good representative of large-scale quasi-stationary geophysical turbulent flow systems.
Figure 2 shows the mean streamfunction and vorticity field contours obtained by the ROM-GP model. To compare the predictive performance of the ROM-GP model with respect to the true solution, we include the mean contour plots of FOM simulation on the left column as well. We can see the full order solution displays a four-gyre circulation patterns for both mean streamfunction and vorticity fields. Since the instantaneous fields for the QG flow is always fluctuating in time, it becomes difficult to compare solutions of different models at the same time state. However, the mean fields always exhibit the four-gyre circulation for higher Re (highly turbulent regime, i.e., turbulence with weak dissipation) which implies a state of turbulent equilibrium between two inner gyres circulation representing the wind stress curl forcing and the outer gyres representing the eddy flux of potential vorticity (the northern and southern gyres found in geostrophic turbulence) Greatbatch and Nadiga (2000). In our study, the time-averaged (mean) field data are obtained by averaging between to . Another point to be noted in FOM field plots that the bright orange circulations in the four-gyres (top circulation of the inner gyres and bottom circulation of the outer gyres) indicate the circulation in counter clockwise or positive direction and the other two circulations signifies the circulation in clockwise direction. We can observe in Figure 2 that the ROM-GP simulations with and modes display a non-physical two-gyre circulation for streamfunction whereas the vorticity field does not capture almost any conclusive physical pattern. However, the results improve with increasing modes as we can see the streamfunction contour is showing clear four-gyre patterns even though the vorticity plots are very noisy compared to the true solution. These observations are supported by the time series evolution of first and tenth modal coefficient plots in Figure 3. It is apparent that the increasing modes stabilize the noise to reach a physical solution for both modal coefficients.
We note that the time scale in our formulation is normalized by to obtain dimensionless time unit. Following San and Staples (2013), typical oceanic values (e.g., km and m*-1s-1*) yield approximately year for . Therefore, a numerical simulation over 100 computational time units refers to the evolution of flow dynamics over 25 years in physical time. Therefore, the intermittent bursts appeared in the true projection of the most energetic mode (i.e., indicate the seasonal variations in QG dynamics. Although ROM-GP yields non-physical solution for and cases, series reaches more meaningful levels for and beyond. However, it is hard to claim from Figure 3 that the ROM-GP yields an accurate prediction of these seasonal bursts even for higher values.
We present the field contours obtained by ROM-LSTM based on different values in Figure 4. It can be seen that and do not provide much accurate results as the patterns get distorted in some extent even though they are being able to capture the four-gyre. However, both streamfunction and vorticity contours show a stable and accurate prediction of the true mean fields for and . Though the vorticity field contour is not displaying as smooth contour lines as the true solution due to the reduction of dimension order, it is showing a better performance compared to the ROM-GP solutions. As shown in the recent work of Yeo Yeo (2019), the LSTM network trained on a noisy data learns to reduce the contributions of noisy input data by developing its own dynamics and thus, the prediction remains close to the truth rather than being unstable due to noisy input data. Hence, the LSTM prediction is expected to yield a stable and physical solution for a fluctuating quasi-stationary system. It should be noted that these results are obtained for LSTM training with modes. The time series evolution plots for the modal coefficients based on and modes in Figure 5 show that ROM-LSTM time series predictions are almost on top of the true projection of modal coefficients. Even though the model is trained for to only, the ROM-LSTM model is able to obtain a stable and accurate prediction up to the final time .
Another impressive observation on the predictive capability of the ROM-LSTM framework is presented in Figure 6 where we show the mean field plots based on the number of modes retained to train the LSTM model. We keep for this numerical experiment. As we can see the ROM-LSTM model is being able to capture the four-gyre circulation even with only two modes. Indeed, the first few modes contain most of the dynamics in the system and we can also see reduction of some smaller scales for lower mode predictions. Nevertheless, this finding indicates the prediction capability of the ROM-LSTM framework to produce a stable solution of a noisy system. However, we have seen the ROM-GP model becomes unstable to predict noisy data set with lower number of modes which makes it very inefficient. In contrast, the proposed non-intrusive framework can be very efficient to produce stable solution with a very few modes. Since we observe promising predictive performance for training with 2 modes only, we present a couple of more analyses on results obtained by the ROM-LSTM framework retaining 2 modes for LSTM training. We can see in Figure 7 that lower values simulations are unable to capture the fluctuations along the mean and goes almost straight along the line after a few time states. The model starts to capture the fluctuating flow fields with the increase of values. The field plots in Figure 8 also displays the similar conclusions. Since the lower value solutions stay along the line around the mean (unlike rapid oscillations in ROM-GP solutions), the field plots still show the mean physics to some extent. It is obvious that the model with lower ignores most of the scales of the system. However, the prediction improves with higher as displayed in the Figure 8.
Finally, we include a comparison plot in Figure 9 where we present the first two modal coefficients prediction obtained by different ROM set up. The value is kept 5 for all the ROM-LSTM simulations. As expected, the ROM-GP solutions for 10 modes become totally non-physical and unstable. On the contrary, the ROM-LSTM predictions for , , and modes show a good match between the true solution and the prediction. For the quantitative assessment on the accuracy of both ROM-GP and ROM-LSTM frameworks, L2-norm errors of the reduced order models (with respect to FOM) for the mean vorticity and streamfunction fields are tabulated in Table 3. The root mean-square error or Euclidean L2-norm error is computed by:
[TABLE]
where and are the grid resolutions in and directions. For the vorticity field, the error i.e., the difference between the predicted mean and FOM solution mean is:
[TABLE]
For ROM-LSTM framework, the results are presented for modes. We can observe that the prediction accuracy increases with the increase in lookback time-window and we can obtain a more accurate result than the ROM-GP simulation with using only modes in ROM-LSTM framework. We present the CPU time per time step (between and ) for ROM-LSTM framework simulations based on modes and different in Table 4. We can observe a gradual reduction of computational time (for both training and testing) with lower values of . All the simulations of ROM-LSTM frameworks are done in Python programming environment and CPU time is computed as per time step. The computational time step is set to for online testing. In our FOM simulation in FORTRAN, the CPU time is about (between and ), where computational time step is set due to the CFL restriction of numerical stability for our explicit forward model on the resolution of . We refer to Ref. Rahman et al. (2019) to get an idea about the computational overhead for ROM-GP frameworks using same flow conditions. It should be mentioned that the ROM-GP computations are done using FORTRAN programming platform. Even so, we observe our ROM-LSTM CPU times are in the same order of ROM-GP simulations with modes. In Table 4, we show the computational cost for modes only since the CPU time (both training and testing) for other ROM-LSTM runs using different modeling conditions are in same order.
VI Summary and Conclusions
In this paper, we propose an efficient and robust fully non-intrusive ROM framework to capture the large spatio-temporal scale of fluctuating quasi-stationary systems. Due to the robustness and stability of LSTM recurrent neural network in predicting noisy dynamical systems, we consider LSTM architecture to develop our data-driven ROM, denoted as ROM-LSTM in this paper. As an example of large-scale turbulent flows exhibiting a wide range of spatio-temporal scales, we investigate the reduced order modeling of a simple general ocean circulation model, single-layer QG turbulence, to assess the predictive performance of our proposed ROM-LSTM framework. It was previously observed that the conventional physics-based (or intrusive) ROM of QG model requires a large number of POD modes to yield stable and physical flow dynamics. However, the proposed ROM-LSTM framework shows a very promising improvement in reduced order modeling that only a few modes are able to capture a physical solution without any prior knowledge about the underlying governing equations. We first demonstrate that the conventional Galerkin projection ROM approach yields non-physical predictions when we use a small number of representative modes. Although ROM-GP converges to a more physical solution when increasing the number of modes, it does not seem to capture the intermittent bursts appearing in the dynamics of the first few most energetic modes. However, the proposed ROM-LSTM approach is able to capture these bursts and yields remarkably accurate results even when using a small number of modes.
The proposed methodology consists of two phases: offline training and online testing or prediction phase. Initially, we collect the high-fidelity simulation or experimental data snapshots for a certain flow condition. The data snapshots are collected up to a certain time of the full order model simulation for training. Then we do a mapping of the high-resolution instantaneous data snapshots into a reduced order, i.e., low-dimensional space through POD transform. In this process, we generate POD basis functions of the field variables and time dependent modal coefficients for training the LSTM architecture. The LSTM architecture is trained for the modal coefficients based on a preselected lookback time-window, . In the online phase, the trained model is used to predict the modal coefficients recursively for the total time based on initial time history and . Finally, we reconstruct the mean fields for analyses using the predicted coefficients, precomputed basis functions, and mean field values.
We demonstrate the performance of the ROM-LSTM through time series evolution of modal coefficients and mean vorticity and streamfunction fields. To assess the performance of the proposed model, the ROM-LSTM predictions are compared with the high-dimensional solutions as well as with the conventional Galerkin projection based ROM (ROM-GP) solutions. We find that the ROM-LSTM predictions are stable and accurate even with only a couple of POD modes. On the other hand, the ROM-GP framework, as expected, requires a very large number of modes to obtain a physically stable solution, since the ROM-GP framework is susceptible to numerical instability in quasi-stationary flow fields. We further observe that the ROM-LSTM framework gives accurate and physical predictions based on a few time history data points. Indeed, if we increase the value of , the prediction accuracy will increase, but the computational cost of offline training and online prediction will also go up. To quantify the accuracy of the prediction of ROM-LSTM framework, we present the L2-norm errors for ROM-GP and ROM-LSTM frameworks, which show that the proposed framework trained with 10 modes and gets a better accuracy than the ROM-GP predictions with 40 or 80 modes.
Based on our findings, we conclude that the ROM-LSTM framework is a better tool than the ROM-GP framework for large-scale quasi-stationary flows in terms of prediction and reduced order modeling. Since the ROM-LSTM framework is fully non-intrusive, it does not rely on the governing equations to obtain the solution, which means that there are no numerical constraints while predicting the solutions. Additionally, it is computationally more efficient to predict the solution using a trained model rather than the physics-based approach of solving ODEs. Hence, the proposed ROM-LSTM framework can be considered a very promising approach in developing a robust and efficient ROMs for large-scale flows with noisy spatio-temporal behavior. In our future works, we will focus on testing the ROM-LSTM framework in more complex three-dimensional turbulent flow problems. We also plan to improve the existing framework based on our findings, and implement the updated framework in suitable ROM applications, such as, flow control, uncertainty quantification, and data assimilation.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Mack (2011) C. A. Mack, “Fifty years of Moore’s law,” IEEE Transactions on Semiconductor Manufacturing 24 , 202 (2011).
- 2Powell (2008) J. R. Powell, “The quantum limit to Moore’s law,” Proceedings of the IEEE 96 , 1247 (2008).
- 3Waldrop (2016) M. M. Waldrop, “The chips are down for Moore’s law,” Nature News 530 , 144 (2016).
- 4Theis and Wong (2017) T. N. Theis and H.-S. P. Wong, “The end of Moore’s law: a new beginning for information technology,” Computing in Science & Engineering 19 , 41 (2017).
- 5Kumar (2018) S. Kumar, in High-Speed and Lower Power Technologies: Electronics and Photonics , edited by J. H. Choi (CRC Press, Florida, United States, 2018).
- 6Sagaut (2006) P. Sagaut, Large eddy simulation for incompressible flows: an introduction (Springer Science & Business Media, Berlin, Germany, 2006).
- 7Tennekes et al. (1972) H. Tennekes, J. L. Lumley, J. Lumley, et al., A first course in turbulence (MIT press, Massachusetts, United States, 1972).
- 8Quarteroni et al. (2015) A. Quarteroni, A. Manzoni, and F. Negri, Reduced basis methods for partial differential equations: an introduction , vol. 92 (Springer, New York, United States, 2015).
