Multi-Modal Data-Driven Bayesian-Optimized CNN-LSTM Model for Slope Displacement Prediction

Xingwang Zhao; Xinlong Wan; Jian Chen; Chao Liu; Chao Chen

PMC · DOI:10.3390/s26051452·February 26, 2026

Multi-Modal Data-Driven Bayesian-Optimized CNN-LSTM Model for Slope Displacement Prediction

Xingwang Zhao, Xinlong Wan, Jian Chen, Chao Liu, Chao Chen

PDF

Open Access

TL;DR

A new model combining CNN and LSTM with Bayesian optimization improves slope displacement predictions using multi-modal data like rainfall and earth pressure.

Contribution

A novel Bayesian-optimized CNN-LSTM model that fuses multi-modal data for enhanced slope displacement prediction accuracy and stability.

Findings

01

The model achieved an average R2 of 0.971 with MAE of 0.444 mm and RMSE of 0.618 mm.

02

Compared to other models, it reduced MAE and RMSE by up to 32.3% and 29.5%, respectively.

03

Extrapolation prediction accuracy improved by 30.2% in MAE and 24.6% in RMSE using rainfall and earth pressure data.

Abstract

What are the main findings? A multimodal data-driven Bayesian optimized CNN-LSTM prediction model was constructed, which significantly improved the accuracy and stability of slope displacement time series prediction.The study verified that fusing multimodal data such as rainfall and earth pressure can effectively enhance the model’s ability to represent external influencing factors, thereby improving prediction stability. A multimodal data-driven Bayesian optimized CNN-LSTM prediction model was constructed, which significantly improved the accuracy and stability of slope displacement time series prediction. The study verified that fusing multimodal data such as rainfall and earth pressure can effectively enhance the model’s ability to represent external influencing factors, thereby improving prediction stability. What are the implications of the main findings? This provides a…

Figures11

Click any figure to enlarge with its caption.

Funding2

—Anhui Provincial Natural Science Foundation
—“the Fundamental Research Funds of the AUST”

Keywords

slope displacementmulti-modal dataBayesian optimization algorithmCNNLSTM

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsLandslides and related hazards · Rock Mechanics and Modeling · Machine Learning and ELM

Full text

1. Introduction

The stability of mine slopes directly affects the safety, production, and ecological environment of mines [1]. As slope geometry and stress conditions continuously evolve, coupled with factors such as rainfall infiltration, weak structural planes, and complex geological formations, the risk of geological disasters such as mine slope instability significantly increases, posing a serious threat to people’s lives, property, and the environment [2,3]. Therefore, high-precision slope deformation monitoring and prediction are important research topics to ensure safe operation in mines [4,5,6].

At present, the research on slope deformation prediction mainly includes model-driven methods and data-driven methods [7,8]. The model-driven method establishes a geomechanical model or empirical statistical model to analyze the influence of relevant factors on deformation and makes predictions, such as expert empirical models and landslide mechanism models [9]. However, due to the complex geological conditions and susceptibility to the coupling effects of multiple factors, these methods are often limited in applicability and prediction accuracy. The data-driven method utilizes data mining techniques to process and analyze historical deformation monitoring data in order to predict surface displacement [10]. Given the coupled multi-factor influences and the nonlinear, dynamically evolving behavior of mine slopes [11]. With the development of deep learning, data-driven methods, such as Extreme Learning Machines (ELMs) [12], Support Vector Regression (SVR) [13], and Recurrent Neural Networks (RNNs) [14], have gradually gained people’s attention in the field of hazard monitoring due to their powerful feature extraction and nonlinear fitting capabilities.

The SVR model can effectively capture the complex nonlinear relationship between multiple factors and landslide displacement through a kernel function mapping the original feature space to a high-dimensional feature space. It can demonstrate good generalization capability even with limited monitoring data. The ELM model with minimal hyperparameter tuning can offer high training efficiency and is suitable for real-time slope displacement prediction. However, both SVR and ELM treat landslide displacement prediction as a static regression task, which limits their ability to capture the inherent dynamic characteristics in monitoring time series [15,16]. Considering the nonlinear and dynamic evolution characteristics of landslide displacement, many scholars are increasingly focusing on RNN, especially LSTM. The LSTM, with gate-controlled structure and memory cells, can capture long-term temporal dependencies in sequential data [17,18,19]. Studies showed that it had a higher prediction accuracy than traditional RNN, SVR, Back Propagation Neural Networks (BPNNs), and Support Vector Machines (SVMs) approaches in the displacement prediction for landslides and mine slopes [15,20,21]. Meanwhile, the CNN had also been applied to landslide displacement prediction due to its advantages in multi-modal monitoring data fusion and feature extraction. It can effectively learn the complex nonlinear relationships between internal and external factors in multi-modal data and accurately identify local deformation characteristics [18,22].

Although previous studies had shown the benefits of data-driven methods, single prediction models (such as LSTM, CNN, etc.), due to their simple structure, cannot fully extract the complex and nonlinear features in deformation monitoring data, which limited further improvements in prediction accuracy [23]. In contrast, hybrid modeling methods integrated the complementary advantages of multiple algorithms, thereby improving prediction performance [24]. Especially the CNN-LSTM model, benefiting from its synergistic capability of extracting spatial features and capturing temporal dependencies, was well-suited for processing landslide monitoring data with complex spatiotemporal characteristics and has demonstrated excellent predictive performance in this field [25,26,27]. These hybrid prediction models can improve the prediction accuracy of the model, but at the cost of requiring more hyperparameters. Some studies have explored hyperparameter optimization strategies based on intelligent optimization algorithms, including genetic algorithms [28], particle swarm optimization [29], and grey wolf optimization [30], etc. However, these algorithms were prone to local optima, slow convergence, and low computational efficiency [31].

Given the above, we constructed a Bayesian-optimized CNN-LSTM model for predicting mine slope displacement. This model effectively integrated the synergistic advantages of CNN and LSTM and introduced a Bayesian optimization algorithm to adaptively optimize the model hyperparameters, thereby constructing a fully slope displacement prediction framework. The study systematically evaluated the feasibility and effectiveness of this method in a typical mine slope displacement prediction and achieved more accurate displacement prediction based on multi-modal data, including GNSS displacement, rainfall, and earth pressure.

2. Materials and Methods

2.1. CNN-LSTM Model

The CNN-LSTM model achieves feature extraction and temporal modeling of multi-modal time series data by concatenating CNN and LSTM. Its network structure is shown in Figure 1.

In the network structure of CNN-LSTM, the CNN is located at the front end of the network, which consists of an input layer, several hidden layers, and an output layer. The hidden layers include convolutional layers, activation functions (such as ReLU), and pooling layers. The convolutional layer, as the core component [32], performs convolution operations on multivariate time series, using Equation (1).

[eqn]

where $[eqn]$ denotes output feature data, $[eqn]$ represents convolutional kernel weights, $[eqn]$ signifies the convolution operation, $[eqn]$ indicates input data, and $[eqn]$ is the bias term.

When predicting slope displacement, GNSS displacement, rainfall, and earth pressure are used as input variables for the CNN model. By stacking multiple convolutional layers, CNN can progressively extract deep features from raw monitoring data that reflect the evolutionary characteristics of slope displacement [33]. Then the pooling layer performs dimensionality reduction on the features through downsampling operations, thereby reducing computational complexity and enhancing the model’s adaptability to local perturbations and scale variations.

As the backend module of the CNN-LSTM model, LSTM is mainly used to model the high-dimensional features output by CNN in time series to reveal the long-term temporal dependence of the slope displacement evolution process. The LSTM networks are one of the most widely used types of RNNs [34]. By introducing memory units and gating mechanisms, it effectively overcomes the problems of gradient vanishing and exploding during training [35], enabling its extensive application in sequential prediction tasks. An LSTM unit consists of a forget gate, an input gate, and an output gate [36]. These gating units jointly regulate the retention and forgetting of information, consequently determining the update of the memory cell state. At each time step $[eqn]$ , the hidden state $[eqn]$ of the LSTM is updated based on the current input feature $[eqn]$ and the previous state $[eqn]$ , and the memory cell state $[eqn]$ is updated simultaneously. The calculation process is shown in Equation (2).

[eqn]

where $[eqn]$ , $[eqn]$ , and $[eqn]$ denote the output values of the forget gate, input gate, and output gate, respectively; $[eqn]$ and tanh(⋅) represents the sigmoid and hyperbolic tangent activation functions, respectively; $[eqn]$ and $[eqn]$ corresponds to the weight matrices and bias vectors of the respective gating units; $[eqn]$ is the feature vector at time step t output from the CNN pooling layer.

The last layer of the CNN-LSTM model is a fully connected layer, which is used to map the high-dimensional features output by the LSTM to the final displacement prediction results.

2.2. Bayesian Optimization Algorithms

Hyperparameter optimization is crucial for improving the predictive accuracy, generalization capability, and robustness of the model [37]. To improve the hyperparameter optimization strategy, this study introduced a Bayesian optimization algorithm to overcome issues such as hyperparameters becoming trapped in local optima, consequently enhancing the optimality of hyperparameter selection and the overall performance of the model. The core idea of the Bayesian optimization algorithm is to construct a posterior probabilistic model of the objective function and to intelligently select the next evaluation point by means of an acquisition function, therefore efficiently searching for the global optimum of a complex objective function [38]. The algorithm fully exploits information from historical observations of slope displacement, earth pressure, and rainfall, guiding the search process to converge toward regions of the potentially optimal solution. This optimization algorithm is typically implemented based on a Gaussian Process (GP). Given a set of previously selected points $[eqn]$ and their corresponding observations $[eqn]$ , the probabilistic distribution of the objective function $[eqn]$ is characterized by defining a mean function and a covariance function [39,40,41]. Specifically, $[eqn]$ where $[eqn]$ and $[eqn]$ denote the mean function and the covariance function of the Gaussian Process, respectively, as shown in Equation (3).

[eqn]

where $[eqn]$ is the signal variance, which is used to control the range of fluctuations of the function values.

The posterior probability distribution of the objective function $[eqn]$ can be obtained through the Gaussian process model. In this study, the objective function is defined as the RMSE of the model under hyperparameter x, and minimizing the RMSE is taken as the optimization objective. In order to select the next optimal sampling point, the expected improvement (EI) is used as the acquisition function. By selecting the parameter point with the largest EI value as the next evaluation point, the optimal solution of the objective function can be efficiently approximated, as shown in Equation (4).

[eqn]

where $[eqn]$ denotes the best function value observed so far, i.e., the optimal value among the evaluated points, and $[eqn]$ represents a possible value of the objective function at point $[eqn]$ .

2.3. CNN-LSTM Model Based on Bayes Optimization

Bayesian optimization was employed to tune the hyperparameters of the CNN-LSTM model, in which three key hyperparameters, namely the number of hidden units (NumOfUnits), the initial learning rate (InitialLearnRate), and the L2 regularization coefficient (L2Regularization), are adaptively optimized. The model’s workflow is shown in Figure 2.

Firstly, target monitoring points were selected, and the raw multi-modal monitoring data were collected. Cubic spline interpolation and the Pauta criterion (3σ rule) were then applied to preprocess the raw data. The processed data samples were subsequently divided into a training set and a test set.

Secondly, the range of hyperparameter values for the CNN-LSTM model was defined. Then, the optimal combination of hyperparameters was optimized using a Bayesian optimization algorithm, which was saved and subsequently used to train the CNN-LSTM model.

Finally, the test set was fed into the CNN-LSTM model to generate predictions. The predicted results were then denormalized to obtain the final slope displacement prediction values. Subsequently, the predictions were compared with displacement reference values, and the model performance was evaluated using relevant metrics such as MAE, RMSE, and R^2^.

2.4. Model Evaluation Metrics

In this study, three metrics were selected to evaluate the prediction model performance: mean absolute error (MAE), root mean squared error (RMSE), and coefficient of determination (R^2^) [42]. The corresponding calculation formulas were presented in Equation (5):

[eqn]

where $[eqn]$ and $[eqn]$ denote the reference and predicted values of slope displacement, respectively; $[eqn]$ represents the mean reference value of the slope displacement; and $[eqn]$ is the number of samples.

Among them, smaller values of MAE and RMSE indicate better model performance. R^2^ is primarily used to assess the goodness of fit of the prediction model, and a value closer to 1 implies superior predictive performance.

3. Experimental Analysis

3.1. Data Sources

The monitored slope is located in the GuShan open-pit iron mine, and the studied area is a post-mining bench slope. The area is located in a subtropical monsoon climate zone, characterized by distinct seasons, abundant sunshine, ample rainfall, and the simultaneous occurrence of high temperatures and rainfall. The slope monitoring area extends approximately 1100 m in the east-west direction and about 1000 m in the north-south direction, with the lowest elevation of −180 m at the slope toe. Geometrically, the slope is composed of 17 bench levels, each with an average height of approximately 12 m, and the slope angle ranges from 35° to 45°. The large slope angles combined with the multi-level bench structure result in a high susceptibility to geological hazards, such as landslides.

From an engineering-geological perspective, the slope comprises an upper Quaternary cover overlying a bedrock slope. The Quaternary deposits mainly include water-bearing gravel-cobble layers, interbedded with silty clay and fine sand strata. The bedrock is dominated by andesite, volcanic breccia, and tuff; the upper benches are commonly fractured and strongly weathered, exhibiting scattered-fragmented to fractured blocky structures that reduce the overall strength of the rock mass. In addition, groundwater is present as pore water, confined pore water within gravel–cobble layers, and bedrock fissure water. These groundwater components can couple with rainfall infiltration, thereby altering internal stress states, which are relevant to subsequent deformation responses.

In this study, the monitoring data were collected long-term after mining ceased (The GuShan mine was operated from 1954 to 2014). Therefore, the primary external forcing considered in the prediction task is rainfall, together with the associated stress response reflected by earth pressure measurements. To acquire multi-modal monitoring data, we deployed one GNSS reference station (GSJZ), eight GNSS monitoring stations (JC01~~JC08), one rain gauge (YL01), and three earth pressure gauges (TYL01~~TYL03). The spatial distribution of these sensors was shown in Figure 3, and the parameters of the related instruments were listed in Table 1.

To enhance the ability of monitoring data to characterize the influence of slope displacement, multi-modal monitoring data were sampled at a uniform interval of 3 h, covering the period from 12 November 2022, to 20 January 2024. To ensure the continuity and reliability of the time series analysis, the missing data were completed using cubic spline interpolation, and outliers in the monitoring time series were identified and removed based on the Pauta criterion (3σ rule). Figure 4 shows the monitoring results of surface displacement, rainfall [43,44], and earth pressure [45,46] for the slope. As can be observed from the figure, the slope displacement has some small-amplitude fluctuations, with an overall variation ranging from −10 mm to 10 mm, forming an inverted “S”-shaped fluctuation pattern. The overall variation in earth pressure is relatively small, mostly concentrated between −10 kPa and 13 kPa. Rainfall shows pronounced spatial and temporal unevenness, with clear seasonal variability [47]. Among them, rainfall is usually one of the important external factors that induce and accelerate slope displacement. Rainfall infiltration leads to a decrease in matric suction and an increase in the bulk density of the slope body, resulting in a decrease in shear strength and an increase in sliding force, thus affecting slope stability. Meanwhile, earth pressure [48,49] can serve as one of the representative variables of the internal stress state and stress adjustment process of the slope body, and its changes can reflect the deformation trend and stability of the slope.

Considering the differences in deformation characteristics, stress conditions, and multi-modal monitoring data among various locations of the slope, monitoring points JC03 and JC05, located on platforms at different elevations, were selected as the study objects. These two points were chosen to capture both the overall deformation trend of the slope and the localized differential settlement characteristics. The remaining monitoring points were excluded from the analysis due to issues such as data loss, equipment failure, or spatial redundancy.

3.2. Correlation Analysis

Grey relational analysis is a correlation analysis method based on grey system theory [50], commonly used to quantify the interaction and influence between two data sequences. Generally, if the grey relational degree is greater than 0.6, it is considered that the two have a high correlation [51]. The multi-modal dataset used in this study included five variables. The displacement data from monitoring points JC05 and JC03 were the dependent variables, while the other three variables (rainfall, TYL01, and TYL02) were considered independent variables. Figure 5 shows the data sequences of slope displacement, rainfall, and earth pressure, as well as the grey relational coefficients between them. Taking monitoring point JC05 as an example, its grey relational degree with both rainfall and TYL02 was 0.82. As shown in the figure, the overall trends in the amplitude and periodicity between slope displacement and earth pressure were consistent. For example, around 12 January 2023, a decrease in earth pressure corresponded to a reduction in the magnitude of slope displacement, and vice versa. This phenomenon can be attributed to the fact that an increase in earth pressure led to higher shear stress within the slope, thereby resulting in greater displacement. Rainfall led to more pronounced abrupt changes in slope displacement. In addition, Figure 5 shows that the grey relational degrees among all sets of monitoring data exceed 0.6, further confirming that slope displacement variations are strongly influenced by rainfall and earth pressure and exhibit strong correlations.

3.3. Hyperparameter Optimization and Model Training

The training sets and testing sets were constructed by using 70% and 30% of the multi-modal monitoring dataset, respectively. Specifically, 2456 sets of monitoring data from 00:00 on 12 November 2022 to 21:00 on 14 September 2023 were used as the training sets for model training, while 1024 sets of monitoring data from 00:00 on 15 September 2023 to 21:00 on 20 January 2024 were used as testing sets for performance evaluation. The parameter settings of the Bayes-CNN-LSTM model are shown in Table 2.

Taking the slope displacement data from the monitoring point JC05 as an example, Table 3 presents the optimization process of the key model hyperparameters. The Bayes-CNN-LSTM model employed a sliding window mechanism for prediction. Based on multiple pre-experiments, the previous 20-step displacement sequence (lagged monitoring samples) and the rainfall and earth pressure values at the prediction time step were used as input to predict the displacement at the corresponding time step. The key hyperparameters of the model were adaptively searched within the parameter space using the Bayesian optimization algorithm. As shown in Table 3, the optimization process was carried out for ten iterations, at which point the objective function (RMSE) showed a convergence trend. The objective function value generally decreased with the increase in the number of iterations. Among them, the first iteration performed well due to the relatively reasonable initial parameter settings, but the fifth iteration obtained the smallest RMSE value, indicating that the model performance reached an optimum in this search. Therefore, the hyperparameter combination obtained in the fifth iteration was selected as the optimal configuration of the model, with the NumOfUnits, InitialLearnRate, and L2Regularization being 27, 0.011142, and 2.383 × 10^−7^, respectively.

3.4. Prediction Results and Accuracy Analysis

To verify the superiority of the constructed Bayes-CNN-LSTM model in terms of prediction accuracy, several typical prediction methods, including CNN-LSTM [52], LSTM [53], CNN [54], SVM [55], TCN [56], and Transformer [57], were selected for comparison. Slope displacement of monitoring points JC03 and JC05 was predicted and analyzed. The prediction values and the absolute prediction errors of each model are illustrated in Figure 6 and Figure 7. To quantitatively evaluate the predictive performance of different models, Table 4 and Table 5 show the MAE, RMSE, and R^2^ evaluation indicators of each model’s predictions at the two monitoring points.

As can be observed from Figure 6 and Figure 7, the prediction values of all models generally followed the actual displacement trend. However, compared with the CNN-LSTM, LSTM, CNN, SVM, TCN, and Transformer models, the Bayes-CNN-LSTM model exhibited the best fitting performance in the slope displacement evolution process, indicating that this model can effectively capture the main variation characteristics of slope displacement.

Figure 6h and Figure 7h show the distributions of the absolute prediction errors of different models at monitoring points JC03 and JC05. The results indicated that the Bayes-CNN-LSTM model yielded the smallest overall prediction errors at both monitoring points. Specifically, at monitoring point JC03, more than 90% of the absolute prediction errors were less than 1.1 mm, and approximately 75% were less than 0.7 mm. At monitoring point JC05, over 90% of the errors were within 1.0 mm, with about 75% being less than 0.6 mm. Overall, these results demonstrated that the Bayes-CNN-LSTM model was capable of accurately learning the local fluctuation characteristics during slope displacement evolution, maintaining small and stable prediction errors in most cases, and exhibiting excellent predictive performance.

As indicated by Table 4 and Table 5, the conventional baselines (LSTM, CNN, and SVM) exhibited a certain level of predictive capability, but there was still room for improvement. Compared with these models, the CNN-LSTM model enhanced prediction performance, demonstrating that when faced with complex input features, CNN-LSTM can achieve collaborative modeling in both space and time, enabling more effective learning and training. In addition, we observed that the Transformer-based model delivers consistently strong and stable performance across monitoring points, whereas the TCN baseline is also competitive but exhibits point-dependent variability. By incorporating the Bayesian optimization algorithm, globally optimal hyperparameters can be identified, which further improved the prediction accuracy of the Bayes-CNN-LSTM model. For instance, at point JC03, the MAE, RMSE, and R^2^ values reached 0.470 mm, 0.660 mm, and 0.964, respectively; at point JC05, the MAE, RMSE, and R^2^ values were 0.417 mm, 0.576 mm, and 0.978, respectively. In summary, at point JC03, the MAE of the Bayes-CNN-LSTM model was decreased by 27.9%, 32.8%, 34.2%, 30.0%, 25.3%, and 23.5% compared with the CNN-LSTM, LSTM, CNN, SVM, TCN, and Transformer models, while the RMSE was decreased by 22.1%, 27.9%, 28.9%, 22.0%, 19.8%, and 17.0%, respectively. At point JC05, the MAE was decreased by 22.2%, 29.7%, 30.4%, 18.2%, 24.0%, and 11.8%, and the RMSE was decreased by 18.1%, 25.8%, 30.0%, 13.9%, 21.5%, and 7.8%, respectively. These results indicated that the Bayes-CNN-LSTM model can effectively capture slope displacement characteristics at different monitoring points, exhibiting superior prediction accuracy and stability.

3.5. Extrapolation Prediction Results and Analysis

Currently, displacement information remains the primary data source for slope monitoring. However, in practical engineering applications, GNSS displacement observations are often affected by certain factors such as equipment failures, signal obstruction, and complex observation environments, which can lead to some problems such as data loss, degraded data quality, and even monitoring failure. Meanwhile, extrapolative prediction of slope displacement has more important practical significance for disaster prevention and reduction. Therefore, two experiments were used to analyze the extrapolation prediction capability of the Bayes-CNN-LSTM model when GNSS is unavailable. In Experiment I, primarily analyzed the contribution of different external influencing factors to the extrapolation prediction based on the Bayesian-CNN-LSTM model, while Experiment II analyzed the predictive performance of the Bayes-CNN-LSTM model and other typical models. Both experiments were conducted based on trained models, using the same data with an interval of 3-h to predict slope displacement over the subsequent 24-h (8 steps) and 48 h (16 steps).

In Experiment I, we analyzed the contribution of different external influencing factors (rainfall and earth pressure) to slope displacement prediction by controlling the input feature set of the Bayes-CNN-LSTM model. Specifically, four input configurations were designed: (1) Scheme I used only the lagged displacement sequence as input; (2) Scheme II introduced rainfall data based on the lagged displacement sequence; (3) Scheme III introduced earth pressure data based on the lagged displacement sequence; (4) Scheme IV integrated the lagged displacement sequence, rainfall data, and earth pressure data simultaneously. Here, the lagged displacement sequence refers to the previous 20 samples with a 3-h interval (i.e., a 60-h time window), covering the period from 12:00 on 18 January 2024 to 21:00 on 20 January 2024. The results were shown in Figure 8 and Figure 9.

As can be seen from Figure 8, Scheme III, by introducing earth pressure data, significantly improved the model’s accuracy in predicting peak displacement and inflection points, with its predictions more closely matching the actual reference values at these key locations. After using rainfall data, Scheme II effectively enhanced the model’s ability to capture the overall slope displacement trend, with its predictions more closely resembling the actual reference values compared to Scheme III. Although introducing only one external factor (rainfall or earth pressure) in schemes II and III can improve model performance to some extent, the prediction error still shows significant fluctuations. Due to the comprehensive consideration of the effects of rainfall and earth pressure, Scheme IV got the best balance between trend fitting and detail capture, thus having the highest overall prediction accuracy.

The MAE and RMSE values of four schemes are shown as in Figure 9. Overall, compared with Schemes II and III, the MAE of Scheme IV was decreased by 14.5% and 47.1%, respectively, and RMSE decreased by 0.3% and 50.4%, respectively. And compared with Scheme I, MAE and RMSE were decreased by 53.5% and 49.6%, respectively. This fully demonstrated that external influencing factors played a crucial role in slope displacement prediction and that the reasonable fusion of rainfall and earth pressure information can improve the model’s extrapolation prediction performance and robustness.

Based on the above conclusions, Experiment II was conducted using lagged displacement sequence, rainfall data, and earth pressure data as inputs to further analyze the extrapolation performance of the Bayes-CNN-LSTM model and other typical models. The extrapolation prediction results are presented in Figure 10 and Figure 11.

As shown in Figure 10, the prediction accuracy of all models decreased with increasing extrapolation time and accumulating errors. The prediction error from the previous time step was propagated and accumulated as input for subsequent steps; therefore, the longer the prediction horizon, the more likely the uncertainty and bias will increase. However, the Bayes-CNN-LSTM model’s prediction results (red curve) still had relatively smaller prediction errors. Furthermore, it maintained high consistency with the displacement reference values in both long-term trends and short-term fluctuations, demonstrating excellent feature-capturing capability, especially at detailed points such as displacement turning points. In contrast, the prediction results output by some comparative models were smoother, with smaller fluctuations, and exhibited a certain degree of “under-sensitivity” to actual displacement changes. The shaded region in Figure 10 represented the 95% confidence interval of predictions from the Bayes-CNN-LSTM model constructed by quantile regression, which was used to characterize the uncertainty of the model’s prediction results.

Figure 11 shows the comparison of MAE and RMSE for each model with step lengths of 1 (3-h), 2 (6-h), 4 (12-h), 8 (24-h), 12 (36-h), and 16 (48-h). Overall, the Bayes-CNN-LSTM model demonstrated stable performance in short-to-medium-term extrapolation, especially at point JC03, achieving superior error levels across different step lengths. For example, when the extrapolation prediction step length was 8 (24-h), the MAE and RMSE of monitoring point JC03 were 1.20 mm and 1.46 mm. Compared with the CNN-LSTM, LSTM, CNN, SVM, TCN, and Transformer models, the MAE was decreased by 7.7%, 18.4%, 45.2%, 43.4%, 20.0%, and 43.7%, while the RMSE was decreased by 11.0%, 15.1%, 41.6%, 37.9%, 16.6%, and 39.9%, respectively. When the step length was further extended to 16 (48-h), the MAE and RMSE remained at 2.22 mm and 2.58 mm, respectively, indicating that the overall error growth was controllable and the trend tracking ability was strong. At point JC05, with a step length of 8 (24-h), the MAE and RMSE were 0.47 mm and 0.64 mm, decreased by 48.4%, 19.0%, 32.9%, 26.6%, 33.8%, and 63.0% in MAE and 35.4%, 20.0%, 20.0%, 15.8%, 42.9%, and 61.4% in RMSE compared with the other models, respectively. When the step length was 16 (48-h), some baseline models showed locally smaller values in MAE and RMSE, but as seen in the extrapolation curves in Figure 10, their output tended to be smoother, failing to adequately respond to local fluctuations and turning points. In contrast, Bayes-CNN-LSTM maintained better trend tracking and fluctuation characterization capabilities within the 36–48 h prediction range.

In brief, rainfall and earth pressure information both played a certain role in displacement prediction, and considering their combined effects can improve the sensitivity and adaptability of the model to complex nonlinear changes in slope displacement. And the Bayes-CNN-LSTM model maintained relatively strong stability and predictive capability under a GNSS-unavailable environment, enhancing the applicability of slope safety monitoring systems.

4. Conclusions

To improve the accuracy and stability of slope displacement prediction, a Bayesian-optimized CNN-LSTM prediction model is constructed in this study. This model effectively integrates the advantages of CNN and LSTM and introduces a Bayesian optimization algorithm to adaptively optimize the model hyperparameters. We analyze the predictive performance of the constructed model using multi-modal monitoring data from the Gushan slope, and the main conclusions are summarized as follows:

(1)The Bayesian-optimized CNN-LSTM model easily avoids the problem of hyperparameters becoming trapped in local optima. Experimental results show that this model performs well in terms of both prediction accuracy and fitting effect and can effectively predict slope displacement.
(2)Compared with other mainstream models, the constructed Bayes-CNN-LSTM model shows higher prediction accuracy. At monitoring point JC03, the MAE and RMSE are 0.470 mm and 0.660 mm, respectively, and the average decreased by 29.0% and 23.0% compared with the comparison models. At point JC05, the MAE and RMSE are 0.417 mm and 0.576 mm, respectively, and the average decreased by 22.7% and 19.5%.
(3)Regarding the influence of external factors on slope displacement prediction, experimental results show that the MAE and RMSE of the model predictions are 0.47 mm and 0.64 mm when using both rainfall and earth pressure, respectively. Therefore, the proper integration of multi-modal data can effectively improve the performance of slope displacement prediction models.
(4)The Bayes-CNN-LSTM model exhibits good extrapolation capability, demonstrating better prediction accuracy and stability even at longer prediction step lengths. For example, in the prediction step length of 8 (24-h), the MAE and RMSE at monitoring points JC05 are decreased by 30.2% and 24.6%, respectively. Future work will focus on collecting data from multiple slopes for generalized validation and simultaneously explore the integration of transfer learning and physical priors to improve applicability across different scenarios.

Bibliography57

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1Cao H. Ma G. Liu P. Qin X. Wu C. Lu J. Multi-factor analysis on the stability of high slopes in open-pit mines Appl. Sci.202313594010.3390/app 13105940 · doi ↗
2Li Z. Li W. Xu Q. Lu H. Fu H. Guo P. Zhao J. Yu C. Preliminary analysis of the catastrophic February 22nd 2023 Xinjing open-pit mine landslide, Inner Mongolia, China Landslides 2024211053106710.1007/s 10346-024-02229-5 · doi ↗
3Wang Q. Xing A. Xu X. Zhou Y. Yang Q. Song H. Peng J. Zhai L. Bilal M. Liu Y. Formation mechanism and dynamic process of open-pit coal mine landslides: A case study of the Xinjing landslide in Inner Mongolia, China Landslides 20242154155610.1007/s 10346-023-02193-6 · doi ↗
4Özşen H. Kaygusuz B. Experimental analysis of various deep learning methods for predicting displacements in an open pit coal mine Nat. Hazards 2025121206292065410.1007/s 11069-025-07629-x · doi ↗
5Dey P. Chaulya S.K. Kumar S. Hybrid CNN-LSTM and Io T-based coal mine hazards monitoring and prediction system Process Saf. Environ. Prot.202115224926310.1016/J.PSEP.2021.06.005 · doi ↗
6Gao Y. Chen X. Tu R. Application of dynamic optimization time-delay GM(1,2) model in landslide displacement prediction considering the influence of rainfall Acta Geod. Cartogr. Sin.2022512183219510.11947/j.AGCS.2022.20220290 · doi ↗
7Wang J. Jaboyedoff M. Chen G. Luo X. Derron M.-H. Hu Q. Fei L. Prajapati G. Choanji T. Luo S. Landslide susceptibility prediction and mapping using the LD-Bi LSTM model in seismically active mountainous regions Landslides 202421173410.1007/s 10346-023-02141-4 · doi ↗
8Wu D. Zhou B. Zimin M. Prediction of landslide displacement based on the CA-stacked transformer model Alex. Eng. J.202512438940310.1016/j.aej.2025.03.140 · doi ↗