Optimized Model Predictive Controller Using Multi-Objective Whale Optimization Algorithm for Urban Rail Train Tracking Control

Longda Wang; Lijie Wang; Yan Chen

PMC · DOI:10.3390/biomimetics11010060·January 10, 2026

Optimized Model Predictive Controller Using Multi-Objective Whale Optimization Algorithm for Urban Rail Train Tracking Control

Longda Wang, Lijie Wang, Yan Chen

PDF

Open Access

TL;DR

This paper introduces an improved control system for urban trains that reduces energy use and improves performance using an optimized algorithm.

Contribution

A novel MPC-MOWOA control strategy with enhanced convergence and adaptability for urban rail train tracking.

Findings

01

The proposed control strategy reduces energy consumption by nearly 5.9% compared to Fuzzy-PID.

02

Integral of time-weighted absolute error is reduced by nearly 39.6% with the new method.

03

Punctuality, stopping accuracy, and comfort improve by 33.2%, 12.4%, and 7.1%, respectively.

Abstract

With the rapid development of urban rail transit, train operation control is required to meet increasingly stringent demands in terms of energy consumption, comfort, punctuality, and precise stopping. The optimization and tracking control of speed profiles are two critical issues in ensuring the performance of automatic train operation systems. However, conventional model predictive control (MPC) methods are highly dependent on parameter settings and show limited adaptability, while heuristic optimization approaches such as the whale optimization algorithm (WOA) often suffer from premature convergence and insufficient robustness. To overcome these limitations, this study proposes an optimized model predictive controller using the multi-objective whale optimization algorithm (MPC-MOWOA) for urban rail train tracking control. In the improved optimization algorithm, a nonlinear convergence…

Figures13

Click any figure to enlarge with its caption.

Funding6

—Liaoning Provincial Department of Transportation Scientific Research Project
—Liaoning Provincial Department of Education Scientific Research Project
—Dalian Universities Teaching and Scientific Research Ability Improvement for Excellent Young Teachers’ Project
—Chizhou University Nature Key Project
—Key Projects of Natural Science Research in Universities of Anhui Province
—Chizhou University Nature Transverse Project

Keywords

urban rail traintracking controlmulti-objective whale optimization algorithmmodel predictive controllercontrol parameters

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRailway Systems and Energy Efficiency · Railway Engineering and Dynamics · Advanced Data Processing Techniques

Full text

1. Introduction

Urban rail transit systems are distinguished by their efficient, low-carbon operation and high passenger capacity, and have become a core component and essential infrastructure of modern urban transportation networks. They serve as a key pathway to achieving low-carbon urban mobility and smart travel [1]. With the continuous advancement of automation and intelligence, passengers place increasing demands on the safety and comfort of train operations. As the core of automatic train operation systems, the optimization and precise tracking of speed trajectories directly determine overall operational performance. Although a variety of efficient speed trajectory optimization methods have been proposed in recent years, relying solely on the optimal speed profile is insufficient to ensure performance. High-precision trajectory tracking control strategies are also required to guarantee stable execution of the scheduled speed [2]. At present, PID control remains one of the most commonly used control strategies in urban rail train operation control systems. Owing to its error-feedback mechanism, PID control features a simple structure and ease of implementation, and has therefore been widely adopted in engineering practice [3]. However, conventional PID control lacks the capability to predict future system states, making it difficult to effectively cope with nonlinear characteristics and external disturbances under complex operating conditions. As a result, limitations persist in terms of speed trajectory tracking accuracy, energy consumption control, and passenger comfort [4]. To enhance control performance, recent studies have increasingly combined PID control with artificial intelligence techniques and meta-heuristic optimization algorithms [5], such as particle swarm optimization (PSO) and genetic algorithms (GA), to achieve intelligent tuning of controller parameters [6]. Although these approaches improve the robustness of PID control to some extent, their primary focus remains on parameter optimization. Consequently, they do not fundamentally address the lack of predictive capability in the control strategy, leaving further room for improvement in high-precision trajectory tracking under complex operating conditions. Therefore, the development of more efficient and robust control strategies that can maintain high-precision trajectory tracking under complex operating conditions is of great significance for improving the performance of urban rail transit systems.

The trajectory tracking control problem of urban rail trains is, in essence, a typical issue of optimal control and stability control. Such problems are also common in other domains. For example, Hentout et al. proposed a strategy combining shortest-path planning with fuzzy logic control to achieve efficient path tracking of mobile robots in both static and dynamic indoor environments [7]. Wen et al. developed an improved cascade control method for the single-point levitation system of maglev trains to ensure stable levitation [8]. Chai et al. presented an optimal trajectory tracking guidance approach based on receding horizon control for spacecraft reconnaissance missions, ensuring flight stability under complex dynamic conditions [9]. These studies indicate that, whether in robotic navigation, maglev systems, or spacecraft trajectory control, optimization and stability are consistently central objectives in control design. Accordingly, speed tracking of urban rail train under complex operating conditions must simultaneously consider operational efficiency and system stability.

Consequently, researchers have introduced methods such as optimal control and robust control into train operation control to compensate for the limitations of PID control. However, these approaches often rely heavily on precise mathematical modeling of the train system, a requirement that is difficult to fully satisfy given the complex and variable operational environment of urban rail transit [10]. In recent years, to further enhance the trajectory tracking accuracy and stability of automatic train operation systems, various advanced control methods have been proposed and investigated, including adaptive control, sliding mode control (SMC), iterative learning control (ILC), active disturbance rejection control (ADRC), and model predictive control (MPC). Specifically, Huang et al. proposed a parking control method based on robust adaptive backstepping, which significantly improved train stopping accuracy, though it imposes a high computational burden and still suffers from limited real-time performance under complex conditions [11]. Kang et al. designed a speed-tracking controller based on fuzzy SMC, effectively enhancing system robustness and tracking performance, but it is highly dependent on parameter tuning [12]. In addition, Liu et al. employed a multi-mass dynamic model to develop adaptive ILC, achieving precise tracking under velocity constraints; however, the high algorithmic complexity limits its real-time applicability [13]. Wang et al. incorporated the artificial bee colony algorithm into ADRC design to improve disturbance rejection performance, though the convergence speed of the algorithm remains a concern [14]. Overall, while these methods have advanced trajectory tracking and disturbance rejection performance, they generally suffer from limited real-time capability and strong parameter dependence. In contrast, MPC, with its superior multi-constraint handling and receding-horizon optimization features, can achieve higher accuracy and stability in trajectory tracking under complex operating conditions. For instance, Felez et al. applied MPC to train virtual coupling systems, significantly improving the coordinated control of multiple train speed trajectories [15], while Novak et al. proposed an energy-efficiency-prioritized MPC method, which effectively reduced traction energy consumption while maintaining smooth velocity profiles [16]. Therefore, owing to its outstanding performance, MPC has gradually become a research focus in the field of urban rail train tracking control. A comparison of urban rail train control methods is provided in Table 1.

MPC was initially widely applied in process industries and, owing to its advantages in receding-horizon optimization and multi-constraint handling, has gradually been extended to domains such as motor control, energy management, and vehicle control [17]. In the trajectory control of urban rail train, MPC predicts future operational states based on the train dynamic model and optimizes traction and braking commands in real time at each sampling interval, thereby achieving high-precision speed tracking. However, conventional MPC in complex operating environments, such as frequently varying gradients and curves, or open coastal lines-relies heavily on model accuracy and parameter settings, which can limit control performance. To address these limitations, various improved MPC (IMPC) methods have been proposed. For example, Zhou et al. developed distributed MPC (DMPC) to achieve coordinated local and global tracking, though its adaptability and accuracy remain limited [18]. Chen et al. designed hierarchical optimization MPC (HO-MPC), which performs well in trajectory planning and scheduling but depends on accurate models and lacks adaptive capability [19]. To enhance stability, Wang et al. combined fuzzy logic with robust predictive control (RFPC) to improve tracking accuracy on high-frequency lines, but real-time adaptability is still constrained [20]. Additionally, Gao et al. employed a Koopman-based model to develop collaborative braking MPC (CB-MPC), enhancing multi-train consistency control, although its generalization capability is limited [21]. Chen et al. proposed multi-objective parameter-tuning MPC (MOT-MPC), which improves responsiveness and stability but relies on offline parameter adjustment and lacks flexibility for dynamic environments [22]. Overall, IMPC has partially improved trajectory tracking performance, but it remains constrained by model dependency and parameter sensitivity.

Building on this foundation, researchers have recently attempted to integrate intelligent optimization algorithms into the MPC framework to further enhance its adaptability and global optimization capability in complex trajectory tasks [23]. For example, Nobahari and Nasrollahi embedded ant colony optimization (ACO) into nonlinear MPC to improve robustness and accuracy; however, the method is highly sensitive to parameter settings, making it less suitable for frequently changing train operating environments [24]. Diwan and Deshpande proposed a fast MPC approach combining parallel PSO with a divide-and-conquer strategy, which increases computational efficiency but still faces trade-offs between convergence speed and accuracy in multi-objective coordination [25]. Lin et al. designed a multi-objective PSO strategy for virtually coupled systems to achieve a compromise among energy consumption, safety, and tracking precision, yet its applicability to general trajectory tracking scenarios is limited [26]. Furthermore, Wang et al. employed wavelet neural networks to model nonlinear dynamics and integrated them with MPC; although this improves nonlinear response, the approach suffers from complex model training and limited robustness [27]. Among existing metaheuristic optimization algorithms, the Whale Optimization Algorithm (WOA) has attracted increasing attention in recent years due to its simple structure, limited number of control parameters, and its ability to maintain a favorable balance between global exploration and local exploitation [28]. These characteristics make WOA particularly suitable for integration into MPC frameworks, especially for parameter tuning problems that impose stringent requirements on real-time performance and constraint handling. Compared with GA and PSO, WOA exhibits lower dependence on parameter tuning and demonstrates better scalability when applied to complex control structures. Nevertheless, the standard WOA still inevitably suffers from issues such as premature convergence and degradation of population diversity in multi-objective optimization problems, which motivates further improvements to the algorithm and its deep integration with MPC in this study.

Biomimetics seeks to distill generalizable structural characteristics and behavioral principles from biological systems and transfer them into engineering-oriented frameworks for modeling, optimization, and control. In recent years, swarm intelligence optimization algorithms inspired by collective behaviors observed in nature have attracted growing attention, owing to their strong global search capability, adaptive behavior, and robustness when addressing complex nonlinear, multi-constraint, and multi-objective engineering problems. Representative approaches, such as PSO, ACO, and theWOA, emulate information exchange, cooperative search, and environmental adaptation among biological individuals, thereby enabling efficient solutions to challenging optimization tasks and providing an effective methodological bridge between natural intelligence and engineering control applications.

Although previous studies have made some progress in MPC parameter optimization and the application of metaheuristic algorithms, most existing work still focuses on single-objective optimization or multi-objective frameworks with relatively weak coupling between objectives. At the same time, no prior research has systematically incorporated the Tchebycheff decomposition method into the WOA for multi-objective tuning of MPC parameters in urban rail transit systems.

From the biomimetics perspective, the WOA is derived from an abstract representation of the cooperative bubble-net feeding behavior of humpback whales. By combining prey encircling, spiral updating, and stochastic search mechanisms, the algorithm achieves a dynamic balance between global exploration and local exploitation, reflecting the adaptive decision-making characteristics of biological swarms in complex environments [29]. The optimization process constructed on this behavioral principle exhibits strong capabilities in multi-objective optimization and constraint handling, enabling effective coordination among competing performance indices. As a result, it is particularly well suited for multi-objective tuning of MPC parameters in urban rail train trajectory tracking applications.

This paper proposes a model predictive controller with multi-objective whale optimization algorithm (MPC-MOWOA). Based on a multi-objective evaluation model constructed for urban rail train tracking control, the MOWOA is introduced to effectively solve the challenge of control parameter tuning. Furthermore, a MPC is developed based on an online adjustment mechanism, which incorporates a softening factor and a correction coefficient. The effectiveness of the proposed method in enhancing the trajectory tracking precision of urban rail train is verified through simulation results. The comprehensive framework underlying this research is depicted in Figure 1.

The main contributions are summarized as follows:

(I) This study proposes a multi-objective comprehensive evaluation framework for urban rail train tracking control. The proposed framework evaluates tracking control performance from multiple dimensions, effectively addressing the limitations of conventional methods that rely on a single performance metric to assess control effectiveness. By jointly considering tracking accuracy, comfort, energy consumption, and punctuality, the evaluation framework enables a more comprehensive and objective assessment of the practical performance of different tracking control strategies. As a result, it provides a unified and quantitative basis for performance comparison and control parameter optimization.

(II) This study proposes the MOWOA. By introducing a nonlinear convergence factor and an adaptive weighting mechanism, the global search capability is enhanced, enabling the tuning of the controller’s initial parameters and thereby significantly improving control performance. Meanwhile, improvements are also made to the MPC: a softening factor with online adjustment based on fuzzy satisfaction indices is introduced to dynamically balance constraint satisfaction and performance enhancement; in addition, an adaptive correction coefficient strategy is proposed, which ensures system robustness while accelerating the response speed.

(III) This study conducts hardware-in-the-loop simulation experiments (HILSEs) to validate the effectiveness of the proposed approach in real-world operating environments. The experimental results demonstrate that, compared with traditional MPC and other control methods, the proposed strategy achieves superior performance in terms of comfort, energy consumption, and tracking accuracy. Furthermore, this study, for the first time, integrates the improved multi-objective optimization algorithm with MPC for urban rail train tracking control, thereby verifying the effectiveness and superiority of the method under complex operating conditions.

The structure of the paper is arranged as follows. Section 2 introduces the multi-objective evaluation model. Section 3 presents the MOWOA for optimizing MPC control parameters. Section 4 presents the MPC incorporating the online adjustment mechanism combining a softening factor and a correction coefficient. Section 5 presents the simulation experiments and corresponding analysis. Finally, Section 6 summarizes the conclusions of this study.

2. Multi-Objective Evaluation Model for Urban Rail Train Tracking Control

2.1. Train Dynamics Model and Assumptions

To implement urban rail train tracking control, the system must adhere to the dynamic equations of train operation outlined below. These equations describe the longitudinal motion of the urban rail trains, illustrating how position, velocity, and the acting force are interconnected in the direction of travel. The model serves as the basis for subsequent control algorithm design and trajectory optimization. The system dynamics are given as:

[eqn]

where x represents the current location of the urban rail trains; t represents the real-time operation duration of the train; and V is the exact running velocity of the train; $[eqn]$ represents the force generated by the train; depending on the operating condition, it may act as either a driving or braking force; $[eqn]$ represents the total resistance experienced by the train, including both the basic resistance and additional resistance, expressed as $[eqn]$ . Here, $[eqn]$ represents the basic resistance at the current train speed, while $[eqn]$ represents the additional resistance at the current position, which accounts for external disturbances affecting train operation, such as gradient variations, aerodynamic fluctuations, and environmental factors.

[eqn]

where $[eqn]$ denotes the mass reflecting inertia of the urban rail train, y denotes the slewing mass factor of the train, and $[eqn]$ denotes the real mass of the train.

The function $[eqn]$ represents the interaction among the control input u, the train’s operating speed V, and its dynamic characteristics. Under traction mode, it reflects the traction performance of the train, whereas under braking mode, it characterizes the deceleration behavior. Here, u represents the control command applied to regulate the driving or braking system of the urban rail train [30].

2.2. Train Model

The train model serves as the foundation for automatic train operation and speed trajectory tracking control. However, due to system complexity and the coupled effects of multiple factors such as traction, braking, and resistance, it is challenging to obtain an accurate train dynamic model. The traction system of an urban rail train primarily consists of a control unit, braking resistors, and power converters; therefore, the train cannot be simply approximated as a single-point mass model. In practice, the traction force is generated by traction motors according to notch control commands, and the longitudinal motion of the train is realized under the combined influence of traction force and various running resistances.

Assumption 1. The train longitudinal dynamics can be approximated as a linear and time-invariant system within the studied time domain. Nonlinear characteristics, such as traction and resistance, are simplified through equivalent modeling or parameter linearization. Moreover, the collected operational experimental data are sufficient to represent the typical dynamic behavior of the system, making them suitable for system identification.

On the basis of Assumption 1, and using operational experimental data, the system is identified by the least squares method, yielding a second-order transfer function model of train longitudinal motion. Its general form can be expressed as follows:

[eqn]

where $[eqn]$ represents a fixed component corresponding to the train’s internal response, reflecting the intrinsic characteristics of the train that relate the traction level to the generated control force; $[eqn]$ represents a variable component corresponding to the train’s external response, capturing the extrinsic characteristics that describe the relationship between the control force and the train’s operational speed and position.

After establishing the train model, it is necessary to discretize the continuous time transfer function, since the model predictive controller requires a discrete-time representation. In this study, the zero-order hold (ZOH) method was adopted to convert the continuous transfer function into a discrete model with the specified sampling period, ensuring consistency between the plant model and the controller in the discrete domain.

It should be noted that the longitudinal train dynamics model employed in this study is a linear or quasi-linear approximation, used primarily for state estimation and control optimization within the MPC prediction layer, rather than an exact representation of the train’s dynamics under all operating conditions. The model parameters are obtained through system identification or empirical calibration and can reflect the average dynamic characteristics under typical operating scenarios. However, the model may not fully capture extreme nonlinear behaviors, such as wheel slip, spin, or emergency braking, and the uncertainty in model parameters has not been systematically verified in this study, which may have some impact on control performance. It is worth emphasizing that the inherent rolling optimization and feedback correction mechanism of MPC can partially mitigate the deviations caused by model simplifications and parameter uncertainties, thereby maintaining control performance stability and robustness while ensuring real-time computability.

2.3. Constraints and Evaluation Model for Urban Rail Train Tracking Control

In the process of train operation modeling and control design, in addition to accurately describing the train’s dynamic equations, it is also necessary to introduce a series of reasonable assumptions to ensure both the solvability of the model and the feasibility of simulation. These assumptions not only reflect the safety regulations and physical constraints inherent in practical operation but also incorporate appropriate idealizations and simplifications. In this way, the validity of the research outcomes is preserved while avoiding excessive model complexity that would hinder implementation. Specifically, this study clarifies the assumptions made with respect to speed constraints, resistance characteristics, acceleration and deceleration limits, operating mode transitions, and boundary conditions.

Assumption 2. The train speed at any position must not exceed the line speed limit to prevent safety risks caused by overspeeding. The total resistance acting on the train consists of basic resistance and additional line resistance, providing a more realistic representation of the forces encountered along the track. In addition, the absolute values of acceleration and deceleration must not exceed the prescribed maximum to ensure operational stability and comfort.

Assumption 3. The traction and coasting modes cannot be switched directly and must go through an intermediate transition process to avoid sudden impacts on the traction system and passengers. Furthermore, the train’s initial and terminal speeds are both set to zero, the initial time is zero, and the errors in stopping position and time must remain within the allowable limits to ensure that the train can safely stop at the designated location and satisfy scheduling requirements.

In this study, based on Assumptions 2 and 3, the train operation process can be mathematically described while ensuring both the solvability of the model and the feasibility of simulation. These assumptions provide explicit constraints on speed, resistance, acceleration and deceleration, operating mode transitions, and boundary conditions. Furthermore, the constraints listed in Equation (4) are treated as hard constraints, which must be strictly enforced in train tracking control to ensure operational safety, comfort, punctuality and all.

[eqn]

where $[eqn]$ and $[eqn]$ represent the actual speed and speed limit at the location point x, respectively; $[eqn]$ and $[eqn]$ represent the basic resistance under the speed value V and the additional resistance of the line at the location point x, which includes grade, curve and tunnel resistance, respectively; $[eqn]$ denotes the absolute magnitude of the train’s acceleration or deceleration during the k-th time cycle, while $[eqn]$ denotes the permissible upper bound for these dynamic responses. $[eqn]$ represents the rate of impact experienced by the train during the k-th cycle; $[eqn]$ represents the maximum train impact rate allowed; $[eqn]$ represents the manoeuvring mode under the k-th cycle, and there are five kinds of manoeuvring modes, namely, maximum driving force, partial driving force at constant speed, idling, partial braking force at constant speed, and maximum braking, which are denoted by 1, $[eqn]$ , 0, $[eqn]$ , and −1, respectively, where $[eqn]$ , $[eqn]$ ; X and $[eqn]$ represent the actual and expected running distances of the automatic train operation, respectively; $[eqn]$ and $[eqn]$ represent the actual and expected running times, respectively; $[eqn]$ and $[eqn]$ correspond to the maximum allowable position error and time error for stopping, respectively.

In addition to the hard constraints described in Equation (4), the urban rail systems must also satisfy supplementary performance constraints to ensure system stability, passenger comfort, and control accuracy. In particular, the integral of time-weighted absolute error ( $[eqn]$ ) is introduced as a commonly used performance index in tracking control. The $[eqn]$ provides a quantitative assessment of the tracking performance of automatic train operation. The specific formula are as follows:

[eqn]

where $[eqn]$ represents the absolute error between the actual tracking control speed of the train and the target speed under the actual running time t.

Assumption 4. At any position, the error between the controlled speed and the target speed must not exceed the prescribed maximum allowable value to ensure accurate tracking of the target speed profile. Both the control input and its increment should remain within the specified upper and lower bounds to prevent excessive or abrupt controller outputs that could compromise system stability or comfort. In addition, the integral of the absolute value of the $[eqn]$ must not exceed the prescribed threshold, ensuring that the cumulative effect of speed tracking errors over the entire operational cycle remains controllable.

[eqn]

where $[eqn]$ represents the maximum absolute error in the train’s actual speed relative to the target speed; $[eqn]$ and $[eqn]$ represent the control input and the control increment at the k-th time, respectively, here, k represents the discrete time index of the control system; $[eqn]$ , $[eqn]$ , $[eqn]$ and $[eqn]$ represent the upper and lower bounds of the control input and its incremental change, respectively. $[eqn]$ represents the maximum allowed time times error absolute value.

The urban rail train tracking control inherently requires the simultaneous assessment of multiple performance criteria. Accordingly, this study constructs a multi-objective optimization model that incorporates $[eqn]$ , stopping accuracy, punctuality, comfort, and energy consumption as the key control performance indices. The selection of these five objectives is not arbitrary but follows the regulations set by metro operation authorities and is closely related to the constraints defined in Equation (4). Specifically, the $[eqn]$ , as a typical performance indicator of trajectory tracking, quantitatively measures the effectiveness of speed curve tracking. The stopping accuracy corresponds to the deviation between the actual and desired stopping positions, ensuring accurate station stops. The punctuality corresponds to the time constraint, ensuring punctual operation. The comfort index corresponds to the acceleration constraint, reflecting the passenger riding experience. The total energy consumption corresponds to the resistance and additional energy demand of the train, reflecting the energy efficiency characteristic imposed by the resistance constraint. By optimizing these five objectives, the proposed model can comprehensively enhance train operation quality, comfort, punctuality, and energy consumption under strict constraint satisfaction. The evaluation model is formulated as follows:

[eqn]

where $[eqn]$ is the distance error of the train stopping position point; $[eqn]$ is the time error of the train stopping position point; $[eqn]$ is adopted as the comfort evaluation index in this study. Its definition is based on prior research and the international standard ISO 2631 [31], which evaluates passenger comfort according to the degree of discomfort induced by vibrations. According to ISO 2631 and related literature, passengers’ perception of comfort is not only influenced by the magnitude of acceleration but is also more sensitive to changes in acceleration. Therefore, in this work, $[eqn]$ is quantitatively defined as the absolute value of acceleration variation per unit distance or per unit time, providing a physically meaningful and widely recognized metric for assessing train comfort. E represents the total energy consumption, accounting for both traction and braking forces along the displacement of the train; $[eqn]$ represents a small incremental distance along the train’s path and serves as the integration variable.

Generally, a reduction in $[eqn]$ indicates an improvement in the dynamic tracking performance of the train speed profile, which typically contributes to lower energy consumption, enhanced comfort, and improved stopping accuracy and punctuality. However, these performance indices are not fully consistent under different operating conditions and may still involve certain trade-offs; therefore, multiple performance metrics need to be jointly considered in the optimization modeling process.

The $[eqn]$ , distance error $[eqn]$ , time error $[eqn]$ , comfort level $[eqn]$ , and energy consumption E are all evaluation indexes for tracking control of urban rail train, and they need to be made as small as possible [32].

3. Multi-Objective Whale Optimization Algorithm

3.1. Whale Optimization Algorithm

The whale optimization algorithm (WOA) is a nature-inspired optimization technique that simulates the foraging behavior of humpback whales and exhibits excellent optimization performance with strong global search capabilities [33]. During hunting, humpback whales not only encircle their prey but also approach it along a spiral-shaped trajectory while simultaneously shrinking the encircling circle.

The core idea of the basic WOA can be summarized into three main behavioral mechanisms:

(I) Encircling prey behavior: whales determine the distance to their prey from their current location and subsequently adjust their path to move toward it. This process can be mathematically represented as follows:

[eqn]

where $[eqn]$ represents the distance between the whale’s position $[eqn]$ and the best solution $[eqn]$ ; $[eqn]$ represents the position vector of the current optimal solution; $[eqn]$ represents the absolute value of the difference between $[eqn]$ and whale $[eqn]$ , $[eqn]$ represents the position vector of a randomly selected whale, H and C represent correlation coefficients, $[eqn]$ , $[eqn]$ , $[eqn]$ and $[eqn]$ represent random numbers within the interval (0, 1); a represents the convergence factor. p represents the behavior selection probability of the whale, $[eqn]$ ; $[eqn]$ represents the probability that the whale chooses to attempt encircling prey behavior, $[eqn]$ , $[eqn]$ represents the probability that the whale chooses to perform bubble-net spiraling hunting behavior; b represents the coefficient that controls the spiral shape adjustment, generally a constant 1; l represents a randomly chosen value within the interval (−1, 1).

(II) Prey search behavior: During the hunting process, humpback whales randomly search for prey. In the basic WOA, when $[eqn]$ , A leading individual in the search process $[eqn]$ is chosen randomly and the other whales update their positions based on the leader’s location. This guides the whales away from the current prey in order to find more suitable targets, aiming to enhance the algorithm’s global search ability. The position update formula for whale individuals performing random prey search in the basic WOA is presented below:

[eqn]

(III) Spiral position update: this simulates the whale approaching the prey along a spiral path, enhancing local search capabilities.

In summary, the WOA constructs an optimization framework that balances exploration and exploitation through the coordinated operation of the three core mechanisms described above. However, during the execution of the basic WOA, a linearly decreasing convergence factor is typically introduced to further enhance the algorithm’s adaptability and robustness.

[eqn]

where t is the present optimization iteration count, $[eqn]$ is the maximum optimization iteration count.

3.2. Tchebycheff Decomposition for Multi-Objective Optimization

Through decomposition, the inherently complex nature of multi-objective problems are reduced by breaking them down into scalarized subproblems that are easier to solve. Among these, the Tchebycheff decomposition method is particularly effective. The specific formulation of the sub-optimization problem based on the Tchebycheff decomposition is described as follows:

[eqn]

where $[eqn]$ represents the point of reference; $[eqn]$ represents the minimum value of each objective function up to the current point is taken as the optimal solution, n is the amount of objectives, x is the variables that determine the outcome; $[eqn]$ , $[eqn]$ and $[eqn]$ represent the i-th objective after and before normalization, respectively; while $[eqn]$ and $[eqn]$ represent the corresponding tolerance values, specifically the appropriate upper and lower bounds; $[eqn]$ represents the weight vector; $[eqn]$ represents the weight corresponding to the i-th objective; $[eqn]$ ; $[eqn]$ is the value of the aggregation function for decision variable x under the vector of weights $[eqn]$ and reference value $[eqn]$ .

For the single-objective subproblems derived from the Tchebycheff decomposition, the calculation formula for the weight corresponding to each objective in the weight vector is given as follows:

[eqn]

The optimal solution $[eqn]$ of the single-objective subproblem derived from the Tchebycheff decomposition with respect to the weight vector $[eqn]$ corresponds to the Pareto optimal solution. It represents the unique intersection point between the line $[eqn]$ in the n-dimensional space and the pareto front.

The decomposition-based method yields a group of mutually non-dominated optimal solutions corresponding to different weight vectors, making it difficult to select a preferred solution. To address this, a predefined objective weight vector $[eqn]$ is required, which is composed of the weight coefficients $[eqn]$ assigned to each optimization objective $[eqn]$ . $[eqn]$ , the corresponding weight coefficient $[eqn]$ reflects the relative importance of the optimization objective $[eqn]$ . Typically, $[eqn]$ is set. The final optimal solution x is selected from the set of optimized solutions $[eqn]$ as the one whose weight vector $[eqn]$ is closest to the objective weight vector $[eqn]$ , that is, the following equation holds.

[eqn]

where $[eqn]$ represents the cosine of the angle between the weight vector $[eqn]$ of the optimal solution x and the target weight vector $[eqn]$ .

3.3. Improved Multi-Objective Whale Optimization Algorithm

Building upon the fundamental mechanisms of the traditional WOA, this paper introduces a multi-objective whale optimization algorithm (MOWOA). This algorithm introduces a nonlinearly decreasing convergence factor based on the natural constant, as well as a variable rate adjustment factor. Additionally, to address complex multi-objective optimization problems, the Tchebycheff decomposition method is employed to decompose them into multiple independent single-objective subproblems, Compared with the weighted sum method and the boundary intersection approach, Tchebycheff decomposition is capable of effectively handling non-convex Pareto fronts and is insensitive to differences in the scales and dimensions of objective functions, thereby exhibiting favorable numerical stability. In addition, this method can be readily integrated with the rolling optimization mechanism of MPC, enabling improvements in trajectory tracking accuracy and control performance while maintaining optimization effectiveness and engineering feasibility. By employing this decomposition approach, MOWOA enhances its ability for both global search and local refinement, thereby improving the algorithm’s performance in multi-objective optimization. The specific formula for calculating the convergence factor a is given as follows:

[eqn]

where $[eqn]$ is the variable rate adjustment factor used for the nonlinear decrease of the convergence factor a. The adaptive decreasing trend of a can be adjusted through the factor $[eqn]$ . It is important to select a practical and problem-specific value of $[eqn]$ based on the characteristics of the optimization problem, to achieve better optimization performance. Therefore, this variable rate adjustment strategy can dynamically balance the convergence rate and convergence accuracy of the proposed method during the iteration process, thereby enhancing its global convergence capability to the greatest extent possible.

To enhance the tracking performance of the MPC under complex and dynamic operating conditions in urban rail systems, this paper proposes a method using MOWOA for optimizing control parameters. The design highlights of MOWOA lie in the introduction of a nonlinearly decreasing convergence factor based on the natural constant and a variable rate adjustment mechanism. Meanwhile, the Tchebycheff decomposition method is employed to transform the multi-objective optimization problem into multiple independent single-objective subproblems. These enhancements improve the global search and local refinement capabilities of the optimization process, allowing the MPC to obtain better-initialized parameters.

As shown in Figure 2, the overall optimization procedure of the MOWOA can be summarized as follows. The algorithm initializes the whale population and evaluates the multi-objective function values of each individual. The Tchebycheff decomposition method is then applied to construct reference points and weight vectors, converting the multi-objective problem into a set of scalar subproblems for fitness evaluation. During the iterative process, a nonlinearly decreasing convergence parameter based on the natural constant and dynamically updated coefficient vectors are used to regulate the search behavior, enabling alternation among encircling, random exploration, and spiral updating strategies to balance global exploration and local exploitation. An elite selection mechanism is further introduced to update the best individual and the population positions. Upon meeting the termination condition, the algorithm outputs the optimal solution set, thereby providing well-initialized MPC parameters and enhancing the tracking control performance of urban rail trains under complex operating conditions.

3.4. The Tuning Framework of Control Parameters for Multi-Objective Performance Optimization

The proposed MPC, which features simultaneous online adjustment of the control parameter softening factor and correction coefficient, encompasses seven parameters to be tuned. These parameters include the maximum value of the softening factor $[eqn]$ , the minimum value of the softening factor $[eqn]$ , the trend control factor of the softening factor $[eqn]$ , the speed error quantization factor $[eqn]$ , the initial value of the correction coefficient $[eqn]$ , the speed error rate quantization factor $[eqn]$ , and the proportional adjustment factor of the correction coefficient $[eqn]$ , along with the shape of their membership functions. The goal of tuning these parameters is to simultaneously optimize key evaluation indicators, including the distance error ( $[eqn]$ ), time error ( $[eqn]$ ), comfort index ( $[eqn]$ ), traction energy consumption (E), and the $[eqn]$ , to ensure optimal train tracking performance under varying operating conditions.

To achieve optimal automatic train tracking performance, the MOWOA based on Tchebycheff decomposition is employed to optimize and tune an initial set of control parameters. During the optimization process, whenever a candidate set of initial parameters requires evaluation, the optimization is temporarily paused. The urban rail train tracking control algorithm is then executed using this parameter set until the corresponding performance indicators are obtained.

In essence, MOWOA iteratively generates optimized solutions, while the tracking control algorithm evaluates each solution by simulating the tracking process under real urban rail operating scenarios using predefined target speed trajectory. The quality of each solution is assessed based on the Pareto optimality principle, the Tchebycheff aggregation function value, and its cosine similarity with the target weight vector ( $[eqn]$ ). The MOWOA is primarily employed for the offline self-tuning of key MPC parameters, while the MPC performs rolling optimization and real-time control during online operation based on these optimized parameters, thereby balancing control performance with computational efficiency. The schematic diagram illustrating the tuning process of initial control parameters based on the MOWOA is shown in Figure 3.

4. Model Predictive Controller

4.1. Dynamic Matrix Prediction Model

In recent years, the MPC has emerged as a powerful control strategy in urban rail transit systems due to its ability to predict future behavior and optimize control actions within system constraints. For the train tracking control, where precise speed and position tracking are essential for safety and energy efficiency, the MPC provides a systematic framework to handle multi-variable dynamics and time-varying operating conditions.

Among the many variants of MPC, dynamic matrix control (DMC) is one of the earliest and most practically implemented algorithms [34]. Although the proposed control strategy in this study is referred to as a model predictive controller, the implementation specifically adopts the DMC approach. This paper proposes MPC-MOWOA that incorporates online adjustment of control parameters to enhance adaptability and tracking performance in dynamic urban rail environments.

The DMC uses the step response of the system as the internal model to predict future outputs, which enables accurate modeling of plant dynamics using only a finite number of response samples. Given a modeling horizon of length N, the system’s dynamic properties can be described using the N discrete samples $[eqn]$ obtained from its unit step response. The prediction of the model’s output is carried out via the following formulation.

[eqn]

where $[eqn]$ represents the predicted output generated by the model at time step k; $[eqn]$ represents the initial value of the model; $[eqn]$ represents the control incremental sequence vector; $[eqn]$ represents the dynamic matrix, i represents the index value at some future time in the modeling time domain, $[eqn]$ .

4.2. Rolling Optimization

To effectively avoid severe fluctuations during the control process, the current output value $[eqn]$ of the DMC system should follow a predefined smooth reference trajectory toward the target value $[eqn]$ , thereby enhancing the robustness of the system. A commonly used form of such a reference trajectory is described below.

[eqn]

where $[eqn]$ represents the final output value expected to be achieved; $[eqn]$ represents the i-th softening factor, $[eqn]$ ; $[eqn]$ represents the actual output value of the system at the current control cycle (the k-th cycle); and $[eqn]$ represents the reference target value of the system, which is also the setpoint input value [35].

The formulation of a secondary objective function is critical to enhancing tracking performance within the rolling optimization framework of DMC. Assuming the length of the optimization horizon is P, the length of the control horizon is L, and $[eqn]$ , the following presents the detailed formulation of the secondary objective.

[eqn]

where $[eqn]$ denotes the error weighting matrix; $[eqn]$ denotes the control increment weighting coefficient matrix; $[eqn]$ $[eqn]$ denotes the anticipated control inputs sequence.

According to the extreme value theory, under unconstrained conditions, the essential condition for the objective function Y to achieve its minimum is $[eqn]$ . Thus, the optimal solution for the control sequence can be obtained through receding horizon optimization [36]. The optimal control sequence is computed according to the following formula.

[eqn]

On this basis, the actual control input $[eqn]$ is obtained. The detailed computational equation for the $[eqn]$ is described as follows.

[eqn]

During the following control interval, the $[eqn]$ -th cycle, the matching $[eqn]$ and $[eqn]$ is obtained using the aforementioned method. By repeating this process iteratively, rolling optimization of the actual control inputs can be achieved.

Although the current design does not explicitly include Lyapunov-based terminal constraints, these could be incorporated in future extensions to ensure closed-loop stability while optimizing multi-objective performance.

4.3. Feedback Correction

Any practical control system is subject to stochastic and uncertain disturbances, which increase the difference between the system’s real-time output and the expected reference, potentially leading to system instability. Feedback correction is a critical component of predictive control and is employed to mitigate the impact of disturbances and uncertainties on control performance, thereby achieving the desired control objectives.

The error $[eqn]$ , representing the discrepancy between the real-time output $[eqn]$ and its predicted counterpart $[eqn]$ at step k, is computed as.

[eqn]

Incorporating feedback correction, the predicted output $[eqn]$ is accordingly modified. The corresponding correction formula is formulated as follows:

[eqn]

where H represents the correction vector.

If the filter is designed in a first-order form, by $[eqn]$ , the correction coefficients are represented $[eqn]$ . In the context of the automatic train operation (ATO) system, the filter in the feedback correction part is typically designed as a first-order filter. In this case, the correction vector H is determined using the method $[eqn]$ . Here, the length of the correction vector H is N, and $[eqn]$ .

In the k-th cycle, the future prediction time points are shifted to $[eqn]$ . This means that the prediction values for the k-th cycle can be obtained by simply shifting $[eqn]$ , $[eqn]$ . Here, S represents the transformation matrix, where $[eqn]$ .

4.4. Online Adjustment of Softening Factor

In practical engineering applications, the urban rail train tracking control problem is inherently fuzzy in nature. To address this issue, a fuzzy satisfaction function is defined to quantify the degree to which the control objectives are achieved, and the control problem is reformulated as an optimization-based decision-making problem through fuzzy inference [37]. Specifically, the fuzzy satisfaction function consists of two components.

The first component is the system response time, which reflects the dynamic tracking capability of the control system with respect to the reference command. Its magnitude is closely related to the error between the actual output and the setpoint, as well as the rate of change of this error. The second component is the fuzzy satisfaction of the predicted outputs. At each sampling instant, the outputs at future time instants within the prediction horizon are evaluated, and corresponding membership functions are defined to characterize the degree of closeness between the predicted system behavior and the desired trajectory.

By combining these two components, a comprehensive fuzzy satisfaction index is obtained, which serves as a quantitative measure of the overall control performance. Based on this index, the softening factor can be adjusted online in real time, enabling the control system to adaptively balance speed tracking accuracy, energy consumption optimization, and ride comfort during dynamic operation, thereby enhancing the flexibility and stability of trajectory tracking control.

First, The system response time is related to the error between the actual output and the setpoint, as well as the rate of change of the error. For any given sampling time t, the estimated system response time $[eqn]$ is calculated using the following formula:

[eqn]

where $[eqn]$ and $[eqn]$ denote the error between the actual output and the setpoint, as well as the variation rate of the error signal, respectively; $[eqn]$ denotes the estimated system response time, defined as the time required for the system to reach the setpoint based on the absolute value of the current error derivative $[eqn]$ .

When conditions $[eqn]$ are satisfied, the system is considered to be in a stable state, with a system response time of zero. When condition $[eqn]$ occurs, the deviation from the setpoint in the system output shows a decreasing trend, and the system response time is calculated using formula $[eqn]$ . When conditions $[eqn]$ are fulfilled, the deviation from the setpoint in the system output shows an increasing trend, indicating that the system output requires a relatively long time to reach the setpoint. Therefore, the system response time is assigned a large positive value M.

Based on the inferred system response duration t at time $[eqn]$ , a fuzzy mapping function can be constructed to evaluate the degree of satisfaction with respect to how well the response time aligns with control objectives.

[eqn]

where $[eqn]$ and $[eqn]$ represent the fuzzy widths, indicating the degree to which the design requirements are satisfied. $[eqn]$ and $[eqn]$ represent the upper and lower bounds, respectively, of the desired system response time. As depicted in Figure 4, the fuzzy satisfaction level $[eqn]$ is defined over the estimated system response time $[eqn]$ via a membership function.

Second, a corresponding satisfaction function can be formulated for the forecasted outputs $[eqn]$ across future prediction steps following time t.

[eqn]

where $[eqn]$ and $[eqn]$ represent the fuzzy widths, indicating the degree to which the system meets the designer’s requirements. If $[eqn]$ and $[eqn]$ are both 0, it implies that the requirements for the system control objectives are strict. In the context of ATO tracking control, neither $[eqn]$ nor $[eqn]$ would be zero. $[eqn]$ and $[eqn]$ represent the upper and lower bounds of the desired design values, respectively. The membership function diagram illustrating the relationship between the forecasted output value $[eqn]$ and its fuzzy satisfaction degree $[eqn]$ is shown in Figure 5.

Based on fuzzy satisfaction degrees $[eqn]$ , and fuzzy inference can be applied to obtain the fuzzy satisfaction index $[eqn]$ for the control system’s performance in meeting the control objectives at the sampling time.

[eqn]

If the fuzzy satisfaction index $[eqn]$ is larger, it indicates that the system error is smaller, but the response time is longer. In this case, the softening factor should be appropriately reduced. On the contrary, if the fuzzy satisfaction index $[eqn]$ is smaller, when the system exhibits a larger control error despite a shorter response time, it becomes necessary to moderately increase the softening factor. To this end, an online adjustment mechanism for $[eqn]$ is developed based on the fuzzy satisfaction index $[eqn]$ . The corresponding computational expression is formulated as follows.

[eqn]

where $[eqn]$ and $[eqn]$ denote the upper and lower limits of the softening coefficient, respectively. The term $[eqn]$ acts as the gain parameter, influenced jointly by these boundary values. The variable $[eqn]$ serves as a shaping factor that governs the curvature of the adjustment function. e is the natural logarithm, where $[eqn]$ .

4.5. Online Adjustment of the Correction Coefficient

In general, for the feedback filter $[eqn]$ the larger the correction coefficient $[eqn]$ , the slower the system response, and the less sensitive it is to model mismatch; conversely, the smaller the correction coefficient $[eqn]$ , the faster the system response, but the more likely it is to cause system oscillations. Based on this, this paper proposes an online adjustment strategy for the correction coefficient. When the absolute error is large and the error is rapidly decreasing, as indicated by a relatively large negative rate of change, the system is responding too quickly. In this case, the response speed should be moderately reduced to avoid inducing oscillations, and the calibration parameter should be minimized. On the other hand, when the error is small but increasing rapidly, which corresponds to a large positive rate of change, it indicates a slow response. To prevent error amplification, the system’s response speed should be appropriately increased, and the calibration parameter should be set to a larger value. The correction coefficient adjustment $[eqn]$ is determined jointly by the absolute value of the error $[eqn]$ and the rate of change of the error $[eqn]$ , The specific fuzzy control rules are shown in Table 2.

The Table 2 defines the fuzzy inference rules used to determine the adjustment quantity of the correction coefficient based on the error $[eqn]$ and the error variation rate $[eqn]$ . Each table cell corresponds to a fuzzy rule output linked to linguistic labels spanning from negative big to positive big. The rules are constructed to ensure that the correction coefficient is adjusted adaptively according to the system’s error dynamics, thereby improving control performance.

The calculation formula for the online-adjusted correction coefficient is given as follows:

[eqn]

where $[eqn]$ represents the initial value of the correction coefficient.

As illustrated in Figure 6, the block diagram of the DMC controller is presented to provide a clear and intuitive understanding of its working principle and the interactions among its constituent modules. The diagram depicts the computational logic of the control process, including future output prediction, error evaluation, and the derivation of the optimal control sequence.

5. Hardware-in-the-Loop Simulation Experiment for Urban Rail Train Tracking Control

5.1. Primary Parameters for Urban Rail Train Tracking Control

This study adopts the scenario from the Dalian Urban Rail Transit Line 12, specifically the section between Lushun New Port and Tieshan Town, as the simulation object. The Dalian Rail Transit Line 12 extends from Hekou in Dalian High-Tech Industrial Zone to Lushun New Port in the Lvshun Economic and Technological Development Zone, comprising a total of eight stations. The main parameters and idealized operation results for urban rail train tracking control are listed in Table 3 and Table 4, respectively. In this study, the expected running time refers to the time required for a train to travel from the departure of the previous station to the arrival at the next station according to the schedule under normal operating conditions. This time is determined by the urban rail transit operator based on multiple factors, including line design, signaling systems, inter-station distance, and vehicle performance, and is approved by the relevant authorities as the basis for train scheduling and operation. The operations department is responsible for executing train operations according to this scheduled time. In train tracking control, the primary objective is to ensure that the actual arrival time closely follows the expected running time, thereby guaranteeing both operational safety and comfort.

As illustrated in Figure 7, the longitudinal profile of the section from Lvshun New Port to Tieshan Town comprises multiple gradient segments, speed-restricted zones, and station-related constraints. These factors jointly define the admissible speed envelope during train operation. The figure explicitly presents the spatial distribution of track gradients, the locations of speed limits, and station positions along the route distance, thereby providing the fundamental line characteristics and constraint inputs required for train speed trajectory planning.

As shown in Figure 8, the planned target speed trajectory strictly complies with both the maximum permissible line speed and the safety speed limits over the entire route. Pronounced deceleration behaviors are observed in speed-restricted sections and station approach areas, reflecting the imposed operational safety and stopping requirements. In contrast, within unconstrained sections, the train speed increases smoothly to balance operational efficiency and passenger comfort. Meanwhile, the corresponding operating-condition-distance curve characterizes the switching process among different operating modes, including traction, cruising, coasting, and braking. Consequently, the target speed trajectory and the operating-condition curve jointly constitute the ideal reference profiles adopted in the subsequent train speed tracking control design.

It should be noted that the above target speed trajectory is obtained under nominal operating conditions and idealized model assumptions. In practical train speed tracking control, the system is inevitably affected by variations in train mass, fluctuations in track resistance, uncertainties in vehicle dynamic parameters, and external disturbances. In addition, model mismatches and measurement errors may persist within the control system, leading to localized speed fluctuations along certain route segments. Therefore, in engineering practice, a reasonable level of deviation between the actual speed trajectory and the target profile is generally acceptable. The essential objective of the control strategy is to effectively suppress error growth and maintain the actual tracking trajectory in close proximity to the reference trajectory, thereby ensuring safe, smooth, and high-performance train operation.

5.2. Parameter Settings of the Scenario in This Study

To meet the practical requirements set by locomotive depots and metro operators, the performance parameters listed in Table 5 are determined based on actual operational standards. Specifically, the route length of 2940 m is obtained through field measurements, and the scheduled travel time of 195 s is defined according to the dispatching plan. The five performance indicators, including energy consumption, comfort level, tracking control comfort, distance error, and time error, are derived from routine operational data and validated computational models. These parameters serve as essential references for evaluating system performance and supporting subsequent optimization and control strategy development.

In this study, the parameters of the MPC and the MOWOA were derived from validated empirical values found in relevant literature and then appropriately refined to align with the operational objectives and constraints of the proposed control system [38]. Specifically, the following parameters were used: the train speed sampling period is 0.5 ms, with a control period of 1 ms. However, in practical engineering applications, the control cycle may be extended due to limitations in hardware processing capability and the presence of communication delays. Even so, it can still meet the typical requirements of actual train management systems, with control cycles of 50 ms or 100 ms, thereby ensuring rail vehicle speed tracking accuracy and overall system stability. The model length is set to 60, and both the prediction horizon and control horizon are set to 15. It should be noted that the model length, prediction horizon, and control horizon are all defined in units of discrete sampling steps (dimensionless), where each step corresponds to a sampling period of 0.5 ms. Therefore, the model horizon is equivalent to $[eqn]$ , while both the prediction and control horizons correspond to $[eqn]$ . The fuzzification width parameters are $[eqn]$ and $[eqn]$ as 0.003 s, and $[eqn]$ and $[eqn]$ as 0.05 km/h. The expected upper and lower bound values for $[eqn]$ and $[eqn]$ are 0.005 s and 0.08 s, respectively, and for $[eqn]$ and $[eqn]$ , they are $[eqn]$ km/h and $[eqn]$ km/h. The target speed $[eqn]$ is also defined. The whale population size NPw is 40, the number of iterations $[eqn]$ is 100, the probability of whales selecting prey-encircling behavior $[eqn]$ is 0.6, and the rate adjustment factor $[eqn]$ for the convergence factor a is 1.72 [39].

On this basis, in order to further improve system performance in terms of the five defined indicators, the control parameters of the MPC were optimized using the MOWOA. The initial parameter values were selected based on validated empirical studies and relevant literature, and were then refined through MOWOA to meet the multi-objective performance requirements of the proposed control system. The optimized control parameters are shown in Table 6. These parameters are fine-tuned to adaptively meet the multi-objective performance requirements under varying urban rail operating conditions. Furthermore, the corresponding membership functions for speed error, speed error rate, and correction coefficient adjustments are illustrated in Figure 9 below.

5.3. Design and Configuration of the Hardware-in-the-Loop Simulation Experiment

In this study, a hardware-in-the-loop simulation experiment (HILSE) incorporating a traction motor was established and implemented to validate the effectiveness of the proposed MPC strategy for urban rail train operation. The HILSE essentially constructs a virtual train operating environment, in which motor speed tracking is realized under a predefined speed scaling relationship, thereby emulating the train speed trajectory tracking process. Compared with purely numerical simulations conducted in a Simulink environment, the HILSE approach transforms the idealized signal connections and functional blocks in the control model into a physical closed-loop system consisting of real measurement, computation, actuation, and mechanical load components. As a result, a complete practical signal transmission chain and energy flow path are introduced into the control process, enabling the experimental platform to more closely resemble real-world engineering systems.

In practical urban rail applications, train speed trajectory tracking involves multiple performance objectives, time delays, and nonlinearities. Disturbances arise from both external operating conditions and intrinsic system characteristics. By integrating physical hardware into the simulation, the HIL methodology enhances the realism and credibility of controller performance evaluation compared to model-based simulations alone. The experimental platform, incorporating a host computer and HMI, enables control parameter deployment, data acquisition, and real-time monitoring, while speed commands, mode switching, and emergency stop interlocks ensure operability, repeatability, and safety-features often idealized in pure simulations. Moreover, the PMSM speed regulation system exhibits multi-objective dynamics, inherent delays, and strong nonlinearities, including electromagnetic effects (e.g., magnetic saturation) and mechanical influences (e.g., bearing friction, structural vibration), which are the main sources of disturbances and introduce uncertainties into the HILSE for urban rail train speed tracking.

The HILSE involve physical components such as controllers, and typically require embedding the control algorithm into the controller’s core chip while constructing a corresponding virtual trains environments [40]. This setup enables a more effective performance evaluation of the control strategy. The HILSE for urban rail train tracking control offers several notable advantages. First, it can realistically simulate network transmission delays as well as packet errors or losses. Second, it is capable of accurately replicating internal model mismatches and external disturbances commonly present in actual control environments. This allows the system to identify transfer functions that closely approximate real-world behavior. Third, the system can reflect sampling inaccuracies in speed and torque measurements [41]. Finally, compared with full-scale train experiments, the HILSE features lower experimental costs, more accessible test conditions, and reduced safety protection requirements [42]. In the development and production of actual tracking controllers, the HILSE play an indispensable role in evaluating the performance of tracking control algorithms.

As shown in Figure 10, both the speed tracking controller and the torque loading controller for the virtual environment are composed of a 24 V circuit board power supply, a driver circuit board, and a control circuit board that includes a core processing chip and an LCD display. These components are connected to the supervisory host computer via serial communication cables. The host computer is used to receive real-time feedback data from the tracking control operation and to send real-time torque loading commands corresponding to the virtual operating environment. Based on the host computer software interface and the LED display on the control board shown in Figure 10, the current position and speed of the train, as well as their deviations from the target position and speed, can be clearly observed.

The specific configuration of the HILSE used for urban rail train tracking control in this study is described below. The supervisory host computer is equipped with a CPU Core i9-7920X and runs MATLAB version 2016b. Both the speed tracking controller and the torque loading controller for the virtual operating environment are based on the TMS320F28335 core CPU chip. The programs for these controllers were developed using Code Composer Studio (CCS) version 5.5. The model of the LCD display on the control circuit board is 12864B V2.0, and communication is implemented via the RS-485 protocol. The torque and speed sensor, along with its measurement device, is model No. JN338. The sensor features a speed measurement range of 6000 rad/min and a torque measurement range of 20 $[eqn]$ . Both the tracking control actuator and the virtual environment actuator use permanent magnet synchronous motors (PMSMs) with a rated power of 750 W. These motors have a rated voltage of 220 V, a rated torque of 2.4 $[eqn]$ , and a peak current of 4.2 A. The operating voltage is 100 V, and the simulated speed-to-distance ratio is 10 rad/min to 1 km/h. The speed tracking control system for the PMSMs employs a triple closed-loop vector control structure. The current loop uses proportional-integral (PI) control, the inner speed loop adopts active disturbance rejection control (ADRC), and the outer torque loop utilizes the MPC introduced in this paper. For comparison purposes, other conventional tracking control algorithms were also implemented under the same framework.

5.4. Discussion and Analysis of HILSE Results for Urban Rail Train Tracking Control

Based on the scenario, to verify the effectiveness of the proposed MPC-MOWOA, the urban rail train tracking control was conducted using three control strategies: MPC optimized by PSO(MPC-PSO) [43], conventional MPC [44], and Fuzzy-PID control [45]. The Fuzzy-PID controller integrates PID control with fuzzy logic, adjusting PID parameters online through fuzzy rules to effectively handle system nonlinearity and disturbances. It was selected as a comparative algorithm because of its widespread application and representativeness in urban rail train speed control, providing a benchmark for evaluating the performance of the proposed predictive control method and highlighting its advantages in constraint handling and speed tracking accuracy. All comparative algorithms were tested under the same simulation environment, train model, operating conditions, and evaluation metrics, and fair and engineering-reasonable comparisons were ensured through appropriate parameter tuning. The simulation results are shown in Figure 11, Figure 12 and Figure 13 and Table 7 and Table 8.

As shown in Figure 11, Figure 12 and Figure 13, during the HILSE, the power supplies for the tracking control system, its driver, and the controller were activated. The real-time tracking control process satisfied the specified tracking constraints, data sampling and communication were smooth, and the host computer enabled real-time supervision of the tracking control process. Compared with other control algorithms used for comparison, the MPC-MOWOA is proposed in this study, which incorporates online adjustment of control parameters, demonstrated superior tracking performance.

As illustrated in Figure 11, Figure 12 and Figure 13, the MPC-MOWOA proposed in this study can adjust control parameters online based on real-time urban rail train tracking conditions, ultimately achieving better overall tracking control performance for urban rail train. The Figure 11 shows that although all four sequentially applied tracking control algorithms enable the train to operate safely and stably throughout the entire route, the proposed MPC-MOWOA method achieves a speed tracking profile that is closer to the target trajectory. It also demonstrates smoother speed response, smaller tracking errors, and effectively avoids the overshoot observed in static MPC. This performance advantage primarily stems from the combination of the online adjustment mechanism of the softening factor and the MPC parameters optimized by MOWOA, enabling the controller to better adapt to load disturbances and dynamic constraint variations caused by track grade changes, thereby maintaining higher trajectory tracking accuracy and control stability under complex operating conditions. The Figure 12 indicates that the proposed algorithm produces a tracking speed curve that better matches the target speed curve, with smaller speed errors at most time points. The Figure 13 demonstrates that the MPC-MOWOA generates a tracking time curve that is more consistent with the target time trajectory, and the actual stopping point is closer to the target stopping point (195 s, 2940 m), indicating better punctuality and precision in stop control performance.

Further analysis of Figure 11, Figure 12 and Figure 13 indicates that the proposed MPC-MOWOA exhibits more pronounced performance advantages in complex operating sections characterized by significant gradient variations and frequent transitions between traction and braking. In these sections, compared with the other control methods, the proposed approach achieves a smoother speed response and smaller tracking errors. This improvement can be attributed to the MPC parameters optimized by MOWOA, which enhance the controller’s ability to cope with load disturbances induced by gradient changes and variations in dynamic constraints, thereby maintaining higher trajectory tracking accuracy and control stability under complex operating conditions.

In Table 8, the $[eqn]$ represents the cosine of the angle between the weight vector $[eqn]$ of the optimal solution x and the target weight vector $[eqn]$ . Moreover, although $[eqn]$ is a single performance index, it can reflect comprehensive system performance.

As shown in Table 7 and Table 8, where the corresponding references of the compared control algorithms are explicitly provided in Table 7 and remain applicable to Table 8 due to identical algorithm configurations, with the Fuzzy-PID adopted as the reference benchmark, the proposed MPC-MOWOA demonstrates clear performance advantages across multiple key evaluation metrics. The results show that MPC-MOWOA achieves an energy consumption reduction of approximately 5.9%, indicating a notable improvement in energy utilization efficiency during train operation. With respect to speed tracking and stopping control accuracy, the time error and distance error are reduced by about 33.2% and 12.4%, respectively, which reflects enhanced tracking precision as well as improved stopping performance. In addition, the comfort index is improved by nearly 7.1%, suggesting that the proposed method effectively suppresses fluctuations during acceleration and deceleration processes, thereby enabling smoother train operation. From the perspective of overall control performance, the $[eqn]$ obtained using MPC-MOWOA is reduced by nearly 39.6% compared with that of the Fuzzy-PID, indicating a substantial enhancement in global tracking control quality. Moreover, the proposed approach exhibits improved multi-objective coordination capability, achieving a more balanced trade-off among competing performance objectives than the benchmark method.

Additionally, the simulation results indicate that MPC-MOWOA maintains strong comprehensive performance under complex operating conditions. By conducting coordinated multi-objective optimization of MPC parameters, the proposed method achieves evident improvements in speed tracking accuracy, energy consumption, and comfort. Although practical implementations may be influenced by factors including measurement noise, model uncertainties, and communication or computational delays, which may lead to some performance degradation compared with ideal simulation results, the overall control trends and performance improvement characteristics are expected to remain robust.

Overall, under various comparative criteria such as energy consumption, tracking accuracy, comfort, and comprehensive performance, MPC-MOWOA consistently delivers superior control performance. These results confirm the effectiveness and advantages of the proposed method in addressing the urban rail train tracking control problem, while also highlighting its potential value for practical engineering applications.

6. Conclusions

This paper proposes an optimized model predictive control using a multi-objective whale optimization algorithm (MPC-MOWOA) for urban rail train tracking control, with the objective of achieving coordinated optimization among multiple performance criteria, including tracking accuracy, energy consumption, punctuality, stopping precision, and comfort. Within the proposed control framework, an adaptive regulation mechanism based on a fuzzy satisfaction function is introduced to dynamically tune the softening factor in MPC. Meanwhile, the control parameters are adjusted online according to the speed tracking error and its rate of change, thereby effectively enhancing system stability and dynamic adaptability in the presence of model uncertainties, external disturbances, and varying operating conditions.

Considering the problems encountered in MPC parameter tuning, such as strong coupling among parameters, difficulty in manual tuning, and conflicts among multiple performance objectives, a MOWOA is used to optimize the MPC parameters. Furthermore, the algorithm is improved by incorporating a nonlinear convergence mechanism and the Tchebycheff decomposition method, which transforms the original multi-objective optimization problem into a set of scalar subproblems, thereby improving global search capability and convergence stability. Hardware-in-the-loop simulation experiments (HILSEs) conducted on the Lvshun New Port-Tieshan section of Dalian Metro Line 12 demonstrate that the proposed MPC-MOWOA achieves superior performance in terms of $[eqn]$ , energy consumption, punctuality, stopping accuracy, and comfort when compared with the benchmark methods. Moreover, the feasibility of this validation approach has been confirmed on an embedded processor (TMS320F28335), overcoming the common limitation in related studies that rely solely on theoretical MATLAB simulations. These results confirm the effectiveness, stability, and practical engineering value of the proposed approach in complex urban rail transit operating environments.

From the perspective of practical engineering applications, the proposed MPC-MOWOA framework is explicitly oriented toward real-world implementation. The multi-objective optimization process is carried out entirely in an offline environment and therefore does not impose any additional computational burden during the online operation of the MPC. Moreover, the proposed approach can be incorporated into existing ATO systems through parameter configuration, without requiring modifications to the core control architecture or the introduction of additional hardware resources. This characteristic ensures good compatibility with current train control systems and supports its practical deployability in engineering applications.

Although the proposed MPC-MOWOA demonstrates satisfactory performance in the conducted simulations, certain limitations still exist and require further investigation.

(I)The experimental validation in this study is currently limited to a hardware-in-the-loop simulation environment (HILSE), and no on-site experiments have been conducted on real urban rail train systems. Although HILSE can reflect the dynamic characteristics of control algorithms in an engineering-oriented environment to a certain extent, its idealized conditions cannot fully capture non-ideal effects encountered in real operation, such as sensor noise, sampling delays, actuator dead zones, and long-term operational disturbances. Consequently, the reliability and robustness of the proposed algorithm in real train operations still require further verification.(II)The longitudinal train dynamics model adopted in this study is a linear or quasi-linear approximation used for state prediction and control optimization within the MPC framework. This model is obtained through system identification or empirical calibration and can represent the average dynamic behavior under typical operating conditions. However, it may not fully capture extreme nonlinear phenomena, such as wheel slip, wheel spin, or emergency braking scenarios, which constitute a form of model mismatch.(III)The MOWOA may incur a significantly increased computational burden in high-dimensional objective spaces or large-scale train operation scenarios. Although the scenarios designed in this study satisfy the real-time requirements of the current HILSE platform, real-time control performance may be constrained in more complex or large-scale practical applications.(IV)In this study, the online adjustment of the softening factor and correction coefficients is based on predefined fuzzy satisfaction indices and velocity error characteristics, without incorporating fully adaptive or learning-based adjustment mechanisms. As a result, when operating conditions vary significantly, the proposed method may exhibit limitations in terms of long-term adaptability and generalization capability.

In view of the limitations discussed above, future research directions can be outlined as follows:

(I)Future studies could consider conducting experimental validation on actual urban rail trains or on more advanced HILSE platforms, in order to systematically evaluate the feasibility, reliability, and long-term stability of the proposed algorithm under real operating conditions. Additionally, upgrading the hardware platform to enhance the performance of embedded processors would provide support for the real-time implementation of more complex control algorithms.
(II)The current linear approximation model may not fully capture extreme nonlinear behaviors, such as wheel slip, spin, or emergency braking. Future research could incorporate nonlinear modeling, online system identification, or adaptive modeling techniques to improve the representation of complex operating conditions, thereby enhancing the robustness of the MPC under model uncertainties.
(III)Future work could explore fast optimization algorithms, distributed computing architectures, or adaptive multi-objective optimization methods to address the computational complexity associated with high-dimensional or large-scale train operation scenarios. Furthermore, integrating urban rail energy efficiency optimization with modern energy systems and intelligent logistics could allow investigation of coordinated operation strategies among train loads, power supply systems, and external energy networks (e.g., hydrogen, electric grid), thereby expanding the practical relevance and application value of the research.
(IV)In the present study, the multi-objective weights are kept fixed during control. Future research could explore dynamic or adaptive weight allocation strategies to achieve more flexible and efficient coordination among multiple performance indices, such as energy consumption, comfort, punctuality, and stopping accuracy, thus further enhancing the overall performance, robustness, and adaptability of the control system under complex operating conditions.

Bibliography45

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1Giannelos S. Konstantelos I. Pudjianto D. Strbac G. The impact of electrolyser allocation on Great Britain’s electricity transmission system in 2050 Int. J. Hydrogen Energy 202620215309710.1016/j.ijhydene.2025.153097 · doi ↗
2Zhou H. Wan Y. Ye H. Li B. Liu B. A novel real-time algorithm for optimizing train speed profiles under complex constraints IEEE Trans. Intell. Transp. Syst.2023247987800210.1109/TITS.2023.3264795 · doi ↗
3Havaei P. Sandidzadeh M.A. Intelligent-PID controller design for speed track in automatic train operation system with heuristic algorithms J. Rail Transp. Plan. Manag.20222210032110.1016/j.jrtpm.2022.100321 · doi ↗
4Bai Y. Cao Y. Yu Z. Ho T.K. Roberts C. Mao B. Cooperative control of metro trains to minimize net energy consumption IEEE Trans. Intell. Transp. Syst.2019212063207710.1109/TITS.2019.2912038 · doi ↗
5Aribowo W. The Conceptual Understanding of Metaheuristic Algorithms: A Brief Reviews Vokasi Unesa Bull. Eng. Technol. Appl. Sci.20263111
6Sabo A. Bawa M. Yakubu Y. Ngyarmunta A.A. Aliyu Y. Musa A. Katun M. PID controller tuning for an AVR system using Particle Swarm Optimisation Techniques and Genetic Algorithm Techniques; A comparison based approach Vokasi Unesa Bull. Eng. Technol. Appl. Sci.2025227028010.26740/vubeta.v 2i 2.36821 · doi ↗
7Hentout A. Maoudj A. Kouider A. Shortest path planning and efficient fuzzy logic control of mobile robots in indoor static and dynamic environments Sci. Technol.202427213610.59277/ROMJIST.2024.1.02 · doi ↗
8Wen J. Wang F. Stable levitation of single-point levitation systems for maglev trains by improved cascade control Sci. Technol.20242734836110.59277/ROMJIST.2024.3-4.08 · doi ↗