Multi-UAV Cooperative Path Planning Using a Behavior-Adaptive Aquila Optimizer Under Multiple Constraints
Xiaojie Tang, Chengfen Jia, Pengju Qu, Qian Zhang, Pan Zhang

TL;DR
This paper introduces a new optimization algorithm, EAO, which improves multi-UAV path planning by adapting behaviors during the search process.
Contribution
The novel Behavior-Adaptive Aquila Optimizer (EAO) integrates multiple strategies to enhance search performance in constrained multi-UAV path planning.
Findings
EAO outperformed 13 other optimization algorithms on benchmark tests and achieved the best overall performance ranking.
EAO improved average fitness by over 80% compared to the original AO across different map environments.
EAO demonstrated superior performance in path-planning simulations with UAV dynamic constraints.
Abstract
Addressing the challenges of high dimensionality, strong nonlinearity, and multiple constraints in multi-UAV cooperative path planning, this paper proposes a Behavior-Adaptive Aquila Optimizer (EAO) achieved by enhancing Aquila Optimizer (AO). EAO constructs a multi-strategy cooperative framework that integrates a periodic diversity maintenance mechanism, a diversity-based dynamic neighborhood guidance mechanism, a narrowed exploitation behavior based on neighborhood differential evolution, and a search-state-aware adaptive behavior selection mechanism. Through dynamic behavior adjustment during the search process, the proposed algorithm improves search performance and stability. To validate its effectiveness, EAO was systematically evaluated on the CEC2017 and CEC2020 benchmark suites and compared with the original AO and 13 representative high-performance optimization algorithms.…
Click any figure to enlarge with its caption.
Figure 1
Figure 2
Figure 3
Figure 4
Figure 5
Figure 6
Figure 7
Figure 8
Figure 9
Figure 10
Figure 11
Figure 12
Figure 13
Figure 14
Figure 15
Figure 16
Figure 17
Figure 18
Figure 19
Figure 20
Figure 21
Figure 22
Figure 23
Figure 24
Figure 25
Figure 26
Figure 27
Figure 28
Figure 29
Figure 30
Figure 31
Figure 32
Figure 33- —National Natural Science Foundation of China
- —Science and Technology Program Projects of Meishan Municipal Science and Technology Bureau
- —Guizhou Provincial Department of Science and Technology Projects
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobotic Path Planning Algorithms · UAV Applications and Optimization · Metaheuristic Optimization Algorithms Research
1. Introduction
In recent years, unmanned aerial vehicles have been widely applied in military reconnaissance, disaster relief, logistics distribution, and complex environment inspection due to their advantages of high mobility, flexible deployment, and relatively low cost. As task scales expand and operational environments become more complex, single-UAV systems have gradually exposed deficiencies in efficiency and robustness when performing multi-target, high-risk tasks [1,2]. Multi-UAV cooperative task execution has thus become an important research direction in the field.
Path planning is a core problem in multi-UAV systems that affects mission success rate and system safety. Effective path planning must not only ensure that UAVs can avoid threat zones and no-fly areas in complex environments, but also comprehensively consider flight energy consumption, task coordination efficiency, and overall system stability. Consequently, multi-UAV path planning is typically characterized as a complex optimization problem with high dimensionality, strong nonlinearity, and multiple constraints [3], placing high demands on solution algorithms.
Traditional deterministic methods for multi-UAV path planning, such as graph-search or mathematical programming approaches, including Dijkstra [4], A-star [5], D* Lite [6] algorithms, and mixed-integer linear programming [7], generally rely on precise modeling and problem-structure assumptions. When dealing with high-dimensional continuous spaces and complex constraints, these methods often suffer from high computational complexity, poor scalability, and susceptibility to local optima. In contrast, swarm intelligence optimization algorithms have gradually become a significant research direction in UAV path planning due to their strong global search capabilities and lower dependency on problem modeling [8].
In recent years, particle swarm optimization (PSO, 1995) [9], differential evolution (DE, 1995) [10], whale optimization algorithm (WOA, 2016) [11], moth–flame optimization (MFO, 2015) [12], and various emerging bio-inspired swarm intelligence algorithms, such as golden jackal optimization (GJO, 2024) [13], dung beetle optimizer (DBO, 2023) [14], and secretary bird optimization algorithm (SBOA, 2024) [15], have been widely applied to multi-UAV cooperative path-planning problems. Qi et al. [16] combined the artificial bee colony algorithm with particle swarm optimization and proposed an improved bee-foraging learning particle swarm optimization algorithm (IBFLPSO). The algorithm was applied to multi-UAV path-planning problems considering multiple energy consumption factors. Zheng et al. [17] proposed a differential evolution-based multi-UAV cooperative coverage algorithm (DECSMU) to address coverage tasks in different regions. Experimental results showed that DECSMU achieved high coverage rates and low energy consumption under collision avoidance constraints. Yu et al. [18] applied a whale optimization algorithm enhanced by simulated annealing (SA-WOA) to multi-UAV cooperative atmospheric sensing problems. The experimental results demonstrated that the proposed method exhibited good planning performance. Karthik et al. [19] proposed a hybrid golden jackal and moth–flame optimization algorithm (HGJMFOA) for coverage path planning, which was used to generate optimal paths supporting efficient and complete multi-UAV coverage. Lou et al. [20] proposed a novel multi-UAV three-dimensional terrain cooperative trajectory planning method based on a cuckoo search golden jackal optimization algorithm (CS-GJO). The simulation results verified that CS-GJO achieved better stability, higher optimization accuracy, and faster convergence. Yang et al. [21] proposed a landmark operator-based dung beetle optimizer (LODBO) by integrating multiple improvement strategies, and extensive simulations showed that LODBO could find optimal paths within shorter time. Zheng et al. [22] developed an improved secretary bird optimization algorithm incorporating weighted multidirectional dynamic learning and adaptive strategy selection mechanisms. The simulation results indicated that the proposed ASHSBOA generated lower-cost flight paths with more stable convergence behavior. Although these methods have improved path-planning performance to some extent, they still suffer from insufficient search efficiency, unstable solution quality, and premature convergence when dealing with high-dimensional search spaces, complex threat environments, and multi-UAV cooperative constraints.
Aquila Optimizer (AO) [23] is a novel bio-inspired swarm intelligence optimization algorithm inspired by the diverse flight and attack behaviors of the Aquila during hunting. By simulating different behavior strategies at various search stages, AO switches between exploration and exploitation, resulting in a clear and intuitive search mechanism. The existing studies have shown that AO exhibits good global search capability on various continuous optimization benchmark functions. However, as problem dimensionality and complexity increase, the original AO still has several limitations [24,25]. These include: (1) rapid loss of population diversity during iterations, which easily leads to premature convergence; (2) a relatively fixed behavior selection mechanism that cannot adapt dynamically to the search state; and (3) insufficient global search ability and stability when addressing high-dimensional and complex optimization problems.
To address these issues, many researchers have proposed improvements to the AO algorithm. For multi-area power system networks, Al-Majidi et al. [26] designed a hybrid Aquila Optimizer–sine cosine algorithm (HSCAO) to solve the sensitivity problem of PID-AGC parameters. Bai et al. [27] developed an improved Aquila Optimizer (MIAO) by introducing phasor operators and other strategies and applied it to hyperparameter optimization of long short-term memory (LSTM) networks. This led to the construction of an MIAO-LSTM model for monthly railway freight volume prediction. Wang et al. [28] applied an optimal scheduling strategy based on a differential mutation Aquila Optimizer integrated within a multi-objective optimization model considering total cost and greenhouse gas (GHG) emission constraints. To address the parameter identification of proton exchange membrane fuel cells (PEMFCs), Singla et al. [29] proposed an improved AO algorithm incorporating novel mutation strategies, referred to as AOAAO. The experimental results demonstrated its high accuracy, robustness, and time efficiency in real-time fuel cell modeling. Zhang et al. [30] combined opposition-based learning with a local escape operator to propose an improved Aquila Optimizer (LEOAO), achieving better flight paths for UAV path planning in three-dimensional environments. To meet the requirements of multi-task trajectory planning for multiple UAVs in three-dimensional agriculture, Liu et al. [31] proposed an interference-enhanced portfolio Aquila Optimizer (IEP-AO). This method improves trajectory search capability in complex operational spaces and large-scale task scenarios, enabling UAVs to escape local optimal trajectories more effectively.
However, existing studies on improving Aquila Optimizer still lack a systematic investigation into the adaptive regulation of its behavior selection mechanism. To address this gap, this paper focuses on multi-UAV cooperative path planning and conducts an in-depth analysis and improvement of AO. An enhanced behavior-adaptive Aquila Optimizer (EAO) is proposed and applied to multi-UAV cooperative path planning in complex environments.
The main research work of this paper can be summarized as follows:
- A multi-strategy cooperative behavior-adaptive Aquila Optimizer is proposed. By incorporating a periodic diversity maintenance mechanism, a diversity-based dynamic neighborhood guidance mechanism, a narrowed exploitation behavior based on neighborhood differential evolution, and a search-state-aware adaptive behavior selection mechanism, agent behaviors can be adaptively selected according to the current search state. This design effectively enhances the global search capability and convergence stability of the algorithm in complex search spaces.
- Through parameter sensitivity experiments, reasonable parameter configurations for EAO are determined, and ablation experiments systematically verify the effectiveness of each improvement strategy. Further exploration–exploitation capability analysis experiments evaluate the behavioral evolution characteristics of the algorithm during the search process. Additionally, based on the CEC2017 and CEC2020 standard benchmark suites, the global optimization performance of EAO on low-, medium-, and high-dimensional problems is systematically tested. The experimental results demonstrate that EAO exhibits significant overall advantages compared to 14 other representative competitive algorithms.
- A multi-UAV cooperative path-planning model is constructed, comprehensively considering threat zones, no-fly zones, and UAV dynamic constraints to improve the feasibility of the algorithm in practical application scenarios. Based on this, four simulation map environments ranging from simple to complex are designed, and multi-UAV cooperative path-planning simulation experiments are conducted to systematically verify the proposed algorithm’s performance in path quality, result stability, and cooperative efficiency.
The structure of this paper is arranged as follows: Section 1 introduces the research background, related research status, and the main research content of this paper. Section 2 presents the mathematical modeling of the multi-UAV path-planning problem. Section 3 explains the principles of the original AO algorithm. Section 4 elaborates on the design philosophy and implementation process of the improved Aquila Optimizer. Section 5 presents an experimental analysis of parameter sensitivity for the EAO algorithm. Section 6 validates the optimization performance of the algorithm based on the CEC2017 and CEC2020 benchmark suites. Section 7 conducts simulation experiments for multi-UAV cooperative path planning and analyzes the results. Section 8 summarizes the entire work.
2. Multi-UAV Cooperative Path-Planning Modeling
This section discusses the modeling of multi-UAV cooperative path planning [32,33,34], including decision variables, objective functions, and cost function components.
2.1. Decision Variables
The multi-UAV path is composed of UAVs. The trajectory of the th UAV is represented by waypoints. For planning convenience, each waypoint is described using spherical coordinates. Its decision variable is defined as follows:
where denotes the radial distance of the th waypoint of the th UAV; represents the pitch angle of the th waypoint of the th UAV, used for vertical obstacle avoidance; is the azimuth angle of the th waypoint of the th UAV, employed for horizontal path planning; and indicates the number of waypoints for the th UAV.
The principle of the decision variable space is illustrated in Figure 1. By applying the transformation function from spherical to Cartesian coordinates, the actual sequence of UAV positions in three-dimensional space can be obtained:
2.2. Objective Function
The objective of the path-planning problem is to find an optimal solution that minimizes the total objective function value. The objective function adopts a two-tier penalty design. The weighted cost function provides smooth guidance within the feasible region, while the additional penalty term strictly enforces critical safety constraints and guarantees feasibility. Their formulas are defined as follows:
The weighted cost function is expressed as a linear combination of multiple sub-cost functions:
where to represent the path length cost, altitude cost, threat cost, no-fly zone constraint cost, time coordination cost, collision avoidance cost, angle constraint cost, and trajectory segment constraint cost, respectively. The corresponding weight coefficients are denoted as to .
To strictly enforce constraints related to threat regions, no-fly zones, and terrain safety, an additional penalty term is imposed beyond the weighted cost function. The penalty term is formulated as follows:
where denotes the number of penetrations into the core area of the radar threat zone; denotes the number of penetrations into the core area of the artillery threat zone; denotes the number of terrain collisions; denotes the number of violations in the no-fly zone.
2.3. Cost Function Components
2.3.1. Flight Path Cost
The flight path cost function aims to generate the shortest flight path to minimize mission duration and energy consumption. It is defined as
where the total path length of the th UAV; the maximum allowable path length for the th UAV; and denotes the Cartesian coordinates of the th waypoint of the th UAV.
2.3.2. Altitude Cost
The altitude cost ensures that UAVs operate within a safe altitude range to avoid terrain collisions and comply with airspace regulations. It is defined as
where denotes the terrain collision penalty, represents the altitude boundary violation penalty, and indicates the ground clearance (altitude above terrain). and denote the lower and upper bounds of the allowable flight altitude, respectively.
2.3.3. Threat Cost
The threat cost forces UAVs to avoid radar and artillery threat zones, which are modeled as spheres in three-dimensional space. The threat cost includes an inverse-distance penalty and a fixed-level penalty. The formula is as follows:
where represents the radar threat cost; represents the artillery threat cost; denotes the shortest three-dimensional Euclidean distance from trajectory segment of the UAV to threat center ; represents the effective radius of radar threat zone ; denotes the effective radius of artillery threat zone ; indicates the number of radar threat zones; and denotes the number of artillery threat zones.
2.3.4. No-Fly Zone Constraint Cost
The no-fly zone constraint cost compels UAVs to avoid designated prohibited areas, which are modeled as vertical cylinders extending from the ground to a specified height. The formulation of is given as follows:
where denotes the constraint cost of the th trajectory segment of the th drone in the jth no-fly zone; represents the number of no-fly zones; is the shortest distance from the projection of the th trajectory segment of the th drone onto the xy-plane to the center of the th no-fly zone; is the cylindrical radius of the th no-fly zone; denotes the cylindrical height of the no-fly zone; and represents the terrain height of the th no-fly zone.
2.3.5. Time Synchronization Cost
The time synchronization cost ensures multiple UAVs arrive at their respective target points within a reasonable time window, achieving temporal coordination. is defined as
where represents the time deviation of the th drone; denotes the synchronization target time for the th drone; indicates the path length of the th drone; is the actual flight time of the th drone; and respectively denote the maximum and minimum speeds of the th drone.
2.3.6. Collision Avoidance Cost
The collision avoidance cost ensures multiple UAVs maintain safe distances simultaneously to prevent mid-air collisions. is defined as
where represents the collision avoidance cost between UAV and UAV at time ; denotes the Euclidean distance between UAV and UAV at time .
2.3.7. Angular Constraint Cost
The angular constraint cost limits the turn angle between consecutive waypoints in a UAV’s flight path, ensuring the planned route aligns with the UAV’s physical maneuverability. is defined as
where represents the turn angle of the th UAV at the th path point; denotes the maximum allowable turn angle.
2.3.8. Trajectory Segment Constraint Cost
To prevent overly dense and impractical paths, each trajectory segment must possess sufficient length to meet the minimum step size requirement for flight control. The trajectory segment constraint cost is defined as
where is the minimum allowable track segment length.
3. Aquila Optimizer (AO)
AO mimics four different hunting strategies of the aquila to establish a mathematical model. Based on the varying characteristics of solutions within the search space, it flexibly applies different search strategies. Table 1 lists the symbols and naming conventions used by AO.
3.1. Expanded Exploration
In this phase, the eagle flies at high altitude and extensively explores the search space. Once the eagle identifies the area where prey is located, it executes a vertical dive. The mathematical model for this behavior is expressed as
where is the solution of the next iteration for ).
3.2. Narrowed Exploration
After identifying the prey’s area from high altitude, the eagle hovers above the target prey, prepares to land, and then launches an attack. This method is called contour flight followed by a short glide attack. The mathematical expression for this behavior is
where is the solution for the next iteration of ; is a random solution taken within the range at the th iteration; is a constant fixed at 0.01; and is a constant fixed at 1.5.
The spiral shape in the search is represented by and , calculated as follows:
where takes a value between 1 and 20 to fix the number of search cycles; is fixed at 0.00565; is an integer from 1 to the search space length ; and is a small fixed value of 0.005.
3.3. Expanded Exploitation
After precisely identifying the prey’s area, the eagle performs an initial vertical descent to test the prey’s reaction. This method is called the low-flying slow descent attack. The mathematical expression for this behavior is
where is the solution for the next iteration of ; and are adjustment parameters, typically set to 0.1.
3.4. Narrowed Exploitation
When the eagle approaches its prey, it chases and attacks the prey on the ground. The mathematical expression for this behavior is
where is the solution for the next iteration of .
4. Behavior-Adaptive Aquila Optimizer (EAO)
Although Aquila Optimizer provides an effective framework for solving complex optimization problems by simulating the hunting behavior of the eagle, it still has inherent limitations when addressing high-dimensional, multimodal, and other complex scenarios. Firstly, its behavior switching relies on fixed iteration progress rather than real-time search states, which prevents the search process from adapting to the dynamic characteristics of the problem. This leads to difficulties in sustaining effective strategies and results in low search efficiency. Secondly, the algorithm lacks effective diversity maintenance and distributed guidance mechanisms. The population tends to rapidly homogenize under the attraction of a single global optimal solution, making it prone to falling into local optima when tackling complex optimization problems. Additionally, the randomness in its local search strategies results in insufficient exploitation precision and stability.
To address these systematic limitations, this section proposes an enhanced version of Aquila Optimizer—the Adaptive Aquila Optimizer (EAO). Its core improvements lie in introducing a series of adaptive mechanisms based on search-state awareness and deep collaboration among strategies: Firstly, a behavior selection and trust mechanism based on search-state awareness is designed to dynamically determine the timing for behavior switching, ensuring the continuity of effective strategies. Secondly, periodic diversity maintenance and dynamic neighborhood guidance strategies are introduced to avoid premature convergence while fully utilizing local search information. Lastly, a walking predation behavior based on neighborhood differential evolution is adopted to enhance the stability and efficiency of local search. These strategies collectively form an adaptive and collaborative optimization framework to improve the algorithm’s ability to balance global exploration and local exploitation. The schematic diagram illustrating the behavior adaptation principle of EAO, composed of multiple synergistic strategies, is shown in Figure 2. Table 2 lists the symbols and naming conventions used by EAO.
4.1. Periodic Diversity Maintenance Mechanism
The original AO lacks an active diversity maintenance strategy, which can lead to rapid loss of population diversity and a tendency to fall into local optima. Relying solely on random perturbations makes it difficult to escape local optima. Therefore, a periodically triggered diversity maintenance mechanism is introduced.
The population diversity index is defined as shown in Equation (40). It reflects the dispersion degree of the population [35]. A high value indicates an exploration phase, while a low value indicates an exploitation phase.
where denotes the Euclidean norm.
The mechanism is configured to check every iterations. Based on the results of the parameter sensitivity test (see Appendix A.2, Table A3), set . If it falls below a threshold, selective resetting is applied to the worst 15% of individuals in terms of fitness. Specifically, individuals with fitness values greater than 1.1 times the current best fitness are reset. The reset individual position is given by
where is a uniformly distributed random vector of dimension .
4.2. Diversity-Based Dynamic Neighborhood Guidance Mechanism
In the original AO, all individuals are guided solely by the single global best solution , which can lead to rapid population clustering and loss of diversity. This increases the risk of premature convergence and fails to fully utilize local information from different spatial regions. To address this issue, EAO replaces the global best with a neighborhood-best approach in both the expanded exploration and expanded exploitation phases of the original AO. Additionally, the neighborhood size is dynamically adjusted based on population diversity, preventing excessive population concentration during these phases.
Let the neighborhood of individual consist of its nearest neighbors among the other individuals. The neighborhood best position for individual is then defined as
The neighborhood size dynamically adjusts based on :
Therefore, replacing the global optimal position with the neighborhood optimal position in Equations (26) and (35) from Phase 1 and Phase 3 of the AO algorithm yields
4.3. Narrowed Exploitation Behavior Based on Neighborhood Differential Evolution
The narrowed exploitation formula in the original AO is complex and relies on Lévy flights. The long-jump property of Lévy flights can lead to instability during the local refinement search phase, resulting in insufficiently clear search directions. To address this, EAO replaces Equation (36) with a neighborhood-based differential mutation operator [36]:
where and are two distinct individual positions randomly selected from the neighborhood.
4.4. Adaptive Behavior Selection Mechanism Based on Search-State Awareness
The original AO adopts a behavior-switching strategy based on fixed iteration progress: exploration behaviors are randomly selected when , and exploitation behaviors are randomly selected when . However, this mechanism has inherent limitations. First, the search state can vary significantly across different problems at the same iteration progress. Fixed switching thresholds fail to adapt to the dynamic characteristics of the problem. Second, the strategy cannot adjust behavioral choices based on search effectiveness, potentially causing effective behaviors to be prematurely abandoned or ineffective behaviors to be over-executed, lacking adaptivity.
To address these issues, EAO introduces an adaptive behavior selection mechanism based on search-state awareness. This mechanism dynamically selects exploration or exploitation behaviors according to the current search state, thereby improving the algorithm’s ability to balance exploitation and exploration.
The adaptive behavior selection mechanism achieves self-adaptive decision-making by
Introducing three core search-state metrics to monitor the optimization process in real-time;Establishing behavior-selection rules and a trust mechanism based on these states;Incorporating a forced exploration behavior to ensure adaptability.
4.4.1. Definition of Search-State Quantities
(1) Optimal Solution Improvement Rate
The optimal solution improvement rate is defined to reflect search effectiveness. A high value indicates the current search direction is effective, while a low value suggests potential stagnation.
(2) Failure Experience Metric
Define the failure experience metric to reflect search stability. A high value indicates that most individuals failed to achieve improvement, suggesting the search may be trapped in local optima.
where denotes the indicator function, which returns 1 when the condition is true and 0 otherwise.
4.4.2. State-Driven Behavior Selection Rules
Based on the state variable combinations defined in Section 2.1, these are mapped to six behavioral modes. The six behaviors are: expanded exploration, narrowed exploration, expanded exploitation, and narrowed exploitation from AO; and the newly defined forced exploration and random exploitation from EAO.
In EAO, forced exploration refers to the scenario where, if the eagle fails to capture prey in the current hunting area over an extended period, the situation indicates the area is depleted. The eagle then leaves the current hunting ground and re-explores the nearby region. The formula for generating a new position in this behavior is
where is its maximum value, set to .
Thus, the forced exploration behavior performs a substantial random jump centered around the current best solution within a limited range. This approach leverages known information while retaining the ability to escape local traps.
Random exploitation refers to randomly selecting either expanded exploitation or narrowed exploitation.
The six behaviors are numbered from 1 to 6, and their selection rules are defined as
where , , represent the critical thresholds for behavior selection based on , and , respectively, with values set as , , and .
4.4.3. Trust-Based Behavioral Persistence Mechanism
The original AO algorithm exhibits strong randomness and a lack of continuity in behavior selection. Even if the current search behavior is effective, it may be randomly switched in the next generation, leading to inconsistent search directions and low convergence efficiency. To address this, this mechanism introduces the concept of “trust” to enable the algorithm to adaptively extend the execution time of effective behaviors based on historical performance and promptly discard ineffective ones.
The trust-based behavior decision logic is defined as
where is the behavior trust, representing the remaining number of iterations for which this behavior can continue to be executed.
When the trust , the individual will continue executing the previous generation’s behavior. When the trust is depleted ( ), it switches to a new suggested behavior.
After each generation of search, rewards or penalties are applied based on whether the individual improves. If the fitness value of the new position is better than the original position, a reward is executed. The trust update formulas are as follows:
where is the reward function.
If the fitness value of the new position is not better than the original position, a penalty is executed. The trust update formulas in this case are
4.5. Pseudocode and Program Flowchart
The EAO algorithm integrates a periodic diversity maintenance mechanism, a diversity-based dynamic neighborhood guidance mechanism, a narrowed exploitation behavior based on neighborhood differential evolution, and an adaptive behavior selection mechanism based on search-state awareness. The flowchart of the EAO algorithm which enables adaptive behavior selection is shown in Figure 3. The time complexity analysis is provided in Appendix A.1, and the pseudocode of EAO (Algorithm 1) is as follows:
Algorithm 1 Pseudocode for EAO
1: Initialize population randomly and evaluate fitness. 5: t = t + 1
using Equations (40)–(42). 10: Reinitialize worst 15% individuals using Equation (43).11: Evaluate new individuals and update 12: end if13: end if using Equations (50) and (51). using Equations (45) and (46). using Equations (38) and (39).17: for i = 1 to N do to all individuals, sort and select k(t) neighbors. using Equation (44). using Equation (54). using Equation (55). then using Equation (47). then using Equation (28). then using Equation (48). then neighbors. using Equation (49) then using Equations (52) and (53).33: else and apply corresponding update.35: end if using Equations (59) and (60). if better solution found. 39: end for40: end while
5. Parameter Sensitivity Analysis of EAO
Parameter selection is crucial for the performance of metaheuristic algorithms. Therefore, this section conducts a parameter sensitivity analysis on six key parameters in EAO using 12 functions from the CEC2017 test suite: F1, F3, F6, F9, F11, F12, F13, F15, F24, F27, F29 and F30. The analyzed parameters include population size , historical window length , minimum neighborhood size , maximum neighborhood size percentage , low diversity threshold , and high diversity threshold .
During the experiments, each parameter is varied within its predefined range while the others remain fixed, to explore the impact of different values on EAO’s performance. The maximum number of function evaluations is set to 1000 × Dim. To avoid randomness, each experiment is repeated 30 times. The best value, average value, variance, and Friedman ranking are calculated. The overall Friedman ranking comparison is shown in Figure 4, and detailed data is provided in Table A4, Table A5, Table A6, Table A7, Table A8 and Table A9 of Appendix A.2.
According to the experimental results, the best overall Friedman ranking is achieved when , , . In the sensitivity analysis of population size, EAO performs better with smaller populations. This is because EAO’s adaptive behavior persistence mechanism and diversity maintenance mechanism function more effectively when the number of iterations is sufficient. However, considering the statistical significance of algorithm comparisons, the standard setting is adopted for the experiments in this paper.
6. Performance Testing and Analysis of EAO
To systematically and comprehensively evaluate the optimization performance and algorithmic characteristics of the proposed EAO algorithm, this chapter conducts multi-angle and multi-level experimental analyses based on the internationally recognized CEC2017 and CEC2020 benchmark test suites. The CEC2017 function set includes various unimodal, multimodal, hybrid, and composite functions that are suitable for assessing an algorithm’s optimization capability in medium- to high-dimensional complex search spaces. The CEC2020 function set presents greater challenges in terms of function construction and variable coupling, often used to verify an algorithm’s generalization ability under different problem characteristics and lower-dimensional conditions. Detailed descriptions of the functions in both test suites are provided in Table A1 and Table A2 of Appendix A.2.
Regarding the design of experimental content: first, multiple EAO variants are constructed, and ablation experiments are conducted on the 30-dimensional test functions of CEC2017 to analyze the effectiveness of each improvement strategy; second, by introducing a population dimensional diversity metric, the exploration and exploitation capabilities of the algorithm are assessed; finally, under different dimension settings, systematic performance comparisons between EAO and multiple competing algorithms are carried out on the CEC2017 (30/50/100 dimensions) and CEC2020 (10/20 dimensions) benchmark test suites, accompanied by statistical significance analysis using the Friedman ranking test and the Wilcoxon rank-sum test.
To thoroughly validate the optimization performance and competitive advantages of EAO, a total of 14 representative optimization algorithms are selected as comparison methods to ensure the comprehensiveness of the results. These include: (1) classic and highly cited algorithms: PSO, DE, CMA-ES (Covariance Matrix Adaptation Evolution Strategy, 2016) [37], WOA, SCA (Sine Cosine Algorithm, 2016) [38], MFO, and HHO (Harris Hawks Optimization, 2019) [39]; (2) recent high-performance algorithms: MShOA (Mantis Shrimp Optimization Algorithm, 2025) [40], GJO, (Weighted Average Algorithm, 2025) [41], SHO (Sea-Horse Optimizer, 2022) [42], HO (Hippopotamus Optimization, 2024) [43], and PO (Parrot Optimizer, 2024) [44]; (3) the original Aquila Optimizer (AO).
6.1. Experimental Configuration
The parameter settings of the algorithms refer to their respective original papers, with specific configurations listed in Table 3. All compared algorithms are run under the same experimental conditions: the population size is set to 30, and the maximum number of function evaluations is set to 1000 × Dim. To ensure the fairness and reproducibility of the experimental results, all algorithms are executed 30 independent runs on the benchmark test suites. The best value, mean value, variance, Friedman ranking, and Wilcoxon rank-sum test results are calculated. All experiments are conducted on the software MATLAB R2021b.
6.2. Ablation Experiment
To verify the contribution of each improvement strategy in EAO to the algorithm’s performance enhancement, this section conducts ablation experiments on the 30-dimensional test functions of CEC2017. By introducing different improvement mechanisms into the original AO algorithm, various algorithm variants are constructed to analyze the independent role of each strategy in improving search performance. These algorithm variants include: (1) EAO_D: Introduces a periodic diversity maintenance mechanism into the original AO; (2) EAO_N: AO combined with a dynamic neighborhood guidance mechanism based on population diversity; (3) EAO_E: Integrates a neighborhood differential evolution-based search strategy into the narrowed exploitation phase of AO; (4) EAO_S: Introduces a search-state-aware adaptive behavior selection mechanism into AO. By comparing the best values, mean values, and standard deviations of each variant with the complete EAO across test functions, and combining the results of Friedman rankings, the contribution of different improvement strategies to the overall performance of the algorithm can be intuitively assessed.
Table 4 presents the test results of various algorithm variants, including the best value, mean value, variance, and Friedman ranking. The best values in the table are highlighted in bold and underlined.
Overall, compared to other variants and the original AO algorithm, EAO achieves the highest number of best values across the test functions and ranks first overall. This indicates that the improvement strategies, when combined, can effectively enhance the algorithm’s search capability and stability. In contrast, algorithm variants enhanced with a single strategy show some improvement on certain functions, but their overall performance remains weaker than that of EAO. Through comparison of different variants, it can be observed that both EAO_S and EAO_D outperform the original AO in ranking, suggesting that the search-state-aware adaptive behavior selection mechanism and the periodic diversity maintenance mechanism positively impact algorithm performance. Although EAO_N and EAO_E rank lower than AO, they achieve optimal results in some metrics, indicating that the dynamic neighborhood guidance mechanism and the neighborhood differential evolution-based search strategy must be organically integrated with other strategies to achieve optimal optimization performance.
6.3. Exploration and Exploitation Experiment
The balance between exploration and exploitation is a key factor determining the performance of swarm intelligence algorithms. To deeply analyze the search behavior characteristics of EAO, as described in this subsection, F1, F4, F6, F10, F11, F20, F22, F25 and F28 are selected from the four types of functions in CEC2017 to quantitatively evaluate the algorithm’s exploration and exploitation capabilities. The population dimensional diversity metric [45,46] is introduced in the experiments to characterize the changes in population distribution during the search process. By analyzing the evolutionary trend of diversity over iterations in EAO, the behavioral characteristics of its global exploration and local exploitation phases can be visually reflected. The formulas for the population dimension diversity index Div and the exploration–exploitation ratio are as follows:
where is the maximum population dimension diversity value; is the position of the th individual in the th dimension; and is the median of the positions of all individuals in the th dimension.
Figure 5 shows the exploration and exploitation variation curves of EAO on the CEC2017 30-dimensional test functions, illustrating the dynamic balance between exploration and exploitation capabilities during the search process. From the figure, it can be observed that EAO maintains a relatively high exploration ratio in the early stages of the search, which helps fully explore the search space and avoid premature convergence to local optima. As the iteration process progresses, the exploration ratio gradually decreases while the exploitation ratio increases, indicating that the algorithm can appropriately enhance exploitation capability to accelerate the convergence process.
6.4. Convergence Analysis
To further analyze the convergence characteristics of the proposed EAO algorithm, eight functions, including F1, F4, F6, F9, F13, F19, F20, and F25 from the CEC2017 benchmark suite were selected. The convergence behavior of EAO was studied in a 30-dimensional search space. The number of iterations was set to 50 in the experiments. Several results were presented, including the search history distribution, the trajectory changes of five search agents in the first dimension, the average fitness curve of the population, and the convergence curve of the optimal value.
Figure 6 shows the experimental results in a two-dimensional space. In the search history plots (the second column), the red star represents the global optimum. It can be observed from the distribution of sample points that the points far from the global optimum are sparse and widely scattered. In contrast, the points near the global optimum are densely distributed. This indicates that EAO has strong global exploration capability and can effectively conduct local exploitation in regions containing potential optima.
The trajectories of five search agents in the first dimension are shown in the third column. As the iterations proceed, the variations in the search agents’ values gradually become stable. The average fitness curve of the population (the fourth column) and the convergence curve of the optimal value (the fifth column) both decline steadily throughout the iteration process. This further demonstrates that the EAO algorithm can converge stably to the optimal solution.
6.5. Test Results and Analysis on CEC2017
Figure 7 summarizes the average ranking values of EAO and other algorithms across 30/50/100 dimensions. The convergence curves and boxplots obtained by all algorithms on each function are shown in Figure 8, Figure 9, Figure 10, Figure 11, Figure 12 and Figure 13, respectively. To enhance readability, detailed test results are presented in Table A10, Table A11, Table A12, Table A13, Table A14 and Table A15 of Appendix A.2, including the best value, mean value, variance, p-value, and Friedman ranking. The best values in the tables are highlighted in bold and underlined. The symbols “+/=/–” in the tables indicate that EAO performs better than, equal to, or worse than the other algorithms, respectively.
As can be seen from the data in the tables, EAO achieves the highest number of best metric values compared to the other algorithms across all dimensions. For the 30-dimensional case, EAO obtains the best value in 28 out of a total of 87 metrics across all functions, achieves the first rank in 19 functions based on the Friedman test, and has an average rank of 1.6. When the dimension is 50 or 100, EAO achieves the best value in 36 metrics and ranks first in 17 functions, with average ranks of 1.6 and 1.5, respectively. Therefore, EAO ranks first among all algorithms across all dimensions, demonstrating its ability to maintain good search accuracy and stability across different dimensions, especially in medium- and high-dimensional problems. Meanwhile, the results of the Wilcoxon rank-sum test show that EAO has a statistically significant performance advantage over most of the compared algorithms. These results fully validate the scalability and competitive advantage of EAO in medium- and high-dimensional complex optimization problems. Furthermore, the convergence curves show that EAO is able to escape local traps, even in the middle and later stages, on most functions and quickly find better solutions, indicating that the search-state-aware adaptive behavior selection framework, which integrates the four improvement strategies, effectively enhances convergence accuracy.
6.6. Test Results and Analysis on CEC2020
In addition to medium- and high-dimensional optimization problems, the generalization ability of algorithms under different benchmark definitions and lower-dimensional conditions is also worthy of attention. Therefore, this subsection further evaluates the performance of EAO on 10-dimensional and 20-dimensional problems using the CEC2020 benchmark function set. By comparing the performance of EAO with other competing algorithms in terms of best value, mean value, and standard deviation, and then combining the Friedman ranking and Wilcoxon rank-sum test results, the robustness and generalization capability of EAO under different problem characteristics can be verified. Figure 14 summarizes the average ranking values of EAO and other algorithms across 10/20 dimensions. The convergence curves and boxplots obtained by all algorithms on each function are shown in Figure 15, Figure 16, Figure 17 and Figure 18, respectively. To enhance readability, detailed test results are presented in Table A16, Table A17, Table A18 and Table A19 of Appendix A.2, including the best value, mean value, variance, p-value, and Friedman ranking. The best values in the tables are highlighted in bold and underlined. The symbols “+/=/–” in the tables indicate that EAO performs better than, equal to, or worse than the other algorithms, respectively.
From the data in the tables, it can be observed that EAO achieves the highest number of best metric values compared to the other algorithms across all dimensions. For the 10-dimensional case, EAO obtains the best value in 17 out of a total of 30 metrics across all functions, achieves the first rank in 8 functions based on the Friedman test, and has an average rank of 1.5. When the dimension is 20, EAO achieves the best value in 13 metrics and ranks first in 6 functions, with an average rank of 1.6. Therefore, EAO ranks first among all algorithms across all dimensions, indicating that it also maintains good search accuracy and stability in lower-dimensional problems. Meanwhile, the results of the Wilcoxon rank-sum test show that EAO has a statistically significant performance advantage over most of the compared algorithms. Combined with the test results on CEC2017, EAO demonstrates excellent optimization performance across low-, medium-, and high-dimensional problems. This indicates that the proposed improvement strategies are not specifically designed for certain benchmark functions or dimensions but rather exhibit good adaptability and generalization ability under different problem characteristics.
7. Multi-UAV Cooperative Path-Planning Simulation Experiment
To validate the applicability and effectiveness of the proposed Enhanced Aquila Optimizer (EAO) in practical engineering problems, this section applies EAO to a multi-UAV cooperative path-planning simulation scenario and conducts a comparative analysis with various representative optimization algorithms. Compared to standard benchmark functions, the multi-UAV path-planning problem not only exhibits high-dimensional and nonlinear characteristics but also requires simultaneous consideration of UAV dynamic constraints, environmental threats, no-fly zone avoidance, and cooperative safety among multiple UAVs, placing higher demands on the comprehensive performance of optimization algorithms.
This section constructs several three-dimensional flight environments with varying levels of complexity, based on elevation maps. Through quantitative metrics and visual path results, a systematic evaluation of EAO’s optimization capability and cooperative performance in multi-UAV cooperative path-planning tasks is conducted.
7.1. Experimental Configuration for Path-Planning Simulation
The experiment constructs four typical flight scenarios of increasing complexity based on elevation maps with dimensions of 1045 × 879 × 300, as shown in Figure 19. In these scenarios, three UAVs perform cooperative path planning. The yellow square on the map indicates the starting point, the yellow star marks the endpoint, the red cylinder denotes the no-fly zone, and the yellow and white spheres represent threats such as artillery and radar, respectively. Detailed configurations of environmental parameters for different scenarios are provided in Table 5. The constraint parameter settings for the UAVs are listed in Table 6.
Regarding algorithm parameter settings, the maximum number of function evaluations is set to 10,000, and the population size is set to 30. To comprehensively validate the performance of EAO, this paper selects five representative optimization algorithms as comparison methods, including the classic algorithms PSO and SCA, the original Aquila Optimizer (AO), and the newly proposed algorithms GJO and WAA, which demonstrated relatively excellent performance in the earlier benchmark tests. All algorithms are run independently 30 times under identical parameter configurations to minimize the influence of random factors on the experimental results.
In terms of performance evaluation, in addition to comparing the overall performance of the algorithms using the best fitness value, mean value, standard deviation, and Friedman ranking of the optimization problem, the number of collision-free trajectories generated by each algorithm during the path-planning process that avoid no-fly zones and threat areas is also recorded. The average running time of each algorithm is further provided to comprehensively assess the performance of EAO in terms of path-planning quality and computational efficiency.
7.2. Sensitivity Analysis of Objective Function Weight Coefficients
Since the selection of objective function weights can influence the optimization results in multi-UAV path planning, sensitivity and robustness analyses are necessary. Considering that safety is of primary importance, the threat-related cost and the no-fly zone constraint cost are assigned the largest weights. In addition, the time coordination cost among multiple UAVs and the collision avoidance cost are also emphasized.
Accordingly, four key components ( – ) are treated as dominant weights and initially set to 0.18, 0.18, 0.18, and 0.14, respectively. The sensitivity experiments are conducted on the most complex multi-UAV path-planning simulation scenario (Map 4). In each experiment, only one dominant weight is varied by ±20%, while the remaining three dominant weights are kept unchanged. The weights of the other four components (path length, altitude, turning angle, and trajectory segments) are adjusted accordingly to ensure that the sum of all weight coefficients remains equal to one. This design guarantees the fairness and independence of the sensitivity evaluation.
The experimental results are reported in Table 7. When ±20% perturbations are applied to the threat cost, no-fly zone constraint cost, time coordination cost, and collision avoidance cost, the average fitness values generated by EAO exhibited variation ranges of 8.7%, 14.5%, 3.65%, and 14.11%, respectively. Although the absolute fitness values change, the convergence behavior and relative performance trends remain stable. These observations indicate that the proposed EAO algorithm does not rely on a specific weight configuration and exhibits satisfactory robustness.
Based on the above analysis, the final weight coefficients to are set to 0.10, 0.12, 0.18, 0.18, 0.18, 0.14, 0.08, and 0.02, respectively.
7.3. Path-Planning Simulation Results and Analysis
Table 8 presents the results of cooperative path planning for multiple UAVs across four maps using all algorithms. The best values for each metric in the table are highlighted in bold and underlined. Figure 20 shows the average convergence curves of all algorithms for path planning on the four maps, while Figure 21, Figure 22, Figure 23, Figure 24, Figure 25, Figure 26, Figure 27, Figure 28, Figure 29, Figure 30, Figure 31 and Figure 32 display the 3D main view, top view, and side view of the optimal collision-free and threat/no-fly-zone-free paths planned by each algorithm.
From the data in the table, it can be observed that in the four simulation scenarios with progressively increasing complexity, EAO achieves the best results in all fitness-related evaluation metrics, with its Friedman ranking consistently placing first. Compared to the original AO algorithm, EAO improves the average fitness values by 80.42%, 81.25%, 81.34%, and 84.84% in the four map environments, respectively, indicating that the proposed improvement strategies significantly enhance path-planning performance. In terms of the number of valid trajectories, EAO plans the highest number of valid collision-free trajectories that avoid no-fly zones and threat areas in Map 1, Map 2, and Map 4. In Map 3, although the number of valid trajectories generated by EAO is slightly lower than that of WAA, its corresponding fitness value remains superior to WAA, suggesting that EAO demonstrates better overall performance in path quality. Regarding average runtime, the differences among the six compared algorithms are minimal, indicating that EAO improves path-planning performance without significantly increasing computational overhead.
The optimal path-planning results shown in Figure 20, Figure 21, Figure 22, Figure 23, Figure 24, Figure 25, Figure 26, Figure 27, Figure 28, Figure 29, Figure 30, Figure 31 and Figure 32 further demonstrate that EAO can plan smooth flight paths from the start to the endpoint under complex constraints, avoiding collisions and traversing neither threat areas nor no-fly zones. Compared to the second-ranked GJO algorithm, EAO exhibits superior performance in both convergence speed and path quality.
In summary, EAO outperforms the compared algorithms in terms of path quality, result stability, and statistical ranking, demonstrating the best overall performance and validating its effectiveness in multi-UAV cooperative path planning under complex constraints.
8. Conclusions
This paper focuses on the application of Aquila Optimizer AO in complex optimization problems and multi-UAV cooperative path planning. To address the issues of insufficient diversity and limited balance between exploration and exploitation during the search process in the original AO, an enhanced AO based on an adaptive behavior selection mechanism (EAO) was designed and proposed. Its performance was validated through a series of benchmark experiments and path-planning simulation experiments.
First, to determine appropriate values for key parameters in the algorithm, parameter sensitivity experiments were conducted. Based on CEC2017 test functions, the optimization performance of EAO under different parameter combinations was compared and analyzed. By comparing statistical results such as the best, mean, and variance of function values, a parameter configuration with relatively better performance was selected, providing a unified parameter basis for subsequent experiments.
Second, ablation experiments and exploration–exploitation experiments were performed on the 30-dimensional test functions of CEC2017. In the ablation experiments, multiple algorithm variants were constructed and compared with the original AO to systematically analyze the effectiveness of each improvement strategy across different test functions. The experimental results indicate that each improvement strategy positively contributed to enhancing the algorithm’s performance, while the complete EAO demonstrated the best overall performance. In the exploration–exploitation experiments, the population dimensional diversity metric was introduced to analyze the changes in exploration and exploitation capabilities during the iterative process. The results show that EAO exhibits reasonable behavioral variations in different phases of the search process.
In terms of comprehensive performance evaluation, this paper further conducted systematic comparative tests between the proposed algorithm and 14 other representative competing algorithms on the CEC2017 and CEC2020 benchmark suites. Specifically, the CEC2017 test set was used under 30-, 50-, and 100-dimensional conditions to evaluate the algorithm’s performance in medium- to high-dimensional optimization problems, while the CEC2020 test set was employed under 10- and 20-dimensional conditions to examine its optimization performance in lower-dimensional problems. The experimental results were analyzed using statistical metrics such as the best, mean, and variance values, along with the Friedman ranking and Wilcoxon rank-sum test. The results, when compared, demonstrate that EAO achieved stable optimization results across most test functions and different dimensional conditions, securing the top overall ranking.
Finally, the proposed EAO and five other competing algorithms were applied to multi-UAV cooperative path-planning simulation experiments. Multiple experimental environments, ranging from simple to complex, were constructed based on elevation maps, and factors such as no-fly zones, threat areas, and UAV dynamic constraints were comprehensively considered to simulate the multi-UAV cooperative path-planning process. Comparative analysis of metrics such as path-planning fitness, number of valid trajectories, and average runtime shows that EAO ranks first among the six algorithms. Compared to the original AO algorithm, EAO improved the average fitness values by 80.42%, 81.25%, 81.34%, and 84.84% across different map environments, confirming its feasibility and effectiveness in multi-UAV path-planning problems.
From an application perspective, the path-planning method based on the adaptive behavior mechanism proposed in this paper is not only suitable for multi-UAV cooperative path planning but can also be extended to other complex optimization and planning tasks characterized by high dimensionality, strong nonlinearity, and multiple constraints. Examples include mobile robot path planning, multi-robot cooperative search and coverage, unmanned vehicle path planning, and parameter optimization in complex engineering systems. Due to its low dependency on problem modeling and good generality and scalability, EAO holds potential for application in related fields of intelligent optimization and autonomous decision-making.
Although the improved AO proposed in this paper has achieved certain success in multi-UAV path planning, the current research is primarily based on the assumption of a static environment and does not fully account for factors such as dynamic threats and real-time task changes. Therefore, future work will explore the following directions to extend this study: First, the proposed behavior-adaptive AO framework will be extended to dynamic environments, where time-varying threats, moving obstacles, and real-time task changes are considered, enabling online path replanning for multi-UAV systems. Second, more complex cooperative scenarios will be investigated, including communication constraints, formation maintenance, and dynamic task allocation among multiple UAVs. Finally, the proposed method will be validated in larger-scale and more realistic environments to further evaluate its scalability and practical applicability.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Jones M. Djahel S. Welsh K. Path-planning for unmanned aerial vehicles with environment complexity considerations: A survey ACM Comput. Surv.20235513910.1145/3570723 · doi ↗
- 2Meng W. Zhang X. Zhou L. Guo H. Hu X. Advances in UAV Path Planning: A Comprehensive Review of Methods, Challenges, and Future Directions Drones 2025937610.3390/drones 9050376 · doi ↗
- 3Aljalaud F. Kurdi H. Youcef-Toumi K. Bio-inspired multi-UAV path planning heuristics: A review Mathematics 202311235610.3390/math 11102356 · doi ↗
- 4Wayahdi M.R. Ginting S.H.N. Syahputra D. Greedy, A-Star, and Dijkstra’s algorithms in finding shortest path Int. J. Adv. Data Inf. Syst.20212455210.25008/ijadis.v 2i 1.1206 · doi ↗
- 5Hart P.E. Nilsson N.J. Raphael B. A formal basis for the heuristic determination of minimum cost paths IEEE Trans. Syst. Sci. Cybern.1968410010710.1109/TSSC.1968.300136 · doi ↗
- 6Koenig S. Likhachev M. D* lite Proceedings of the Eighteenth National Conference on Artificial Intelligence American Association for Artificial Intelligence Menlo Park, CA, USA 2002476483 Available online: https://dl.acm.org/doi/abs/10.5555/777092.777167(accessed on 27 January 2026)
- 7Ma C.S. Miller R.H. MILP optimal path planning for real-time applications Proceedings of the 2006 American Control Conference, Minneapolis, MN, USA, 14–16 June 2006 IEEE New York, NY, USA 200610.1109/ACC.2006.1657504 · doi ↗
- 8Mac T.T. Copot C. Tran D.T. De Keyser R. Heuristic approaches in robot path planning: A survey Robot. Auton. Syst.201686132810.1016/j.robot.2016.08.001 · doi ↗
