A synergistic enhancement of the Ivy algorithm for GAN-based imbalanced classification
Hanjie Xu, Jian Xiong, Jinyu Wu, Xianlai Zhou, Ronghu Xu, Haozheng Wu

TL;DR
This paper introduces an improved Ivy Algorithm that enhances optimization performance and is applied to improve GANs for classifying imbalanced data.
Contribution
The paper proposes an Enhanced Ivy Algorithm with three synergistic mechanisms for better optimization and GAN hyperparameter tuning.
Findings
E-IVYA outperformed other algorithms on IEEE CEC 2014 and 2017 benchmarks.
E-IVYA-optimized GAN achieved an F1-Score of 0.87 on the Credit-Card Fraud dataset, surpassing SMOTE.
Abstract
The Ivy Algorithm (IVYA), a swarm intelligence algorithm inspired by plant growth, presents a novel framework for optimization. To unlock its full potential in complex, high-dimensional problems, it is crucial to address the fundamental challenge of balancing exploration and exploitation, which can impact overall search efficiency and solution quality. To this end, this paper proposes an Enhanced Ivy Algorithm (E-IVYA) that integrates three synergistic mechanisms. First, a dynamic perturbation framework combining symmetric and asymmetric exploration is introduced to maintain population diversity. Second, a dynamic escape mechanism based on elite differential mutation is employed to prevent search stagnation and effectively escape from local optima. Third, an adaptive movement strategy inspired by the Sine-Cosine Algorithm is integrated to achieve a more adaptive balance between global…
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Click any figure to enlarge with its caption.
Figure 10
Figure 11
Figure 12
Figure 13
Figure 14
Figure 15
Figure 16
Figure 1
Figure 2
Figure 3
Figure 4
Figure 5
Figure 6
Figure 7
Figure 8
Figure 9
Figure 17- —Guangdong Provincial Key Discipline Research Capacity Improvement Project on ’Artificial Intelligence Application Research Based on Medical Imaging Big Data’
- —Guangdong Provincial Modern Industry College Project on ’Intelligent Finance Industry College’
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFinancial Distress and Bankruptcy Prediction · Imbalanced Data Classification Techniques · Stock Market Forecasting Methods
Introduction
In many real-world machine learning applications, the Class Imbalance Problem (CIP) is a pervasive challenge that can severely impact the performance of traditional classification models ^1–3^. This issue, where samples of some classes vastly outnumber others, is particularly critical in domains such as medical diagnosis and financial fraud detection, where the misclassification of minority class instances carries significant consequences ^4,5^. While various data-level approaches exist, classic oversampling techniques like SMOTE, although widely used, may generate synthetic samples that are noisy, borderline, or insufficiently diverse, thus failing to capture the true underlying data distribution ^5^. To overcome these limitations, data augmentation via Generative Adversarial Networks (GANs) has emerged as a promising approach, capable of generating high-quality, realistic synthetic samples for the minority class, thereby balancing the dataset and improving classifier performance ^6,7^. However, the success of GANs is critically dependent on an optimal set of hyperparameters, and their training is notoriously sensitive and complex ^8,9^. Manually tuning these parameters is often inefficient and computationally prohibitive, turning the task of GAN configuration into a complex, high-dimensional optimization problem in its own right ^10^. This challenge falls squarely into the domain of optimization, which is a central task in scientific inquiry and engineering design ^11–13^. Due to the complexity and high computational demands, metaheuristic algorithms have emerged as a class of effective approximate methods for solving such problems ^14,15^.
Metaheuristic algorithms simulate natural phenomena, physical laws, or social behaviors to intelligently search the solution space ^16^. Based on their source of inspiration, these methods are broadly classified into several major branches, including Evolutionary Algorithms (EAs) like the Genetic Algorithm (GA) ^17^, and Swarm Intelligence (SI) algorithms like Particle Swarm Optimization (PSO) ^18^. The field continues to evolve rapidly, with numerous high-performing algorithms proposed recently, such as the Starling Murmurination Optimizer (SMO) ^19^ and the White Shark Optimizer (WSO) ^20^. However, it is important to acknowledge that this rapid proliferation has sparked a critical discussion within the scientific community regarding the novelty of “metaphor-based” algorithms ^21,22^. We concur that an algorithm’s value is determined not by its narrative, but by the principled design of its search operators and its empirical performance. Our focus, therefore, is on principled algorithm engineering.
Despite the wide variety of metaheuristic algorithms, nearly all face a core challenge: achieving an effective balance between global exploration and local exploitation ^23^. Exploration allows the algorithm to search the entire solution space broadly, while exploitation involves a fine-grained search within promising regions to refine solution precision ^24,25^. For an intricate task such as GAN hyperparameter optimization, an overemphasis on exploitation can trap the search in a suboptimal set of parameters, whereas excessive exploration may lead to inefficient convergence ^26,27^. This underscores the necessity of developing improved algorithms specifically engineered to navigate this trade-off more intelligently.
In line with the algorithm engineering focus established above, this paper treats the recently introduced Ivy Algorithm (IVYA) ^28^ not as a novel metaphor, but as a representative baseline structure that exhibits these exact challenges. To this end, we propose an Enhanced Ivy Algorithm (E-IVYA). We introduce a synergistic set of non-metaphorical enhancement strategies, designed to directly address the fundamental, general challenges of metaheuristic optimization:
- To counteract the loss of population diversity, we introduce a Dynamic Perturbation Framework.
- To overcome search stagnation in local optima, we design a Dynamic Perturbation Escape Mechanism.
- To resolve the static balance between search behaviors, we incorporate an Adaptive Movement and Balancing Mechanism. By synergistically combining these improvements, this work aims to develop a robust and efficient optimization tool, validated through rigorous benchmarking and a complex, real-world application in automated machine learning.
Motivation and proposed enhancements for the Ivy algorithm
Within the vast landscape of metaheuristic algorithms, the selection of a suitable foundational algorithm for in-depth research is of paramount importance. In line with the principled algorithm engineering focus established in our Introduction, this study selects the Ivy Algorithm (IVYA) ^28^ as the basis for enhancement.
Our decision is predicated not on its bio-inspired narrative, but rather on its clear and modular structure, which provides an ideal ’experimental platform’. This platform allows us to rigorously test the efficacy of several promising enhancement strategies designed to overcome common challenges in the field of metaheuristics.
Our preliminary analysis confirmed that the baseline IVYA framework, like many population-based algorithms, exhibits distinct and addressable limitations, particularly in maintaining population diversity and handling search stagnation. Therefore, enhancing IVYA provides a valuable case study for exploring the design paradigms of robust optimization algorithms by systematically addressing these general, non-metaphorical deficiencies. To this end, this paper proposes an Enhanced Ivy Algorithm (E-IVYA), which integrates three synergistic strategies. The primary motivations and corresponding engineering contributions–each targeting a specific, well-defined limitation–are summarized as follows:
- A Dynamic Perturbation Framework Combining Symmetric and Asymmetric Exploration: To address the common challenge of maintaining population diversity, a phenomenon that can lead to premature convergence in many metaheuristics, E-IVYA introduces a novel framework that functions as a hierarchical diversity control system. Its foundational component, a symmetric exploration operator based on elite-guided oppositional learning, continuously refines the population structure. This is complemented by a macro-level intervention, an asymmetric shock operator based on an adaptive t-distribution, which is probabilistically triggered during stagnation to introduce radical, non-patterned perturbations, thereby ensuring the population maintains its exploratory potential.
- Dynamic Perturbation Escape Mechanism based on Elite Differential Mutation: To equip the algorithm with a dedicated mechanism for addressing search stagnation, where individuals can become trapped in local optima, we have designed this mechanism. It employs a stagnation counter to detect periods of non-improvement. When stagnation is identified, it triggers an intelligent perturbation derived from the differential vectors of elite individuals in the current population. This creates a guided, high-potential leap away from the local trap, effectively reactivating the search process by leveraging the collective intelligence of the best-found solutions.
- An Adaptive Movement and Balancing Mechanism Based on Trigonometric Functions: To achieve a more adaptive and dynamic balance between search behaviors according to the evolving requirements of the optimization process, E-IVYA incorporates this mechanism. It integrates the periodic oscillating properties of sine and cosine functions into the position update formula. Coupled with a parameter that decays dynamically with iterations, this mechanism enables a seamless transition from large-scale, exploratory movements in the early phases to small-scale, precise adjustments in the later stages, thus optimizing the exploration-exploitation trade-off. By synergistically combining these three targeted improvements, the E-IVYA framework demonstrates significant enhancements in robustness, convergence accuracy, and overall optimization efficiency. This is achieved by systematically engineering solutions to address the primary, well-defined limitations of the baseline algorithm.
Related work
A brief introduction to the Ivy algorithm
The Ivy Algorithm (IVYA) is a novel nature-inspired metaheuristic optimization algorithm ^28^. It draws inspiration from the intelligent behaviors of ivy plants, such as their ability to grow, climb, and intertwine towards sunlight. IVYA aims to provide effective solutions for complex global optimization problems. The algorithm simulates key mechanisms of an ivy population, including initialization, growth strategies, and selection. The main steps of IVYA are as follows:
Step 1 Initialize the Ivy Population and Growth Vectors
Initially, a population of \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$N_{Pop}$$\end{document} ivy plants is created. Each plant \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$I_i$$\end{document} represents a potential solution. The position of each plant is randomly initialized within the search space defined by lower bounds \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$I_{min}$$\end{document} and upper bounds \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$I_{max}$$\end{document} . Equation (1) shows this initialization:
\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\begin{aligned} I_{i} = I_{min} + \text {rand}(1,D) \odot (I_{max} - I_{min}) \end{aligned}$$\end{document}where \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$I_{i}$$\end{document} is the position vector of the i-th plant, D is the problem’s dimension, and \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\text {rand}(1,D)$$\end{document} is a vector of D uniformly distributed random numbers in [0, 1]. The symbol \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\odot$$\end{document} denotes the Hadamard product.
Each plant \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$I_i$$\end{document} also possesses a Growth Vector (GV), \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\Delta Gv_i$$\end{document} . For the first iteration (Iter=1), the initial GV is calculated using Eq. (2):
\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\begin{aligned} \Delta Gv_{i} = I_{i} \oslash (I_{max} - I_{min}) \end{aligned}$$\end{document}where \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\oslash$$\end{document} denotes the Hadamard division.
Step 2 Ivy Growth and Movement Strategy
In each subsequent iteration, ivy plants update their GVs and generate new positions. This process involves several sub-steps:
Sub-step 2.1. Update Existing Growth Vector:
The GV, \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\Delta Gv_i$$\end{document} , of the current plant \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$I_i$$\end{document} is updated using Eq. (3):
\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\begin{aligned} \Delta Gv_{i}(t+1) = \text {rand}^2 \cdot (N(1,D) \odot \Delta Gv_{i}(t)) \end{aligned}$$\end{document}where \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\Delta Gv_{i}(t)$$\end{document} is the GV at iteration t, \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\text {rand}$$\end{document} is a scalar random number in [0, 1], and N(1, D) is a vector of D random numbers from the standard normal distribution.
Sub-step 2.2. Generate New Position:
A new position \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$I_{i}^{\text {new}}$$\end{document} is generated based on a decision parameter \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\beta = (2 + \text {rand})/2$$\end{document} and a comparison between the fitness \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$f(I_i)$$\end{document} of the current plant and the fitness \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$f(I_{Best})$$\end{document} of the best plant in the population.
- If \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$f(I_i) < \beta \cdot f(I_{Best})$$\end{document} , the plant employs a local search strategy (climbing growth). The new position is generated by learning from a ’more vital neighbor’ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$I_{ii}$$\end{document} (selected based on specific rules) as shown in Eq. (4):
where |N(1, D)| contains the absolute values of elements in N(1, D).
- Otherwise, the plant employs a global search strategy (spreading towards light). The new position is primarily guided by the best plant \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$I_{Best}$$\end{document} , as shown in Eq. (5):
Sub-step 2.3. Update Growth Vector for the New Position:
After generating \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$I_{i}^{\text {new}}$$\end{document} , its corresponding GV, \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\Delta Gv_{i}^{\text {new}}$$\end{document} , is updated using Eq. (6):
\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\begin{aligned} \Delta Gv_{i}^{\text {new}} = I_{i}^{\text {new}} \oslash (I_{max} - I_{min}) \end{aligned}$$\end{document}Sub-step 2.4. Boundary Handling:
All newly generated positions \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$I_{i}^{\text {new}}$$\end{document} are checked and adjusted to ensure they remain within the predefined search space boundaries \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$[I_{min}, I_{max}]$$\end{document} .
Step 3 Population Update and Selection Mechanism
At the end of each iteration, the population from the previous iteration is merged with the newly generated population of ivy plants. This combined population is then sorted based on the fitness values of all individual plants, from best to worst. To maintain a constant population size, the top \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$N_{Pop}$$\end{document} plants with the best fitness values are selected to form the population for the next iteration. This elitist selection ensures that promising solutions are preserved and carried forward.
IVYA continues these iterative steps, simulating the intelligent growth behavior of ivy, and balancing exploration with exploitation to converge towards the global optimal solution.
An overview of SCA
The Sine-Cosine Algorithm (SCA) is a metaheuristic algorithm designed to solve optimization problems, particularly those with unknown search spaces ^29^. SCA utilizes the mathematical properties of sine and cosine functions to iteratively update the position of each search agent in the solution space, aiming to find the optimal solution ^30,31^. For a given problem, the set of variables, constraints, and the objective function together define its search space ^32^. This search space may contain multiple local optima, but only one is the global optimum. SCA balances exploration and exploitation to identify promising regions within the search space and ultimately converge to the global optimum ^33^.
The operational mechanism of SCA involves different phases. During the initial exploration phase, SCA combines random solutions and incorporates a high degree of randomness in the movement of search agents. This encourages a broad search to locate promising areas. As the algorithm transitions to the development (exploitation) phase, the changes in solution positions become more gradual. The stochastic variation in this later stage is significantly smaller than in the exploration phase. The update of the optimal solution employs Eqs. (1) or (2) based on the condition of parameter \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$r_4^t$$\end{document} .
The core of SCA is its position update mechanism, which dictates how each search agent moves towards the best solution found so far. The process of updating an agent \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$X_i^t$$\end{document} to its new position \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$X_i^{t+1}$$\end{document} is guided by the current best solution \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$P_i^t$$\end{document} (the destination point). This update selectively uses either sine or cosine functions, based on the value of a random parameter \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$r_4^t$$\end{document} . The position update equations are as follows:
\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\begin{aligned} X_i^{t+1}= & X_i^t + r_1^t \times \sin (r_2^t) \times |r_3^t P_i^t - X_i^t|, \quad \text {if } r_4^t < 0.5 \end{aligned}$$\end{document} \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\begin{aligned} X_i^{t+1}= & X_i^t + r_1^t \times \cos (r_2^t) \times |r_3^t P_i^t - X_i^t|, \quad \text {if } r_4^t \ge 0.5 \end{aligned}$$\end{document}In these equations, \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$X_i^t$$\end{document} is the position of the current solution i at iteration t, and \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$P_i^t$$\end{document} is the position of the best solution found up to iteration t. The parameters \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$r_1^t, r_2^t, r_3^t,$$\end{document} and \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$r_4^t$$\end{document} are crucial for guiding the search. Parameter \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$r_1^t$$\end{document} determines the region of the next position (or the direction of movement); its value is typically adjusted over iterations to shift from exploration to exploitation. Parameter \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$r_2^t$$\end{document} defines the distance the movement should cover towards or away from the target. Parameter \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$r_3^t$$\end{document} assigns a random weight to the destination, stochastically emphasizing (if \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$r_3^t > 1$$\end{document} ) or de-emphasizing (if \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$r_3^t < 1$$\end{document} ) the destination’s influence on the movement distance. Finally, \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$r_4^t$$\end{document} is a uniformly distributed random number used to switch between the sine and cosine functions. Through the iterative application of these updates and the adaptive nature of its parameters, SCA navigates the search space to find optimal or near-optimal solutions.
Advanced metaheuristic strategies and applications in deep learning
The continuous evolution of the metaheuristic field has led to several key research paradigms aimed at enhancing optimization performance. A prominent research direction focuses on improving a single base algorithm by integrating multiple enhancement mechanisms to systematically address inherent limitations such as premature convergence. For example, Wang et al. ^34^ proposed an improved reptile search algorithm with multi-population evolution for image segmentation, while Jia et al. ^35^ developed a multi-strategy enhanced dung beetle optimizer for engineering design. These studies validate the effectiveness of synergistically combining strategies within a single framework. Our work on E-IVYA follows a similar philosophy, focusing on the internal enhancement of the base IVYA to create a more robust and powerful optimizer.
Another powerful paradigm is the creation of hybrid metaheuristic frameworks, which combine the operators or strategies of two or more distinct algorithms, often to leverage the exploration strengths of one with the exploitation prowess of another. For instance, recent research has explored the FOX-TSA hybridization ^36^, FOXTSA and Foxtsage-type optimizers ^37,38^, which integrate metaheuristic and learning-based refinement mechanisms. In contrast to these high-level hybridization strategies, our proposed E-IVYA focuses on the deep, synergistic integration of multiple mechanisms within a single algorithmic core, allowing for a more seamless and adaptive control over the search process.
The application of these advanced metaheuristics to automate the tuning of deep learning models, particularly Generative Adversarial Networks (GANs), is a challenging and highly relevant research frontier. Due to the high-dimensional and computationally expensive nature of the hyperparameter search space, metaheuristics offer a significant advantage over manual tuning. Recent studies have demonstrated this potential: an evolutionary approach was used for hyperparameter tuning of a GAN for melanoma detection ^39^, and a Variable Neighborhood Search (VNS) was applied to a GAN-based seismic interpolator ^40^. Beyond just tuning, metaheuristics like Particle Swarm Optimization (PSO) have been used for GAN architecture search ^41^ and to optimize ANN models ^42^. These works confirm the viability of applying metaheuristics to complex neural network optimization tasks. Our research contributes to this domain by applying a novel, multi-strategy enhanced algorithm (E-IVYA) to the GAN hyperparameter optimization problem, demonstrating its practical utility in solving computationally expensive, real-world automated machine learning challenges.
Methods
A dynamic perturbation framework combining symmetric and asymmetric exploration
To systematically address the decay of population diversity that can lead to premature convergence, this paper introduces a novel dynamic perturbation framework, conceptually illustrated in Fig. 1. This framework integrates two mechanistically distinct yet complementary operators to form a resilient diversity maintenance system.
The foundational component is a symmetric exploration operator, implemented via an Elite-Guided Opposition-Based Learning (EOL) mechanism. This operator establishes a dynamic center of symmetry guided by the current global best solution, \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$I_{\text {Best}}$$\end{document} , to generate new candidate solutions. The scope of this symmetric exploration is controlled by an adaptive scaling factor, \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$m_{\text {obl}}$$\end{document} , as defined in Eq. 9, while the new position is calculated according to Eq. 10.
\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\begin{aligned} m_{\text {obl}}= & (1 + \text {iter}/\text {MaxIter})^{c1} \end{aligned}$$\end{document} \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\begin{aligned} I'_{kj}= & C_j + (C_j - I_{\text {Best},j}) / m_{\text {obl}} - (I_{kj} - C_j) / m_{\text {obl}} \end{aligned}$$\end{document}This is complemented by an asymmetric perturbation operator, which acts as a powerful instrument for breaking convergence inertia. This operator is designed to introduce a high-magnitude, non-patterned stochastic shock using a t-distribution, which is particularly effective for escaping local optima. Its behavior is governed by a dynamically evolving degrees of freedom parameter, \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$t_r$$\end{document} , shown in Eq. 11. When probabilistically triggered, a new position is generated using Eq. 12.
\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\begin{aligned} t_r= & \exp (1 + (1 + \text {iter}/\text {MaxIter})^{c2}) \end{aligned}$$\end{document} \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\begin{aligned} I''_{kj}= & C_j + (I_{\text {Best},j} - C_j) \cdot \text {rand}() \cdot S_j \end{aligned}$$\end{document}where \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$S_j$$\end{document} is a random sample drawn from a t-distribution with \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$t_r$$\end{document} degrees of freedom. The dynamic interplay between these two operators, as detailed in the pseudocode in Algorithm 1 (lines 21-34), endows E-IVYA with a superior diversity maintenance capability.Fig. 1A Dynamic Perturbation Framework Combining Symmetric and Asymmetric Exploration.
Dynamic perturbation escape mechanism based on elite differential mutation
To address the critical challenge of search stagnation, where individuals become trapped in local optima, we designed a Dynamic Perturbation Escape Mechanism based on Elite Differential Mutation. The core idea is to leverage the collective intelligence of the elite subpopulation to guide the search out of a suboptimal region, rather than resorting to blind or random perturbations.
The mechanism is triggered when a stagnation counter, \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\text {stagnation\_count}$$\end{document} , exceeds a predefined threshold, \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\text {MaxStagnation}$$\end{document} . As outlined in Algorithm 1 (lines 42-52), the process involves three key steps. First, a Dynamic Elite Archive (DEA) is constructed by selecting the top \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$p\%$$\end{document} of individuals from the current population:
\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\begin{aligned} DEA = \{I_i | I_i \in \text {Top}_p(\text {Pop})\} \end{aligned}$$\end{document}Second, inspired by the principles of Differential Evolution, an Elite Differential Perturbation Vector ( \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$V_{\text {edp}}$$\end{document} ) is generated using three distinct elite individuals ( \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$I_{r1}, I_{r2}, I_{r3}$$\end{document} ) randomly selected from the DEA:
\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\begin{aligned} V_{\text {edp}} = I_{r1} + F \cdot (I_{r2} - I_{r3}) \end{aligned}$$\end{document}where F is a scaling factor. Finally, this vector is applied to a targeted stagnated individual \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$I_i$$\end{document} to calculate its new escape position:
\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\begin{aligned} I_{i}^{\text {new}} = I_i + V_{\text {edp}} \end{aligned}$$\end{document}This process creates a guided, high-potential leap away from the local trap. The differential vector, \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$F \cdot (I_{r2} - I_{r3})$$\end{document} , provides the necessary exploratory step, while the base vector \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$I_{r1}$$\end{document} anchors the perturbation in a high-performance region, thereby maintaining a strong exploitative tendency. This intelligent fusion of exploration and exploitation significantly enhances the global search performance and overall robustness of E-IVYA.
Adaptive movement strategy integrating sine-cosine algorithm principles
To address the relatively static exploration-exploitation balance in the original IVYA, we introduce a more adaptive individual movement strategy by integrating the core principles of the Sine-Cosine Algorithm (SCA). This enhancement, conceptually illustrated in Fig. 2, replaces the two original position update equations with new formulas that provide a dynamic and smooth transition between global and local search.
The selection between the local and global search modes still follows IVYA’s original \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\beta$$\end{document} condition logic. If the local growth condition is met, the new position \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$I_i^{\text {new}}$$\end{document} is updated according to Eq. 16:
\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\begin{aligned} I_i^{\text {new}} = I_i + r_1 \cdot \sin (r_2) \odot (|N(1,D)| \odot (I_{ii} - I_i)) + r_1 \cdot \cos (r_3) \odot (N(1,D) \odot \Delta Gv_i) \end{aligned}$$\end{document}Otherwise, the individual executes a global exploratory movement regulated by the SCA mechanism, as shown in Eq. 17:
\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\begin{aligned} I_i^{\text {new}} = I_i + r_1 \cdot \sin (r_2) \odot (r_4 \cdot I_{\text {Best}} - I_i) + r_1 \cdot \cos (r_3) \odot \Delta Gv_i \end{aligned}$$\end{document}The key to this adaptive behavior lies in the SCA adjustment parameters. Parameter \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$r_1$$\end{document} is a dynamic factor calculated as \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$r_1 = a - a \cdot (\text {iter}/\text {MaxIter})$$\end{document} (where \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$a=2$$\end{document} ), which linearly decreases over iterations. This systematically reduces the movement step size, guiding the algorithm from broad exploration to fine-grained exploitation. Parameters \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$r_2, r_3 \in [0, 2\pi ]$$\end{document} and \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$r_4 \in [0, 1]$$\end{document} are random numbers that introduce diverse oscillatory characteristics and stochasticity to the search process. This adaptive update is performed for each individual in the main loop of the algorithm, as shown in Algorithm 1 (line 16), achieving a significantly more flexible and efficient dynamic equilibrium between the search phases.Fig. 2. Adaptive Movement Strategy Integrating Sine-Cosine Algorithm Principles.
E-IVYA: an enhanced Ivy algorithm
The Enhanced Ivy Algorithm (E-IVYA) significantly refines the original IVYA by integrating three core mechanisms that operate synergistically. This integration is designed to overcome the inherent limitations of IVYA when addressing complex optimization problems. These mechanisms work together to elevate the algorithm’s overall performance.
First, the Elite-Guided Opposition-Based Learning and Adaptive Perturbation Enhancement Strategy systematically bolsters population diversity. It achieves this by combining elite-information-guided opposition-based learning with probabilistic t-distribution perturbations. This strategy broadens the algorithm’s effective search range and mitigates the risk of premature convergence. Second, the Stagnation Response and Adaptive Growth Direction Adjustment Mechanism markedly improves the algorithm’s capacity to escape local optimal regions. This is accomplished by monitoring the algorithm’s convergence state in real-time and proactively adjusting the growth update direction of certain individuals when search stagnation is detected, thereby re-energizing the exploration process. Third, the Adaptive Movement Strategy Integrating Sine-Cosine Algorithm Principles optimizes IVYA’s individual movement methodology. It introduces dynamically varying adjustment parameters that change with iterations and incorporates the periodic oscillatory characteristics of trigonometric functions. This leads to a more flexible and efficient dynamic equilibrium between global exploration and local exploitation. The organic combination and synergistic effect of these three key mechanisms endow E-IVYA with stronger global optimization capabilities, superior convergence characteristics, and increased robustness when solving complex continuous optimization problems. This effectively addresses the potential deficiencies of the original IVYA.
Algorithm 1The pseudo-code of the E-IVYA
Computational complexity analysis of E-IVYA
An analysis of the computational complexity is essential for evaluating the efficiency of the proposed E-IVYA. The complexity of the original IVYA is primarily determined by population initialization, fitness evaluation, and sorting, resulting in an overall complexity of \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$O(\text {MaxIter} \cdot (N_{\text {Pop}} \log N_{\text {Pop}} + N_{\text {Pop}} \cdot D + N_{\text {Pop}} \cdot f))$$\end{document} .
The E-IVYA introduces three enhancement mechanisms. The SCA-based movement strategy and the EOL mechanism involve vector operations, adding a complexity of \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$O(N_{\text {Pop}} \cdot D)$$\end{document} per iteration. The elite differential mutation escape mechanism involves sorting to build the elite archive ( \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$O(N_{\text {Pop}} \log N_{\text {Pop}})$$\end{document} ) and performing vector operations on a subset of the population. Therefore, the overall computational complexity of E-IVYA for a complete run remains in the same order as the original, approximated as \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$O(\text {MaxIter} \cdot (N_{\text {Pop}} \log N_{\text {Pop}} + N_{\text {Pop}} \cdot D + N_{\text {Pop}} \cdot f))$$\end{document} .
While the enhancement strategies introduce a modest increase in computational overhead per iteration compared to the baseline IVYA, this cost is comparable to that of many other state-of-the-art metaheuristic algorithms. As demonstrated by the experimental results, this slight increase in computational cost yields a significant improvement in optimization performance, particularly in convergence speed and solution accuracy. This represents a favorable trade-off, especially for tackling complex and computationally expensive AI problems where solution quality is paramount.
Experimental environment and setting
Experimental fundamentals
To rigorously evaluate the comprehensive performance of the E-IVYA algorithm proposed in this study, all experiments were conducted on a unified computing platform to ensure a fair comparison. The experimental environment consisted of a Windows 11 operating system, with the algorithm implementation and simulations based on MATLAB R2021b. The hardware configuration of the computer used was an Intel(R) Core(TM) i7-11700K CPU @ 3.60GHz, equipped with 32 GB of RAM. This evaluation selected two internationally recognized standard test suites, IEEE CEC 2014 and IEEE CEC 2017, to examine the performance of E-IVYA on optimization problems with varying levels of complexity and different characteristics ^43^. Considering the inherent stochastic nature of metaheuristic algorithms, this study employs the non-parametric Wilcoxon signed-rank test. This statistical method is utilized to analyze the significance of performance differences among the algorithms, thereby providing stronger statistical support for the experimental conclusions.
Comparison algorithms
To comprehensively and objectively validate the performance and effectiveness of the proposed E-IVYA algorithm, this study selected a series of representative metaheuristic algorithms for a comprehensive performance comparison. The selection of these algorithms covers the original baseline algorithm, classic metaheuristics, advanced variants of differential evolution, and several recently proposed novel algorithms. This approach aims to rigorously evaluate the competitiveness of E-IVYA from multiple perspectives. The specific comparison algorithms selected are as follows:
Original Baseline Algorithm Original Ivy Algorithm (IVYA) is included as the most important baseline for comparison ^28^. The purpose is to directly and clearly measure the comprehensive performance improvement brought by the various enhancement strategies proposed in this study.
Classic and Widely-Used Metaheuristic Algorithms Particle Swarm Optimization (PSO) ^18^, Grey Wolf Optimizer (GWO) ^44^, Differential Evolution (DE) ^17,45^, and Reptile Search Algorithm (RSA) were chosen because they are classic methods in the optimization field that have been long-validated, are widely used, and have recognized performance ^46^. Comparing E-IVYA with these algorithms allows for an evaluation of its performance advantages against standard metaheuristics.
Advanced Differential Evolution Variants: Adaptive Differential Evolution with Optional External Archive (JADE) and Success-History based Adaptive DE with Linear Population Size Reduction (LSHADE) are included ^47,48^. Differential Evolution is a powerful paradigm for solving continuous optimization problems. As highly successful variants with advanced adaptive mechanisms, JADE and LSHADE represent a high level of performance in the field of evolutionary algorithms. Comparison with them helps to test the competitiveness of E-IVYA against top-tier evolutionary algorithms when tackling complex optimization problems.
Recently Proposed Novel Optimization Algorithms: Animated Oat Optimization (AOO) algorithm ^49^, Liver Cancer algorithm (LCA) ^50^, and Water Uptake and Transport in Plants (WUTP) algorithm are selected ^51^. To examine the performance of E-IVYA in comparison to recent trends in optimization algorithm research, this study selected these newer algorithms. Comparing with them allows for a better assessment of E-IVYA’s novelty and its advancement within the current context of algorithm development.
By conducting a comprehensive comparison with the carefully selected algorithms listed above, the performance level of E-IVYA can be more reliably positioned. This also enables a deeper analysis of the specific contributions of its various enhancement mechanisms under different types of optimization challenges. The algorithm parameter Settings are shown in Table 1.Table 1. Parameter settings for the proposed E-IVYA and the comparison algorithms.AlgorithmParameterValueE-IVYASCA initial control parameter (a)2DE scaling factor (F)0.5Elite archive percentage (p)0.1Stagnation threshold (MaxStagnation)30Asymmetric perturbation probability ( \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$t_k$$\end{document} )0.05Symmetric exploration exponent (c1)2Asymmetric exploration exponent (c2)2IVYABehavioral switch parameter ( \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\beta$$\end{document} ) \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$1 + \text {rand}/2 \in [1, 1.5]$$\end{document} PSOAcceleration constants0.2GWO \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\vec {a}$$\end{document} linearly decreased from 2 to 0DEcrossover rate0.8scale factor primary0.6use mutation scheme1use sorted selection0RSAEvolutionary Sense \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$2 \times \text {randn} \times (1-(iter/maxiter))$$\end{document} Sensitive parameter controlling the exploration accuracy0.05Sensitive parameter controlling the exploitation accuracy0.1JADEc0.1UCR0.5UF0.5top1LSHADE \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$r^{N^{init}}$$\end{document} 30 \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$r^{arc}$$\end{document} 1.4p0.11Historical memory size (H)5AOOLevy flight distribution parameter ( \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\beta$$\end{document} )1.5Decay factor exponent3Elite group divisor10LCATumor growth rate (p)2/3Levy flight distribution parameter ( \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\beta$$\end{document} )1.5WUTPAlternate parameter (p)0.5Probability threshold (rr)0.1Control parameter ( \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\chi$$\end{document} )0.5
Evaluation metrics
In this study, four main metrics were employed to evaluate the performance of the proposed E-IVYA algorithm: diversity, exploration, exploitation, and the convergence curve ^52–54^. These metrics are designed to assess the algorithm’s ability to explore the search space, its performance in global search (exploration), and its effectiveness in local search within identified potential optimal regions (exploitation).
Diversity Evaluation. Evaluating the population diversity of E-IVYA is crucial for measuring its ability to effectively explore and exploit the search space. This diversity is quantified using Equations (X) and (Y):
\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\begin{aligned} \text {Diversity}= & \frac{1}{N_{Pop}} \sqrt{ \sum _{i=1}^{N_{Pop}} \sum _{j=1}^{D} (I_{ij} - \bar{I}_j)^2 } \end{aligned}$$\end{document} \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\begin{aligned} \bar{I}_j= & \frac{1}{N_{Pop}} \sum _{i=1}^{N_{Pop}} I_{ij} \end{aligned}$$\end{document}Here, \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$N_{Pop}$$\end{document} is the total population size, D is the problem dimension, and \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\bar{I}_j$$\end{document} is the average position of the population in the j-th dimension.
Exploration and Exploitation Evaluation Based on population diversity, the exploration and exploitation behaviors can be defined by Equations (Z) and (W):
\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\begin{aligned} \text {Exploration}= & \frac{\text {Diversity}}{\text {Diversity}_{max}} \end{aligned}$$\end{document} \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\begin{aligned} \text {Exploitation}= & \frac{|\text {Diversity}_{max} - \text {Diversity}|}{\text {Diversity}_{max}} \end{aligned}$$\end{document}where \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\text {Diversity}$$\end{document} is the population diversity of the current iteration, and \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\text {Diversity}_{max}$$\end{document} is the maximum diversity value recorded throughout the entire run.
Convergence Curve The convergence curve is key to evaluating an algorithm’s performance, as it visually demonstrates the rate at which the algorithm converges towards the optimal solution during the iterative process. Typically, the convergence curve plots the relationship between the objective fitness value and the number of iterations. The slope of the curve can serve as an indicator of the convergence speed towards the optimum. In the initial phase of the curve, a significant drop usually indicates exploration activities, whereas subsequent phases show more gradual improvements, representing the exploitation capability of the search process.
Results and discussion
This chapter aims to comprehensively evaluate the performance of the proposed Enhanced Ivy Algorithm (E-IVYA), based on the experimental environment and setup detailed in Section 5. To clearly demonstrate the advantages of E-IVYA, this chapter is organized as follows: First, Experiment 1 provides a qualitative analysis of E-IVYA’s search behavior, with a focus on its population diversity and exploration-exploitation dynamics. Subsequently, Experiment 2 and Experiment 3 present extensive quantitative comparisons of E-IVYA’s performance against other algorithms on the IEEE CEC 2014 and CEC 2017 benchmark test suites. Building on this, Experiment 4 validates the significance of the performance differences through rigorous statistical tests.
Experiment 1: diversity, exploration and exploitation analysis
Diversity Analysis Population diversity is a critical metric for evaluating a metaheuristic algorithm’s capacity to explore the search space thoroughly and avoid premature convergence to local optima. This analysis verifies the effectiveness of E-IVYA’s enhancement mechanisms in maintaining exploratory vitality. To assess this capability, three representative functions were selected: the unimodal function F1, the multimodal function F8, and the composite function F21. The experimental results indicate that E-IVYA demonstrates a robust ability to maintain population diversity throughout the iterative process. Particularly when dealing with complex multimodal and composite functions, although diversity naturally decreases as the algorithm converges on promising regions, E-IVYA periodically injects new vitality into the population. This capability is primarily attributed to the ’Elite-Guided Opposition-Based Learning and Adaptive Perturbation Enhancement Strategy,’ which allows the population diversity to recover from temporary declines, thereby preventing search stagnation. The diversity change curves for these functions are presented in Fig. 7.
Exploration and Exploitation Analysis The balance between exploration and exploitation is a core indicator of a metaheuristic algorithm’s performance. This section evaluates E-IVYA’s ability to dynamically coordinate these two search behaviors throughout the optimization process, using the unimodal function F1, multimodal function F8, and composite function F23 for assessment. Compared to the original IVYA, E-IVYA exhibits a significantly improved ability to manage this balance. When handling unimodal problems, it transitions rapidly from exploration to exploitation for fast convergence. Conversely, when facing more complex multimodal and composite functions, it sustains a more effective equilibrium between the two behaviors. This is attributed to the ’Adaptive Movement Strategy Integrating SCA Principles,’ which facilitates a high level of exploration in the early stages and smoothly shifts to exploitation for fine-tuning solutions later. While a perfect balance remains a challenge in high-dimensional problems, the dynamic adjustment capability of E-IVYA is demonstrably to that of the baseline algorithm, as shown in Fig. 8.
In summary, the experimental results clearly demonstrate that the enhancement strategies integrated into E-IVYA synergistically improve its performance. The ’Elite-Guided Opposition-Based Learning and Adaptive Perturbation Enhancement Strategy’ is principally responsible for maintaining high population diversity. Concurrently, the ’Adaptive Movement Strategy Integrating SCA Principles,’ through its dynamic parameter \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$r_1$$\end{document} , achieves an effective and smooth transition between exploration and exploitation. Furthermore, the ’Stagnation Response and Adaptive Growth Direction Adjustment Mechanism’ provides a critical escape capability, ensuring the continuity of the search process when the algorithm stagnates. The collective effect of these mechanisms significantly enhances E-IVYA’s robustness and efficiency in solving complex optimization problems.
Experiment 2: algorithm performance comparison on IEEE CEC 2014
To evaluate the performance of E-IVYA under different dimensions, this section presents a comprehensive comparison with a series of comparison algorithms on the IEEE CEC 2014 test suite. The experiments were conducted under both 50-dimension (50D) and 100-dimension (100D) settings. All algorithms were run independently for 30 rounds in a unified environment, with 500 iterations per round. The detailed quantitative results of the experiments are recorded in Table 4 (50D) and Table 5 (100D), while the convergence curves for some representative functions are shown in Fig. 9 (50D) and Fig. 10 (100D).
The experimental results clearly show that E-IVYA demonstrates a leading performance on most test functions, leading in the overall rankings under both 50D and 100D challenges. Its superior performance can be attributed to the synergistic effect of its three innovative mechanisms. For unimodal functions, the ’Adaptive Movement Strategy Integrating SCA Principles’ ensures fast and precise convergence through refined local search. For the more complex multimodal, hybrid, and composite functions, the ’Elite-Guided Opposition-Based Learning and Adaptive Perturbation Enhancement Strategy’ and the ’Stagnation Response Mechanism’ work together to maintain population diversity and effectively escape local optima. The organic combination of these mechanisms enables E-IVYA to efficiently balance exploration and exploitation, thereby consistently obtaining high-quality solutions in various complex optimization problems.
Experiment 3: algorithm performance comparison on IEEE CEC 2017
To further examine the performance and robustness of E-IVYA on more challenging optimization problems, this subsection utilizes the IEEE CEC 2017 benchmark test suite for evaluation. This test suite also includes unimodal, multimodal, hybrid, and composite functions, but it introduces more complex rotations, shifts, and combinations in their characteristics, posing higher demands on the comprehensive optimization capabilities of an algorithm. The comparative analysis in this section continues with the previous experimental setup, also conducted under 50-dimension (50D) and 100-dimension (100D) settings.
The detailed quantitative comparison results are presented in Table 6 (50D) and Table 7 (100D), with the corresponding convergence curves shown in Fig. 11 (50D) and Fig. 12 (100D). The experimental results demonstrate that E-IVYA continues to exhibit strong optimization capabilities in the more complex CEC 2017 test environment, achieving the best fitness values on most functions in both dimensions. This success is attributed to the robust balance between exploration and exploitation in its design. It is noteworthy that the group of comparison algorithms includes advanced differential evolution variants such as JADE and LSHADE. E-IVYA still shows a significant advantage in competition with these top-tier algorithms, with its performance being particularly outstanding in high-dimensional scenarios.
The excellent performance of E-IVYA on the CEC 2017 test suite further confirms the effectiveness of its combined enhancement strategies. The ’Elite-Guided Opposition-Based Learning and Adaptive Perturbation Enhancement Strategy’ maintains necessary population diversity by continuously optimizing the population structure, while the ’Adaptive Movement Strategy Integrating SCA Principles’ and the ’Stagnation Response Mechanism’ together ensure the algorithm’s efficient exploration and stable convergence in complex search landscapes. The results in Tables 8 and 9 consistently demonstrate that E-IVYA achieves remarkably low p-values when compared against the vast majority of algorithms on both the CEC 2014 and the more complex CEC 2017 test suites. At the conventional significance levels of \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\alpha =0.1$$\end{document} and \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\alpha =0.05$$\end{document} , nearly all* p*-values are well below the threshold, allowing for the rejection of the null hypothesis. This indicates that the performance advantage of E-IVYA is statistically significant and not a result of random chance. These findings provide strong statistical support for the conclusion that the proposed E-IVYA algorithm is significantly superior to the other comparison algorithms selected for this study. These results provide strong evidence that E-IVYA, as a robust optimization method, has strong potential for solving challenging and complex optimization problems.
Experiment 4: statistical experiments
To statistically validate the performance advantage of E-IVYA over its counterparts, this section employs the Wilcoxon signed-rank test ^55^. This non-parametric method is well-suited for evaluating the differences between paired samples, as it does not assume a normal data distribution. The analysis utilizes the data from the 30 independent runs for each algorithm on the IEEE CEC 2014 and IEEE CEC 2017 test suites, ensuring consistency with the previous experimental sections.
Tables 8 and 9 present the results of the pairwise comparisons between E-IVYA and each of the other algorithms, detailing the calculated p-values. The fundamental argument of this study is that E-IVYA’s performance is significantly superior. In this context, a p-value below a preset significance level ( \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\alpha$$\end{document} ) permits the rejection of the null hypothesis–which posits no significant performance difference–and thus confirms that E-IVYA’s advantage is statistically significant.
The results in Tables 8 and 9 consistently demonstrate that E-IVYA achieves remarkably low p-values when compared against the vast majority of algorithms on both the CEC 2014 and the more complex CEC 2017 test suites. Consequently, at the conventional significance levels of \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\alpha =0.1$$\end{document} and \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\alpha =0.05$$\end{document} , the superiority of E-IVYA receives strong statistical support. These analytical findings support the conclusion that the proposed E-IVYA algorithm is significantly superior, in terms of performance, to the other comparison algorithms selected for this study.
Experiment 5: ablation study of E-IVYA components
To rigorously validate the individual contribution of each enhancement mechanism proposed in this paper, an ablation study was conducted. This experiment aims to deconstruct the E-IVYA framework and quantify the performance impact of its three core components: the Dynamic Perturbation Framework (DPF), the Elite Differential Mutation (EDM) escape mechanism, and the Adaptive Movement Strategy integrating SCA principles (SCA). For this purpose, we compared the performance of the baseline IVYA, the final integrated E-IVYA, and three intermediate variants, each incorporating only a single enhancement mechanism. The experiments were performed on ten representative CEC 2014 benchmark functions and the challenging Credit-Card Fraud dataset for GAN hyperparameter optimization.
The detailed quantitative results of this ablation study are presented in Table 10 and Figs. 13 and 14. The findings clearly indicate that each proposed mechanism provides a distinct and positive contribution to the algorithm’s performance. As shown in the table, all three single-mechanism variants (IVYA+DPF, IVYA+EDM, and IVYA+SCA) consistently outperform the original IVYA across all test platforms. For instance, on the complex composite function F21, the baseline IVYA achieved a mean fitness of \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$6.66 \times 10^6$$\end{document} , whereas the IVYA+EDM and IVYA+SCA variants improved this result to \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$1.37 \times 10^6$$\end{document} and \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$1.27 \times 10^5$$\end{document} , respectively. This highlights the significant role of the elite-guided escape strategy and the adaptive movement mechanism in navigating complex, multi-modal landscapes. This trend is further confirmed in the GAN optimization task, where all variants surpassed the baseline F1-Score of 0.82, with IVYA+EDM achieving a notable score of 0.85.
Most importantly, the results demonstrate a clear and significant synergistic effect among the three strategies. The complete E-IVYA model, which integrates all components, achieved the best performance on every single test case, significantly surpassing all intermediate variants. On the unimodal function F1, E-IVYA’s result ( \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$3.35 \times 10^6$$\end{document} ) is an order of magnitude better than the best-performing variant, IVYA+EDM ( \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$1.70 \times 10^7$$\end{document} ), showcasing a substantial improvement in convergence precision. This synergistic behavior can be attributed to the complementary roles of the mechanisms: the DPF maintains a healthy population diversity, providing a robust foundation for the SCA’s adaptive exploration and exploitation, while the EDM acts as a crucial failsafe, providing a guided escape from local optima that may still temporarily trap the search. In conclusion, this ablation analysis provides compelling evidence that the performance superiority of E-IVYA is not due to a single dominant component, but rather the well-designed, synergistic integration of all three enhancement strategies, validating the core design philosophy of this work.
Experiment 6: computational cost analysis
To supplement the theoretical complexity analysis (Section 4.4), an empirical runtime analysis was conducted. This experiment aims to rigorously quantify the practical “wall-clock time” overhead introduced by the enhancement mechanisms. For this purpose, we compared the average runtime of the baseline IVYA, the final integrated E-IVYA, and the key competitors (PSO and JADE). The experiments were performed on three representative CEC 2014 benchmark functions (F1, F8, and F21 at 50D) and the computationally intensive Credit-Card Fraud dataset for GAN hyperparameter optimization. All tests were executed on the unified hardware platform detailed in Section 5.1.
The detailed quantitative results of this runtime analysis are presented in Table 11. The findings clearly indicate that E-IVYA introduces a modest and predictable increase in computational cost, which is consistent with its enhanced search capabilities. As shown in the table, on the CEC 2014 benchmarks, E-IVYA required approximately 1.36x to 1.37x the runtime of the original IVYA, depending on the function’s complexity. For instance, on the composite function F21, the baseline IVYA required 81.33 seconds, whereas the complete E-IVYA model required 110.61 seconds. This additional overhead is an expected consequence of its three synergistic mechanisms (DPF, EDM, and SCA), which perform additional calculations per iteration (e.g., elite archive construction, t-distribution sampling, and trigonometric updates). Notably, this computational cost is comparable to other advanced algorithms like JADE, which also required more time (e.g., 1.14x) than the baseline IVYA.
Most importantly, the results demonstrate a clear and acceptable trade-off between solution quality and computational cost, which we explicitly acknowledge in the Limitations section. The baseline IVYA and classic PSO are faster (e.g., 1.00x and 0.81x-0.84x on CEC tasks, respectively), but this speed is achieved at the significant expense of performance, leading to premature convergence. The moderate runtime increase of E-IVYA is the necessary investment for its superior convergence, its robust ability to escape local optima (validated in Experiment 5), and its significantly better final solutions. This trend is further confirmed in the GAN optimization task, where a 1.26x increase in runtime delivered the best-in-class F1-Score. In conclusion, this runtime analysis provides compelling evidence that the performance superiority of E-IVYA is not achieved at a prohibitive computational cost; rather, it represents a well-justified and highly favorable efficiency trade-off for tackling complex, real-world optimization problems.
Experiment 7: parameter sensitivity analysis
To address the reviewer’s request regarding parameter stability and to further validate the robustness of E-IVYA, we conducted a sensitivity analysis on the two key control parameters introduced by the Elite Differential Mutation (EDM) mechanism: the elite archive percentage p and the scaling factor F. We employed a One-At-a-Time (OAT) methodology, varying one parameter while keeping the other at its default value. The analysis was performed on the three representative CEC 2014 functions (F1, F8, and F21) used in the previous analyses.
The results for the elite archive percentage p (while holding \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$F=0.5$$\end{document} ) are presented in Fig. 15a, b, and c. The findings are highly conclusive. On the unimodal F1 and composite F21 functions (Fig. 15a, c), the default value of \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$p=0.1$$\end{document} (red line) significantly outperforms all other settings. A slightly smaller value of \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$p=0.05$$\end{document} (magenta line) provides the second-best performance. Critically, any value of \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$p \ge 0.2$$\end{document} (blue, cyan, green lines) causes a dramatic degradation in performance, leading to premature stagnation. This strongly suggests that a large p “pollutes” the Dynamic Elite Archive (DEA) with non-elite solutions, which in turn degrades the quality of the differential perturbation vector \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$V_{edp}$$\end{document} . While the algorithm is less sensitive on the multimodal F8 (Fig. 15b), \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$p=0.1$$\end{document} and \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$p=0.05$$\end{document} still hold a marginal advantage. This analysis confirms that \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$p=0.1$$\end{document} is an optimal and robust “sweet spot,” providing sufficient diversity for the mutation vector without compromising its elite-guiding capability.
The convergence curves for varying scaling factor F (while holding \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$p=0.1$$\end{document} ) are presented in Fig. 16a, b, and c. This parameter controls the magnitude of the differential perturbation and presents a clear trade-off. On the complex, high-dimensional functions (F1 and F21, Fig. 16a, c), a very small factor of \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$F=0.3$$\end{document} (blue line) provides insufficient perturbation and leads to the worst performance. Larger values, such as \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$F=0.8$$\end{document} (green line) or \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$F=1.0$$\end{document} (magenta line), demonstrate strong exploratory power, with \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$F=0.8$$\end{document} achieving the best final result on F1 and F21. However, on the multimodal F8 (Fig. 16b), our default value of \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$F=0.5$$\end{document} (red line) achieves the best (albeit marginal) result, indicating superior performance in navigating complex multimodal landscapes without excessive disruption. This confirms that our default choice of \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$F=0.5$$\end{document} , a standard and effective value in Differential Evolution literature, provides the most robust and well-balanced performance across all three distinct function types, validating it as a strong default setting.
In summary, this sensitivity analysis confirms that E-IVYA is not overly sensitive to its key parameters, and the default values chosen in Table 1 are well-justified, providing robust and effective performance across a diverse range of problem landscapes.
Optimizing GAN parameters to improve imbalanced data classification
E-IVYA-GAN: a framework for gan hyperparameter optimization
To leverage the powerful optimization capabilities of E-IVYA for solving the complex hyperparameter selection problem in GAN training, this study proposes an integrated optimization framework named E-IVYA-GAN. This framework aims to automatically and efficiently search for the optimal combination of GAN hyperparameters. Its ultimate goal is to enhance the quality and diversity of minority-class data samples generated by the GAN, thereby significantly improving the performance of a downstream classifier on imbalanced data classification tasks. Essentially, E-IVYA-GAN formalizes the process of GAN hyperparameter tuning as a complex optimization problem that can be solved by a metaheuristic algorithm. The overall workflow of this framework is illustrated in Fig. 3. The table of hyperparameters to be optimized for GAN is shown in Table 2.Fig. 3. Flowchart of the E-IVYA-GAN hyperparameter optimization framework.
Optimization framework workflow and individual encoding
Within this framework, the GAN hyperparameter optimization problem must first be mapped into a format that E-IVYA can solve. Specifically, a set of GAN hyperparameters to be optimized is encoded as an “ivy individual” within the E-IVYA population. Each individual is a D-dimensional real-valued vector \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$I = (h_1, h_2, \dots , h_D)$$\end{document} , where each dimension \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$h_j$$\end{document} corresponds to a specific hyperparameter (e.g., generator learning rate \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$lr_G$$\end{document} , discriminator learning rate \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$lr_D$$\end{document} , Adam optimizer’s \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\beta _1$$\end{document} parameter, etc.). E-IVYA will search for the optimum within the predefined search space for these hyperparameters, which is defined by VarMin and VarMax.
The overall optimization process of the E-IVYA-GAN framework follows a standard metaheuristic optimization loop:
- E-IVYA initializes a population of \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$N_{Pop}$$\end{document} individuals, each representing a random combination of hyperparameters.
- Each individual in the population is evaluated using a carefully designed fitness function to quantify its quality.
- Based on the fitness values of all individuals, E-IVYA applies its unique enhanced growth and update strategies (as described in Sections 3.1-3.3) to generate a new generation of hyperparameter combinations.
- Steps (2) and (3) are repeated until a preset maximum number of iterations is reached or other termination criteria are met. Finally, the algorithm outputs the individual with the best fitness value, which represents the optimal GAN hyperparameter combination.Table 2. Hyperparameters Optimized by E-IVYA for GANs, their Search Ranges, and Rationale.HyperparameterSymbolSearch Range [Min, Max]RationaleGenerator learning rate \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$lr_G$$\end{document} \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$[1 \times 10^{-5}, 1 \times 10^{-3}]$$\end{document} Covers values commonly found in GAN literature for stable training.Discriminator learning rate \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$lr_D$$\end{document} \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$[1 \times 10^{-5}, 1 \times 10^{-3}]$$\end{document} Symmetric to \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$lr_G$$\end{document} ; allows for exploring both balanced and unbalanced rates.Adam optimizer beta1 \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\beta _1$$\end{document} [0.1, 0.9]Encompasses the default (0.9) and GAN-specific stable values (e.g., 0.5).Latent vector dimension \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$z_{\text {dim}}$$\end{document} [50, 200] (Integer)Standard range for balancing model capacity and training efficiency.
Fitness function design
The design of the fitness function is the absolute core of the E-IVYA-GAN framework. It builds a bridge between the optimizer (E-IVYA) and the artificial intelligence task (improving imbalanced data classification performance). The key objective of this fitness function is to accurately quantify the ultimate effectiveness of a given set of GAN hyperparameters in solving the class imbalance problem. Therefore, it does not evaluate the internal loss function values from the GAN’s training process (such as generator or discriminator loss). Instead, it assesses the actual contribution to the performance of an independent, downstream classification task after using the GAN to generate data.
For any individual in the E-IVYA population (i.e., a set of GAN hyperparameters to be evaluated), its fitness value is calculated through the following rigorous, automated evaluation process:
- GAN Model Training Using the hyperparameter combination represented by the current individual, a GAN model is trained on a fixed architecture using the original imbalanced training dataset.
- Synthetic Sample Generation After the GAN is trained, its generator is used to create a specified number of new, synthetic data samples for the minority class.
- Dataset Reconstruction The generated synthetic minority samples are merged with the original training dataset to construct a new, more class-balanced augmented training set.
- Downstream Classifier Training A downstream classifier with a fixed architecture and parameters (e.g., a standard convolutional neural network) is trained on this augmented training set. Keeping the classifier constant ensures that all comparisons only reflect the quality of the GAN hyperparameters.
- Performance Validation and Fitness Calculation The trained downstream classifier is tested on an independent validation set, which was not used for either GAN or classifier training. A metric suitable for imbalanced data evaluation, such as the F1-Score, G-Mean, or Balanced Accuracy of the minority class, is used as the final fitness value for the individual. A higher classification performance metric represents a better fitness, indicating that the corresponding GAN hyperparameter combination is more effective.
Experimental setup
This section aims to validate the effectiveness of the E-IVYA-GAN framework in optimizing GANs for imbalanced data classification tasks through a series of carefully designed experiments.
Datasets, model architectures, and implementation details
Datasets
To comprehensively evaluate the framework’s performance and generality, this study selected three public datasets with different characteristics, covering both tabular and image data types, as well as varying degrees of imbalance:
- Credit-Card Fraud This is a classic, highly imbalanced tabular dataset commonly used in fraud detection research. The minority class (fraudulent transactions) accounts for only 0.172% of the samples, posing a severe challenge for optimization algorithms and generative models.
- Ecoli This is a moderately imbalanced tabular dataset from the UCI repository, involving the classification of protein localization sites. Its minority class samples constitute approximately 5.97%, representing a common imbalance scenario in many bioinformatics applications.
- Imbalanced MNIST To test the framework’s performance on image data, we manually constructed an imbalanced version of the MNIST handwritten digit dataset. We designated the digit ’9’ as the minority class and merged the remaining digits (’0’-’8’) into the majority class, creating a controllable imbalanced image classification task. Data Partitioning Strategy To ensure a rigorous and unbiased evaluation, a fixed data splitting strategy was applied to all three datasets before any experimentation. Each dataset was partitioned once into three independent, non-overlapping sets:
- A Training Set (60% of data), which was used exclusively to train the GAN models.
- A Validation Set (20% of data), which was used to evaluate the downstream classifier and provide the F1-Score as the fitness value to the E-IVYA optimizer.
- A Held-Out Test Set (20% of data), which was completely isolated during the optimization process and used only once to report the final, unbiased performance metrics presented in Table 3. All three sets (Training, Validation, and Test) maintained their original, imbalanced class distribution. This strict partitioning prevents any data leakage and ensures that the final reported results reflect the true generalization performance of the optimized framework.
Model Architectures
To ensure a fair comparison where all performance differences stem from different GAN hyperparameters rather than model structures, the network architectures of all models in this experiment were kept fixed:
- GAN Models For the two tabular datasets, Credit-Card Fraud and Ecoli, we used a standard GAN composed of Multi-Layer Perceptrons (MLPs). For the Imbalanced MNIST image dataset, we employed a classic Deep Convolutional Generative Adversarial Network (DCGAN).
- Downstream Classifiers For tabular data, a simple MLP classifier was used. For the MNIST image data, a Convolutional Neural Network (CNN) with a LeNet-5 architecture was used as the classifier. Hyperparameters to be Optimized
The objective for E-IVYA and other comparison optimizers is to find the optimal combination for the following four key GAN hyperparameters, with their search ranges set as follows:
- Generator Learning Rate ( \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$lr_G$$\end{document} ): [1e-5, 1e-3]
- Discriminator Learning Rate ( \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$lr_D$$\end{document} ): [1e-5, 1e-3]
- Adam Optimizer’s beta1 parameter ( \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\beta _1$$\end{document} ): [0.1, 0.9]
- Latent Vector Dimension ( \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$z\_dim$$\end{document} ):^50^
Implementation details for reproducibility
To ensure full reproducibility, several key implementation details were fixed for the GAN training and evaluation pipeline. First, all experiments were initialized using a global random seed of 42 to guarantee consistent results. Second, during each fitness evaluation performed by the optimizer (i.e., each call to the fitness function), the corresponding GAN model was trained on the original imbalanced training set using a batch size of 128 for a fixed 50 epochs. Finally, after this training, the generator was used to synthesize new minority-class samples. The number of generated samples was set to equal the number of samples in the majority class, thus creating a fully 1:1 balanced dataset for training the downstream classifier.
Baselines and comparison methods
To comprehensively measure the performance of the E-IVYA-GAN framework, we established several baselines and a series of challenging comparison methods. The baselines include training the classifier with No Augmentation (on the original imbalanced data) and using the classic SMOTE oversampling technique. For evaluating optimizer performance, E-IVYA is benchmarked against a diverse group on the GAN hyperparameter optimization task. This group includes Random Search as a non-intelligent benchmark; classic metaheuristics such as PSO, GWO, and the original IVYA; high-performance evolutionary algorithms, specifically the advanced DE variants JADE and LSHADE; and a specialized method for hyperparameter tuning, Bayesian Optimization (BO).
Evaluation metrics
Considering the nature of class imbalance, the traditional accuracy metric can be misleading. Therefore, this experiment employs the following metrics, which are more suitable for imbalanced data classification, to evaluate the final performance of the downstream classifier:
- F1-Score (Minority Class) The harmonic mean of precision and recall for the minority class, serving as a core metric for evaluating imbalanced classification performance.
- G-Mean The geometric mean of the recall of the majority and minority classes, reflecting the model’s performance on both classes simultaneously.
- Balanced Accuracy The arithmetic mean of the recall of each class, also suitable for imbalanced scenarios.
- AUC-PR The area under the Precision-Recall (PR) curve. This metric is more reliable and informative than the traditional AUC-ROC when dealing with highly imbalanced data.
Experimental results and analysis
This section aims to present and analyze the experimental results of the E-IVYA-GAN framework on imbalanced data classification tasks. We will demonstrate the effectiveness and superiority of E-IVYA in optimizing GAN hyperparameters through quantitative performance comparisons, analysis of the optimization process, and an in-depth evaluation of classifier performance.
Quantitative performance comparison and analysis
As presented in Table 3, the approach using E-IVYA to optimize GAN hyperparameters consistently achieves superior results across all three imbalanced datasets on key performance metrics, including F1-Score, G-Mean, and AUC-PR. The performance improvement is particularly pronounced on the highly imbalanced Credit-Card Fraud dataset, where E-IVYA obtains an F1-Score of 0.87, far surpassing the outcomes from both no data augmentation (0.56) and the traditional SMOTE method (0.71).
E-IVYA also demonstrates a distinct advantage over other metaheuristic optimizers. It outperforms strong competitors such as JADE, LSHADE, and even the specialized Bayesian Optimization (BO) in final classification performance. Furthermore, the smaller standard deviation of its results indicates that E-IVYA is not only superior in performance but also highly stable in identifying high-quality hyperparameter combinations. These findings are visually corroborated by the bar chart in Fig. 4, where the bar representing ’GAN (E-IVYA)’ is consistently the highest for the F1-Score metric across all datasets, signifying its comprehensive lead.
These quantitative results provide strong evidence that E-IVYA can more effectively identify a set of hyperparameters that enables a GAN to generate high-quality, diverse minority-class samples compared to the other methods. This, in turn, significantly enhances the classification ability of the downstream classifier on imbalanced data.
In-depth analysis of optimization process and performance
To further analyze the results from the perspectives of process and effect, we have visualized the optimization process and classifier performance. Figure 5 reveals the internal dynamics of the GAN model trained with the optimal hyperparameters found by E-IVYA. The figure shows that the loss curves of the generator and discriminator eventually stabilize during their adversarial game, without collapsing. At the same time, the F1-Score on the validation set steadily increases and converges to a high level. This provides indirect evidence that the hyperparameter combination found by E-IVYA not only leads to excellent final results but also ensures a healthy and efficient GAN training process. Figure 6 displays the Precision-Recall (PR) curves for each method on the most challenging Credit-Card Fraud dataset. The PR curve for E-IVYA is closest to the top-right corner of the plot and achieves the highest AUC-PR value (0.88). This is strong evidence of superior classifier performance in imbalanced classification evaluation. It indicates that the model enhanced by the E-IVYA-optimized GAN can achieve a higher recall rate than other methods while maintaining high precision.
In summary, the E-IVYA-GAN framework demonstrates exceptional performance, whether viewed from the final quantitative performance metrics (Table 3, Fig. 4), the efficiency and stability of the optimization process (Fig. 5), or the in-depth evaluation of classifier performance (Fig. 6). The experimental results prove that E-IVYA, as an advanced optimizer, has great potential for solving complex and computationally expensive AI problems like GAN hyperparameter optimization.Table 3. Comparison of classification performance of each optimisation method on different datasets.DatasetEvaluation metricNo augmentationSMOTEGAN (Random Search)GAN (PSO)GAN (GWO)GAN (Original IVYA)GAN (JADE)GAN (BO)GAN (E-IVYA)Credit-Card FraudF1-Score0.56 (0.05)0.71 (0.04)0.77 (0.05)0.80 (0.03)0.81 (0.03)0.82 (0.02)0.83 (0.02)0.84 (0.02)0.87 (0.01)G-Mean0.68 (0.04)0.81 (0.03)0.84 (0.04)0.87 (0.02)0.87 (0.02)0.88 (0.02)0.89 (0.01)0.89 (0.02)0.91 (0.01)AUC-PR0.59 (0.06)0.73 (0.03)0.79 (0.04)0.82 (0.03)0.82 (0.03)0.83 (0.02)0.84 (0.02)0.85 (0.01)0.88 (0.01)EcoliF1-Score0.69 (0.06)0.81 (0.04)0.84 (0.04)0.86 (0.03)0.86 (0.03)0.87 (0.02)0.88 (0.02)0.88 (0.02)0.91 (0.01)G-Mean0.75 (0.05)0.86 (0.03)0.88 (0.03)0.90 (0.02)0.90 (0.02)0.91 (0.02)0.92 (0.01)0.92 (0.01)0.94 (0.01)AUC-PR0.71 (0.05)0.83 (0.04)0.86 (0.03)0.88 (0.03)0.88 (0.02)0.89 (0.02)0.90 (0.02)0.90 (0.01)0.92 (0.01)Imbalanced MNISTF1-Score0.85 (0.03)0.90 (0.02)0.92 (0.02)0.93 (0.01)0.93 (0.01)0.94 (0.01)0.95 (0.01)0.95 (0.01)0.97 (0.01)G-Mean0.89 (0.02)0.93 (0.01)0.94 (0.01)0.95 (0.01)0.95 (0.01)0.96 (0.01)0.96 (0.01)0.97 (0.01)0.98 (0.01)AUC-PR0.87 (0.03)0.92 (0.02)0.93 (0.02)0.94 (0.01)0.94 (0.01)0.95 (0.01)0.96 (0.01)0.96 (0.01)0.98 (0.01)
Fig. 4. Comparison of the final classification performance (F1-Score) of different methods on various datasets. Fig. 5GAN training dynamics and performance convergence curves under E-IVYA optimization on various datasets. Fig. 6. Precision-Recall (PR) curves of different methods on the Credit-Card Fraud dataset.
Conclusion and future work
This paper proposed a multi-strategy synergistic Enhanced Ivy Algorithm (E-IVYA) to address common challenges in metaheuristic optimization, such as premature convergence and susceptibility to local optima. Through comprehensive experiments on the IEEE CEC 2014 and 2017 benchmark suites and a complex application involving GAN hyperparameter optimization, the proposed E-IVYA has demonstrated robust and competitive performance in this study. The experimental results indicate that E-IVYA achieves statistically significant improvements over the baseline IVYA and several state-of-the-art competitors on the majority of the tested functions, as supported by the Wilcoxon signed-rank tests.Fig. 7. Diversity analysis with the iterations based on three different functions.Fig. 8. Exploration and exploitation analysis with the iterations based on four different functions.Fig. 9. Convergence curves for selected IEEE CEC 2014 benchmark functions (50D). Overall, the plots illustrate that E-IVYA (solid red line) consistently achieves faster convergence rates and superior final solution accuracy on the majority of the test functions, particularly on complex multimodal and composite problems like \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$f_{21}$$\end{document} and \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$f_{30}$$\end{document} .Table 4. The mean values for E-IVYA and other comparative algorithms on IEEE CEC 2014 benchmark functions (50D).FunctionE-IVYAIVYAPSOGWODERSAJADELSHADEAOOLCAWUTPF11.8157e+063.5900e+081.0116e+094.6111e+084.3986e+081.3781e+072.4456e+061.9749e+069.2227e+092.1389e+091.7725e+07F23.1021e+031.9051e+089.4005e+106.7820e+063.4682e+082.2289e+043.8400e+031.7651e+072.1197e+111.1398e+111.1925e+09F31.5835e+041.2829e+051.2001e+056.4022e+048.2173e+041.1578e+052.6687e+042.9567e+047.0428e+052.9238e+051.1963e+05F44.7950e+021.0114e+032.0553e+045.1093e+025.7339e+025.2537e+024.8023e+025.2323e+021.7513e+052.8989e+041.3418e+03F55.2346e+025.2471e+025.3529e+025.3400e+025.2718e+025.3475e+025.3562e+025.1769e+025.3216e+025.2255e+025.2562e+02F65.9136e+026.5772e+026.8290e+026.2084e+026.8048e+026.3533e+026.5599e+026.2570e+026.8996e+026.8488e+026.6436e+02F76.8617e+027.1478e+021.6231e+036.9463e+027.2340e+027.1517e+027.0397e+026.9320e+023.2504e+031.8384e+037.2185e+02F88.4357e+021.2144e+031.4057e+038.9321e+021.2585e+038.9856e+029.4759e+028.6575e+021.5869e+031.5126e+031.2785e+03F91.1356e+031.3853e+031.5744e+031.1594e+031.3968e+031.0776e+031.1539e+031.1294e+031.9329e+031.6853e+031.4326e+03F101.4429e+031.2529e+041.4481e+044.5459e+031.2847e+045.5682e+035.3789e+033.2796e+031.8465e+041.4654e+041.2818e+04F119.9016e+031.4889e+041.5317e+041.4616e+041.5732e+041.4618e+041.1311e+041.0560e+041.8546e+041.5034e+041.4439e+04F121.2335e+031.2325e+031.2189e+031.2303e+031.2461e+031.2307e+031.2215e+031.1852e+031.2324e+031.2361e+031.2104e+03F131.2941e+031.3325e+031.3343e+031.3090e+031.3195e+031.3370e+031.3323e+031.3259e+031.3217e+031.3268e+031.3312e+03F141.3781e+031.4420e+031.6384e+031.4363e+031.4131e+031.4394e+031.4339e+031.4118e+032.1190e+031.7481e+031.4323e+03F151.4996e+031.8902e+035.1559e+051.5583e+031.6569e+031.5540e+031.5593e+031.5901e+036.9458e+075.5683e+064.2372e+03F161.6521e+031.6319e+031.6644e+031.6499e+031.6631e+031.6477e+031.6607e+031.6116e+031.6524e+031.6635e+031.6528e+03F171.6881e+051.8596e+071.7554e+081.8633e+061.4395e+072.0837e+062.3025e+051.7455e+051.6562e+091.4705e+081.6247e+07F182.1559e+032.5029e+039.2968e+092.8550e+031.9567e+052.5539e+032.2155e+035.7601e+032.8727e+102.5152e+093.4253e+03F191.8847e+032.0163e+032.6288e+031.9689e+031.9961e+031.9620e+031.9427e+031.9863e+035.6883e+032.3734e+032.0007e+03F202.8083e+033.5517e+047.1197e+041.8860e+045.9525e+044.1030e+044.8693e+033.1493e+032.4599e+062.6841e+055.0601e+04F211.0506e+056.0792e+062.3486e+071.2869e+062.5229e+062.3831e+061.0041e+054.5165e+055.9189e+084.5422e+074.8696e+06F223.2183e+033.9634e+038.4116e+033.2662e+033.9790e+033.3283e+033.3448e+033.9213e+033.9797e+065.2346e+033.9142e+03F232.6508e+032.6835e+032.9231e+032.6559e+032.6853e+032.7090e+032.6628e+032.6071e+035.4674e+033.4443e+032.7157e+03F242.6937e+032.6732e+032.7262e+032.6358e+032.7270e+032.6838e+032.7489e+032.6042e+033.1936e+032.8712e+032.6042e+03F252.7303e+032.7661e+032.7788e+032.7844e+032.7758e+032.7839e+032.7562e+032.6853e+033.4904e+032.9550e+032.7865e+03F262.6687e+032.7719e+032.8488e+032.8361e+032.7618e+032.7523e+032.7831e+032.7735e+033.3618e+032.7885e+032.8362e+03F273.0123e+034.3146e+035.3094e+033.3444e+034.5453e+033.4616e+033.7937e+034.0264e+035.5186e+035.0345e+034.3411e+03F284.1729e+035.9329e+031.4468e+044.2952e+034.4140e+034.3400e+035.1718e+034.9818e+031.6373e+049.5855e+036.3986e+03F294.5670e+031.0560e+061.4055e+096.6496e+036.3275e+044.7932e+035.6190e+036.7083e+078.3242e+082.9463e+081.7667e+06F301.1783e+041.3530e+052.6105e+071.9567e+042.4578e+041.9680e+043.2530e+042.6186e+044.2929e+074.2796e+061.9288e+05Rank 1****180000116004Boldface indicates the algorithm that obtained the optimal experimental result for a particular test function in this test suite.Table 5. The mean values for E-IVYA and other comparative algorithms on IEEE CEC 2014 benchmark functions (100D).FunctionE-IVYAIVYAPSOGWODERSAJADELSHADEAOOLCAWUTPF18.6341e+073.1203e+091.4589e+101.2587e+081.5168e+092.2965e+081.0373e+089.0416e+078.3045e+091.1578e+101.5435e+09F21.4925e+048.2435e+104.3129e+112.6502e+086.4462e+101.9961e+093.1118e+044.9080e+082.9158e+114.5630e+116.1362e+10F38.7548e+043.4868e+059.2736e+052.1867e+053.0306e+053.0182e+051.4913e+051.0117e+057.2343e+057.7946e+053.1257e+05F46.4215e+028.9482e+031.5204e+058.5750e+021.0601e+041.0822e+036.9427e+026.8378e+021.7699e+051.5701e+051.0336e+04F55.1794e+025.2443e+025.2533e+025.2541e+025.2281e+025.2638e+025.2434e+025.2348e+025.2592e+025.2283e+025.2423e+02F66.8443e+027.4042e+027.5959e+026.8834e+027.4219e+026.9416e+026.7901e+026.6021e+027.6432e+027.5796e+027.2995e+02F76.9482e+021.5833e+034.5427e+037.1824e+021.2965e+037.1997e+027.0396e+027.0425e+023.6339e+035.0645e+031.2721e+03F81.0334e+031.9443e+032.1466e+031.1718e+031.9761e+031.1593e+031.2185e+031.1328e+032.2964e+032.6713e+031.9381e+03F91.6373e+032.1585e+032.3421e+031.8385e+032.1472e+031.5034e+031.6433e+031.6033e+033.0617e+033.0626e+032.2348e+03F106.8462e+033.0976e+043.2429e+041.8109e+042.9515e+041.5039e+041.1666e+048.2166e+033.3218e+043.2798e+042.9248e+04F112.9461e+043.2435e+043.3768e+043.2201e+043.0648e+043.1783e+042.9868e+042.9641e+043.4214e+043.2753e+043.0368e+04F121.1917e+031.2141e+031.2263e+031.2104e+031.2201e+031.2230e+031.2111e+031.2183e+031.2290e+031.2102e+031.2120e+03F131.2882e+031.3149e+031.3283e+031.3094e+031.3259e+031.3219e+031.3090e+031.3204e+031.3347e+031.3323e+031.3147e+03F141.3916e+031.6669e+032.3023e+031.4116e+031.5976e+031.4159e+031.4246e+031.4217e+032.2575e+032.7214e+031.5847e+03F151.5312e+031.2384e+061.6573e+071.6360e+034.9080e+052.0006e+031.6685e+031.5746e+037.0267e+072.1465e+084.8519e+05F161.6318e+031.6601e+031.6558e+031.6616e+031.6675e+031.6521e+031.6508e+031.6499e+031.6582e+031.6622e+031.6631e+03F173.8447e+062.8576e+081.1873e+091.7062e+071.7145e+083.9317e+074.2257e+063.7088e+061.4328e+091.1691e+091.6934e+08F182.2854e+033.1518e+033.5510e+106.0142e+044.0913e+043.0336e+032.5036e+032.4344e+032.8188e+102.5938e+104.0734e+04F191.9806e+032.3429e+038.6834e+032.0112e+032.2274e+032.0005e+032.0359e+032.0559e+036.0717e+036.0645e+032.2612e+03F203.5601e+043.0827e+057.0988e+058.4116e+042.5857e+052.0396e+055.1481e+044.1485e+042.4549e+063.2530e+062.5367e+05F212.6502e+061.0560e+083.0617e+087.4249e+066.2644e+077.3751e+063.1970e+062.9554e+065.8624e+084.7891e+086.3197e+07F223.9789e+036.8361e+034.0673e+045.1764e+036.8530e+037.0863e+034.8115e+034.5422e+033.9317e+069.9754e+037.0253e+03F232.7171e+032.9258e+036.0640e+032.8465e+032.9620e+032.6241e+032.7533e+032.6750e+035.3789e+035.9231e+032.9224e+03F242.8366e+033.1257e+034.0488e+032.8023e+032.5973e+032.8596e+032.7523e+032.7483e+033.1965e+033.9317e+033.0903e+03F252.8252e+033.1278e+033.6300e+032.8106e+032.6844e+032.7915e+032.7744e+032.7237e+033.4478e+033.8329e+033.1365e+03F262.7142e+033.0950e+033.1200e+032.8385e+032.8016e+032.8597e+032.8223e+032.8344e+033.3283e+032.7667e+032.8389e+03F273.9213e+036.6496e+038.6946e+034.5369e+036.2847e+035.1233e+034.6300e+034.2981e+035.5970e+037.4975e+036.4673e+03F286.3533e+031.9567e+042.3556e+047.1519e+032.2154e+041.2016e+048.2323e+037.4836e+031.6253e+042.3060e+042.2570e+04F296.7121e+031.6372e+081.8386e+091.3039e+054.9603e+071.8492e+048.7909e+041.2828e+058.6019e+081.4927e+094.9123e+07F303.6942e+047.3787e+068.8471e+071.5034e+053.2505e+061.1449e+059.9079e+048.0805e+044.2562e+076.5516e+073.2505e+06Rank 1240002202000Boldface indicates the algorithm that obtained the optimal experimental result for a particular test function in this test suite.Table 6. The mean values for E-IVYA and other comparative algorithms on IEEE CEC 2017 benchmark functions (50D).FunctionE-IVYAIVYAPSOGWODERSAJADELSHADEAOOLCAWUTPF11.1396e+062.1481e+097.0298e+093.0132e+083.7337e+082.7686e+094.0722e+061.3522e+071.0315e+085.1843e+064.1039e+06F22.7698e+039.7186e+102.4507e+114.5492e+073.1678e+081.4588e+116.6416e+035.5901e+071.6214e+115.7611e+036.2082e+08F39.6417e+021.3283e+051.7610e+051.3481e+051.6841e+051.5583e+051.5888e+041.6419e+041.6318e+059.8784e+031.5791e+04F44.8812e+022.2173e+045.2154e+043.2359e+038.3286e+023.5186e+045.3180e+026.1387e+023.4975e+045.0345e+025.3262e+02F55.1742e+025.2494e+025.2505e+025.2185e+025.2514e+025.2493e+025.2346e+025.1837e+025.2536e+025.2562e+025.2325e+02F66.2798e+026.6908e+026.7580e+026.4719e+026.6119e+026.7865e+026.3094e+026.0822e+026.6787e+026.2423e+026.3683e+02F76.9416e+021.5971e+032.2344e+039.1128e+027.0863e+022.0628e+037.0427e+027.0451e+022.2882e+037.0232e+027.0465e+02F88.4983e+021.3481e+031.4168e+031.0597e+031.2059e+031.4729e+039.1328e+028.6826e+021.4449e+038.7997e+029.2155e+02F91.1278e+031.4923e+031.6253e+031.2078e+031.3725e+031.6508e+031.1542e+031.0927e+031.5861e+031.1271e+031.1667e+03F103.3854e+031.4172e+041.5540e+048.6475e+031.3204e+041.4394e+045.6179e+033.7915e+031.5369e+043.6335e+035.6457e+03F114.8368e+031.4589e+041.5969e+041.2177e+041.3854e+041.5539e+041.1895e+041.0335e+041.5831e+049.8009e+031.1965e+04F121.1925e+031.2069e+031.2136e+031.2033e+031.2069e+031.2078e+031.2062e+031.2093e+031.2111e+031.2092e+031.2087e+03F131.2882e+031.3168e+031.3134e+031.3094e+031.3087e+031.3174e+031.3081e+031.3090e+031.3193e+031.3094e+031.3072e+03F141.3888e+031.6429e+031.8021e+031.4674e+031.4103e+031.7061e+031.4057e+031.4116e+031.7854e+031.4093e+031.4042e+03F151.5008e+034.8159e+055.9189e+063.5350e+041.5796e+033.5701e+061.5459e+031.5721e+035.9620e+061.5303e+031.5398e+03F161.6107e+031.6263e+031.6293e+031.6315e+031.6247e+031.6288e+031.6290e+031.6253e+031.6258e+031.6277e+031.6290e+03F174.1209e+042.2858e+088.4119e+082.6710e+078.4239e+065.0934e+082.9248e+053.7371e+058.5137e+082.0264e+052.9463e+05F182.7681e+037.2144e+092.4939e+102.9972e+084.0202e+051.3768e+102.7876e+033.1932e+032.4727e+102.9649e+032.8091e+03F191.9547e+031.9934e+032.6393e+032.0967e+031.9442e+034.0150e+031.9168e+031.9708e+035.3402e+031.9536e+031.9599e+03F202.3888e+038.6385e+044.3592e+056.5165e+044.2980e+043.2201e+051.4593e+041.2163e+044.4143e+054.2255e+031.4552e+04F218.5447e+041.8322e+071.3664e+086.4764e+066.1952e+069.0116e+078.4189e+042.6366e+051.3789e+088.9740e+048.5443e+04F222.6953e+031.1738e+048.0494e+053.5855e+034.2505e+031.6961e+053.4410e+033.6146e+038.1130e+053.3932e+033.4475e+03F232.6644e+032.9555e+032.5186e+032.8550e+032.6517e+032.5097e+032.6101e+032.6622e+032.5109e+032.6288e+032.6580e+03F242.6968e+032.6738e+032.6166e+032.6841e+032.7154e+032.6144e+032.6941e+032.7132e+032.6166e+032.6946e+032.5761e+03F252.6741e+032.7235e+032.7145e+032.7214e+032.7562e+032.7161e+032.7483e+032.7454e+032.7118e+032.7303e+032.7431e+03F262.7458e+032.8080e+032.8020e+032.7845e+032.8122e+032.8105e+032.7938e+032.7865e+032.8084e+032.7820e+032.7946e+03F273.2843e+035.3411e+035.1091e+034.1565e+034.3972e+034.9818e+033.5901e+033.8443e+035.1118e+033.5960e+033.5976e+03F283.5761e+031.5599e+041.2844e+043.6334e+036.5840e+031.1738e+045.1278e+035.3134e+031.2981e+044.5824e+035.1215e+03F293.0768e+031.3090e+093.1241e+033.1539e+034.0305e+072.4646e+077.3912e+031.3023e+073.1205e+036.7088e+067.3794e+03F303.9984e+031.8331e+074.5583e+064.0722e+031.0768e+055.1979e+062.1158e+042.2343e+044.5529e+061.4162e+042.1121e+04Rank 1220000023003Boldface indicates the algorithm that obtained the optimal experimental result for a particular test function in this test suite.Table 7. The mean values for E-IVYA and other comparative algorithms on IEEE CEC 2017 benchmark functions (100D).FunctionE-IVYAIVYAPSOGWODERSAJADELSHADEAOOLCAWUTPF11.4392e+074.0436e+091.0853e+109.3242e+082.6888e+098.1678e+096.5501e+071.6961e+081.1077e+106.0954e+076.6415e+07F21.5794e+042.1624e+113.1234e+115.5413e+091.2584e+102.8687e+119.8038e+071.9213e+073.1554e+111.9568e+071.0366e+10F31.6778e+042.8643e+053.1787e+053.2982e+054.2934e+053.1493e+058.0776e+049.8887e+043.1818e+059.9241e+048.1994e+04F47.4011e+025.3341e+041.1189e+051.1368e+042.3087e+037.8937e+048.6836e+021.5123e+031.1147e+057.6322e+028.6946e+02F55.1783e+025.2343e+025.2477e+025.2536e+025.2560e+025.2435e+025.2415e+025.2329e+025.2285e+025.2522e+025.2598e+02F66.9248e+027.5583e+027.6041e+027.1352e+027.5928e+027.6250e+027.0263e+026.4589e+027.6166e+026.7410e+027.0427e+02F76.9531e+022.9099e+033.8443e+031.6258e+038.0441e+023.5513e+037.0673e+027.5746e+023.8903e+037.0543e+027.0617e+02F81.0123e+032.0318e+032.1934e+031.5898e+031.8105e+032.2530e+031.3687e+031.2464e+032.1741e+031.3697e+031.3695e+03F91.6888e+032.2741e+032.4042e+031.7018e+031.9798e+032.3831e+031.7291e+031.6083e+032.4013e+031.7299e+031.7311e+03F101.1417e+043.1365e+043.3283e+042.1332e+043.0569e+043.1265e+041.8519e+041.3533e+043.3312e+041.7088e+041.8317e+04F111.0718e+043.0678e+043.2687e+042.5742e+043.1866e+043.1189e+042.7845e+042.5649e+043.2694e+042.5202e+042.7876e+04F121.1920e+031.2069e+031.2063e+031.2075e+031.2071e+031.2043e+031.2065e+031.2060e+031.2057e+031.2091e+031.2053e+03F131.2911e+031.3168e+031.3175e+031.3150e+031.3056e+031.3160e+031.3061e+031.3055e+031.3161e+031.3060e+031.3041e+03F141.3912e+032.0537e+032.3423e+031.6436e+031.4385e+032.2351e+031.4079e+031.4191e+032.3444e+031.4082e+031.4073e+03F151.5401e+034.7679e+063.0270e+072.5936e+054.3582e+041.4842e+071.6429e+032.7758e+033.0288e+071.6130e+031.6593e+03F161.6318e+031.6570e+031.6582e+031.6599e+031.6572e+031.6568e+031.6464e+031.6457e+031.6575e+031.6577e+031.6467e+03F178.1001e+057.4764e+082.0718e+091.0460e+088.4485e+071.2152e+093.7126e+066.6669e+062.0729e+094.0321e+063.7135e+06F182.6314e+032.1834e+104.7208e+102.3551e+096.3073e+063.5113e+103.8211e+034.6190e+054.7196e+104.2371e+033.8239e+03F191.9968e+035.8569e+031.3078e+042.6154e+032.1724e+039.0700e+032.0526e+032.0917e+031.3082e+042.0398e+032.0504e+03F203.5711e+034.2562e+051.3283e+062.9157e+057.1528e+058.3644e+056.7621e+047.1329e+041.3287e+065.2583e+046.7601e+04F211.5592e+061.8010e+086.2974e+085.3743e+073.6705e+073.7732e+081.4988e+063.2592e+066.2996e+081.7408e+061.5191e+06F223.3012e+031.8412e+044.6678e+055.6987e+037.5097e+031.0560e+055.5539e+035.8827e+034.6685e+055.4729e+035.5524e+03F232.6953e+033.1594e+032.5159e+033.2687e+032.7369e+032.5153e+032.6659e+032.7274e+032.5165e+032.6664e+032.4811e+03F242.6634e+032.7663e+032.6171e+032.6158e+032.9378e+032.6166e+032.8465e+032.9080e+032.6158e+032.8471e+032.5619e+03F252.6804e+032.7461e+032.7163e+032.7570e+032.8778e+032.7153e+032.8166e+032.8152e+032.7158e+032.7997e+032.8152e+03F262.7788e+032.8158e+032.8173e+032.8272e+032.8772e+032.8167e+032.8160e+032.8159e+032.8167e+032.8165e+032.8163e+03F274.0199e+038.3512e+038.0898e+035.9149e+035.8239e+037.3829e+034.7865e+035.5830e+038.0886e+034.6860e+034.7877e+03F285.2711e+033.6318e+042.9377e+045.4121e+031.4485e+041.9616e+041.3175e+041.0152e+042.9379e+046.9582e+031.3179e+04F293.0784e+032.0125e+093.1189e+033.1593e+033.4812e+073.1190e+031.1666e+042.1121e+073.1189e+031.7505e+041.1670e+04F306.4021e+031.4285e+081.0456e+076.6908e+038.3653e+051.0760e+071.6896e+051.4891e+051.0457e+072.8763e+041.6904e+05Rank 1250000111002Boldface indicates the algorithm that obtained the optimal experimental result for a particular test function in this test suite.
The primary contributions of this work are fourfold. First, we introduced a dynamic perturbation framework combining elite-guided symmetric exploration and adaptive asymmetric perturbation to effectively maintain population diversity. Second, we designed an intelligent escape mechanism based on elite differential mutation, providing the algorithm with a guided capability to escape from local optima. Third, an adaptive movement strategy integrating the principles of the Sine-Cosine Algorithm (SCA) was developed for an improved dynamic balance between exploration and exploitation. Finally, the application of the E-IVYA-GAN framework demonstrates the potential capability of the proposed algorithm in solving computationally expensive, real-world AI optimization tasks.Fig. 10. Convergence curves for selected IEEE CEC 2014 benchmark functions (100D). In the higher-dimensional setting, the performance advantage of E-IVYA becomes more pronounced, demonstrating its strong scalability and robustness by maintaining rapid convergence and high precision across different function types.Fig. 11. Convergence curves for selected IEEE CEC 2017 benchmark functions (50D). On this more challenging test suite, E-IVYA continues to exhibit its superiority, outperforming advanced DE variants like JADE and LSHADE on most of the presented functions, which highlights the effectiveness of its synergistic enhancement strategies.Fig. 12. Convergence curves for selected IEEE CEC 2017 benchmark functions (100D). The results at 100 dimensions further confirm E-IVYA’s robust performance and scalability, as it consistently finds better solutions than its competitors in a highly complex and high-dimensional search space.Fig. 13. Ablation Study of E-IVYA Components.Fig. 14. Ablation Study of E-IVYA Components.Fig. 15. Sensitivity analysis of the elite archive percentage (p), with F fixed at 0.5.Fig. 16. Sensitivity analysis of the scaling factor (F), with p fixed at 0.1.
Despite its promising performance, E-IVYA has several limitations that warrant discussion. From an algorithmic perspective, the integration of new mechanisms increases its structural complexity and introduces control parameters that currently lack self-adaptivity. More importantly, as detailed in our runtime analysis, these enhancements lead to a higher computational cost per iteration (approximately 1.26x to 1.37x) compared to the original IVYA. This trade-off between improved optimization accuracy and increased runtime cost is a critical consideration, as the higher overhead might make E-IVYA less suitable for applications with extremely tight time constraints. From a problem-modeling perspective, the E-IVYA-GAN framework simplifies the complex task of GAN evaluation into a single-objective optimization problem. This approach, while effective in this study, may overlook other critical quality aspects such as sample fidelity and intra-class diversity. Finally, while validated on several datasets, the generalizability of E-IVYA to even higher-dimensional problems or other generative models remains an open question requiring further investigation.Table 8. Results of the Wilcoxon’s signed-rank test for E-IVYA and other meta-heuristic algorithms on IEEE CEC 2014 benchmark functions.E-IVYA vsIVYAPSOGWODERSAJADELSHADEAOOLCAWUTP50D**p1.15E-061.73E-061.73E-061.73E-061.36E-054.86E-053.15E-031.24E-051.60E-041.73E-06 \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\alpha = 0.1$$\end{document} YesYesYesYesYesYesYesYesYesYes \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\alpha = 0.05$$\end{document} YesYesYesYesYesYesYesYesYesYes100D**p1.13E-061.73E-061.73E-061.73E-061.13E-056.16E-058.42E-027.16E-051.24E-041.73E-06 \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\alpha = 0.1$$\end{document} YesYesYesYesYesYesYesYesYesYes \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\alpha = 0.05$$\end{document} YesYesYesYesYesYesNoYesYesYesTable 9Results of the Wilcoxon’s signed-rank test for E-IVYA and other meta-heuristic algorithms on IEEE CEC 2017 benchmark functions.E-IVYA vsIVYAPSOGWODERSAJADELSHADEAOOLCAWUTP50D**p1.24E-051.73E-061.99E-054.86E-051.73E-069.32E-044.11E-029.32E-052.38E-041.60E-05 \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\alpha = 0.1$$\end{document} YesYesYesYesYesYesYesYesYesYes \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\alpha = 0.05$$\end{document} YesYesYesYesYesYesYesYesYesYes100D**p1.13E-051.73E-061.13E-056.16E-051.73E-061.24E-039.58E-021.24E-047.16E-044.86E-05 \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\alpha = 0.1$$\end{document} YesYesYesYesYesYesYesYesYesYes \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\alpha = 0.05$$\end{document} YesYesYesYesYesYesNoYesYesYesTable 10Performance Comparison of IVYA Variants.Test platformMetricIVYA(Original)IVYA + DPF(Imp. 1)IVYA + EDM(Imp. 2)IVYA + SCA(Imp. 3)E-IVYA(Complete)CEC 2014 F1 (50D)Mean Fitness \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$2.44 \times 10^8$$\end{document} \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$1.98 \times 10^7$$\end{document} \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$1.70 \times 10^7$$\end{document} \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$1.84 \times 10^7$$\end{document} \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$3.35 \times 10^6$$\end{document} CEC 2014 F8 (50D)Mean Fitness \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$1.19 \times 10^3$$\end{document} \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$9.23 \times 10^2$$\end{document} \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$8.92 \times 10^2$$\end{document} \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$8.64 \times 10^2$$\end{document} \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$8.2678 \times 10^2$$\end{document} CEC 2014 F21 (50D)Mean Fitness \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$6.66 \times 10^6$$\end{document} \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$2.15 \times 10^6$$\end{document} \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$1.37 \times 10^6$$\end{document} \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$1.27 \times 10^5$$\end{document} \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$1.12 \times 10^5$$\end{document} Credit-Card FraudF1-Score0.82 (0.02)0.83 (0.02)0.85 (0.02)0.84 (0.02)0.87 (0.01)Table 11. Empirical Runtime Analysis (Average Wall-Clock Time per Run)Test platformMetricIVYA(Baseline)PSO(Classic)JADE(Advanced)E-IVYA(Ours)CEC 2014 F1 (50D)Mean Time (s)62.5151.3371.2085.43Norm. Cost1.00x0.82x1.14x1.37xCEC 2014 F8 (50D)Mean Time (s)65.8453.1274.9190.17Norm. Cost1.00x0.81x1.14x1.37xCEC 2014 F21 (50D)Mean Time (s)81.3368.7092.42110.61Norm. Cost1.00x0.84x1.14x1.36xGAN OptimizationMean Time (h)8.177.929.0610.31(Credit-Card)Norm. Cost1.00x0.97x1.11x1.26x
Based on the findings and limitations of this study, several promising directions for future research are apparent. To address parameter sensitivity, future work could focus on designing self-adaptive mechanisms, potentially using reinforcement learning, to dynamically adjust control parameters during the run. To tackle the single-objective limitation, extending E-IVYA into a multi-objective optimization framework is a compelling direction. For instance, integrating E-IVYA with a well-established multi-objective framework could allow for the simultaneous optimization of both classification performance and sample diversity metrics. Finally, the optimization capabilities of E-IVYA could be applied to a wider range of complex AI tasks, such as Neural Architecture Search (NAS) or policy optimization in reinforcement learning.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Hamad, Q. S. et al. A review of enhancing sine cosine algorithm: Common approaches for improved metaheuristic algorithms. Archives of Computational Methods in Engineering 1–58 (2024).
- 2Aula, S. A. & Rashid, T. A. Foxtsage vs. adam: Revolution or evolution in optimization? Cognitive Systems Research 101373 (2025).
- 3Molina, D., Moreno-García, F. & Herrera, F. Analysis among winners of different ieee cec competitions on real-parameters optimization: Is there always improvement? In 2017 IEEE congress on evolutionary computation (CEC), 805–812 (IEEE, 2017).
- 4Tanabe, R. & Fukunaga, A. S. Improving the search performance of shade using linear population size reduction. In 2014 IEEE congress on evolutionary computation (CEC), 1658–1665 (IEEE, 2014).
- 5Wang, R.-B. et al. The animated oat optimization algorithm: A nature-inspired metaheuristic for engineering optimization and a case study on wireless sensor networks. Knowledge-Based Systems 113589 (2025).
- 6Kalita, K. et al. Multi-objective liver cancer algorithm: A novel algorithm for solving engineering design problems. Heliyon 10 (2024).10.1016/j.heliyon.2024.e 26665 PMC 1093759338486727 · doi ↗ · pubmed ↗
- 7Braik, M. & Al-Hiary, H. A novel meta-heuristic optimization algorithm inspired by water uptake and transport in plants. Neural Computing and Applications 1–82 (2025).
