Multi-Mechanism Artificial Lemming Algorithm for Global Optimization and Color Multi-Threshold Image Segmentation
Liang Tao, Lingzhi Li, Fan Lu

TL;DR
This paper introduces MALA, an improved optimization algorithm for color image segmentation that outperforms existing methods.
Contribution
The novel Multi-Mechanism Artificial Lemming Algorithm (MALA) enhances exploration-exploitation balance and boundary handling for global optimization.
Findings
MALA outperforms ALA and baseline algorithms on the CEC2017 benchmark suite.
MALA achieves higher PSNR, SSIM, and FSIM metrics in image segmentation tasks.
The algorithm improves stability and avoids premature convergence in optimization.
Abstract
Color multi-threshold image segmentation is a non-convex, gradient-free global optimization problem. The number of decision variables increases with the number of thresholds, leading to a rapid expansion of the search space and increased computational complexity. To address this problem, this paper proposes a Multi-Mechanism Artificial Lemming Algorithm (MALA). When applied to color multi-threshold image segmentation, the original Artificial Lemming Algorithm (ALA) suffers from an imbalance between exploration and exploitation, excessive reliance on the current best solution, and rigid boundary handling, which may lead to premature convergence and suboptimal threshold selection. MALA integrates three lightweight yet structurally enhancement mechanisms to enhance the stability of the exploration–exploitation process, population-level guidance, and boundary-handling behavior. To verify…
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Click any figure to enlarge with its caption.
Figure 1
Figure 2
Figure 3
Figure 4
Figure 5
Figure 6
Figure 7
Figure 8
Figure 9
Figure 10
Figure 11
Figure 12
Figure 13
Figure 14
Figure 15
Figure 16
Figure 17Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMetaheuristic Optimization Algorithms Research · Medical Image Segmentation Techniques · Advanced Neural Network Applications
1. Introduction
With the continuous development of computer vision, multi-threshold image segmentation has been widely applied in many fields, such as medical image analysis [1,2], remote sensing image interpretation [3], object detection [4], and scene understanding [5]. In practice, images often exhibit complex textures, varying lighting conditions, and inhomogeneous distribution of target regions [6]. As a result, in color multi-threshold image segmentation, the number of decision variables increases with the number of thresholds, which causes the search space to expand rapidly and makes the problem non-convex and gradient-free [7,8,9]. When attempting to exhaustively search all combinations of thresholds or to precisely model this problem, limited computational resources make it difficult to obtain an effective solution within a reasonable time [10,11,12,13]. Consequently, color multi-threshold image segmentation is a challenging global optimization problem that requires efficient and stable meta-heuristic strategies [14].
To address the above optimization challenges in color multi-threshold image segmentation, various bio-inspired optimization algorithms, such as Particle Swarm Optimization (PSO) [15], Salp Swarm Algorithm (SSA) [16], Artificial Bee Colony (ABC) [17], Firefly Algorithm (FA) [18], and Moth–Flame Optimization (MFO) [19], have been widely applied in recent years [20]. However, with the adoption of increasingly complex structures and control strategies, the dependence of these algorithms on parameters has increased significantly, which further aggravates stability and constraint-handling issues in multi-threshold image segmentation [21,22]. In this context, how to improve the stability and robustness of the search process under complex constraints while maintaining a concise structure and controllable parameters remains a critical issue in color multi-threshold image segmentation.
To develop an optimization framework that maintains structural simplicity while providing improved stability under constraints, Xiao et al. [23] proposed the Artificial Lemming Algorithm (ALA). By linking individual movement to energy considerations, ALA simulates lemming behavior during long-distance migration, burrowing, foraging, and evading predators and translates these biologically inspired survival strategies into an effective search mechanism. Compared with many meta-heuristic algorithms that rely on numerous control parameters, ALA features a simple structure, few parameters, and is easy to implement, and has demonstrated competitive performance on a variety of benchmark optimization problems, especially under limited evaluation budgets. However, the ALA still has several practical problems when used for color multi-threshold image segmentation, which is a gradient-free and bound-constrained optimization task. Specifically, the energy-gating mechanism may bias individuals toward a single behavioral branch during the middle and late iterations, leading to unstable transitions between exploration and exploitation. Moreover, population guidance that relies solely on the current best solution can be sensitive to noise and local optima. In addition, the direct boundary projection strategy may disrupt well-adapted solution components, thereby inducing boundary clustering or oscillatory behavior. As a result, these limitations may lead to premature convergence and trapping in local optima.
To alleviate these issues and improve the stability and reliability of threshold optimization, while retaining its biologically inspired foundation, this paper proposes a Multi-Mechanism Artificial Lemming Algorithm (MALA). MALA maintains low computational complexity while introducing multiple lightweight and structured enhancement mechanisms. It reduces sensitivity to stochastic fluctuations and boundary-handling effects, thereby improving the stability and adaptability of the search process. The main contributions are summarized as follows:
- An energy-based behavior smoothing mechanism is proposed to enhance transition stability between exploration and exploitation, thereby reducing premature convergence and improving the reliability of threshold selection in multi-threshold image segmentation.
- A population guidance refinement strategy incorporates significant elite solution information, which provides stable guidance at the population level and improves convergence robustness when optimizing multiple thresholds simultaneously.
- A boundary-handling improvement strategy uses a localized repair strategy to address constraint violations, thereby avoiding boundary clustering phenomena and ensuring the feasibility of threshold configurations in color multi-threshold image segmentation.
- MALA is evaluated using the CEC2017 benchmark test suite to verify its general optimization capability, and on color multi-threshold image segmentation problems to demonstrate its practical effectiveness.
The subsequent structure of this paper is as follows: Section 2 first reviews the original ALA and then details the proposed MALA, Section 3 presents the experimental setup and results analysis on the CEC2017 test suite, Section 4 applies the MALA to multi-threshold image segmentation and reports the experimental results, and Section 5 discusses the research conclusions and potential future research directions.
2. Multi-Mechanism Artificial Lemming Algorithm (MALA)
2.1. The Original Artificial Lemming Algorithm (ALA)
The original ALA balances the search range and precision by mimicking four collective behaviors of lemmings: long-distance migration, digging holes, foraging, and evading predators. Only the baseline operators and the energy-based switching rule that are relevant to the subsequent modifications are introduced in this subsection.
2.1.1. Initialization
N is defined as the population size, i is the index of an individual in the population, and is the dimension of the optimization problem. The search bounds are vectors , where . The initial population is created by random initialization:
2.1.2. Energy Factors
To balance exploration (migration/digging) and exploitation (foraging/evading), ALA introduces an energy factor that decreases with the number of iterations. Here, T is the maximum number of iterations, and denotes the current iteration index:
When , the algorithm performs exploration; otherwise, it performs exploitation.
2.1.3. Four Behavior Update Operators
At each iteration, ALA updates individual positions by randomly selecting one of four behavior operators according to the current search phase. In the exploration phase, long-distance migration is selected when ; otherwise, digging holes is performed; in the exploitation phase, foraging is selected when ; otherwise, evading predators is executed.
Since the mathematical formulations of these four behavior update operators are not modified in MALA, their detailed update equations and parameter definitions can be found in the original ALA paper.
2.1.4. Behavioral Gating
If the updated position violates the search bounds, a hard projection is applied:
Here, denotes the component-wise projection operator that enforces bound constraints. Then, greedy selection is performed between and the repaired candidate .
2.2. Proposed Enhancement Modules in MALA
In bound-constrained optimization, the original ALA may become sensitive to three factors. Specifically, it may be affected by stochastic fluctuations around the energy gate, unstable guidance when relying solely on the current best individual, and feasibility repair that may damage well-adapted components. Therefore, MALA retains ALA’s four behavioral operators and introduces three lightweight modules to improve gate stability, guidance stability, and repair stability, respectively.
2.2.1. Logistic Chaos Perturbation Function (LCEF)
In the original ALA, the energy gate is determined by Equation (2), with a decreasing annealing factor. In the mid- and late stages, when the annealing factor becomes small, a single random sample may overly bias some individuals toward one behavioral branch for many iterations (with or persisting in the long term). This exacerbates the imbalance between exploration and exploitation, reinforcing the influence of a single random sample on the search trajectory.
To mitigate this problem, MALA introduces a one-dimensional logistic chaotic sequence to perturb the energy factor. Specifically, it first randomly initializes a seed outside the interval of extreme values to generate the chaotic sequence , where is employed as the perturbation term at iteration t. The logistic map is defined as
where the parameter is fixed to the widely adopted standard value of 4 in the logistic map and not treated as a control parameter in the optimization process [24,25]. Under this setting, the map exhibits chaotic behavior with good ergodic coverage over and is employed as a bounded additive perturbation to modulate the energy gate near the switching threshold.
Based on the original energy formulation of ALA in Equation (2), a bounded chaotic perturbation term is incorporated to introduce controlled temporal variability into the energy gate. In generation t, the energy of individual i is redefined as follows:
Here, controls the perturbation amplitude (default ). The term enables symmetric positive–negative perturbation, ensuring unbiased disturbance to the energy factor. Moreover, the generated perturbation is strictly bounded within , without introducing significant disruptions to the original energy schedule. Given that demonstrates continuous variation within a limited amplitude, the significant temporal fluctuations in indicate persistent dynamic behavior, and moreover, the relative magnitude of these critical effects shows increasingly noticeable influence during the mid- and late stages when demonstrates a systematic decrease.
Accordingly, some individuals temporarily enter exploration branches. This transition results in intermittent long-distance migration behavior and digging patterns. In this way, LCEF maintains an overall search tendency while avoiding premature fixation on exploitation behavior. Even in the convergence phase, the algorithm retains the ability to escape local optima.
2.2.2. Elite Anchor Smoothing (EAS)
To improve the use of population-level statistics, MALA introduces elite anchor smoothing at each iteration.
Consider the set of current fitness values and denote its median as . Here, denotes the objective function to be minimized. The elite set is defined as follows:
The average of the elite set is defined as follows:
The median-based elite set improves robustness, even in the presence of suboptimal values or outliers. Therefore, EAS does not perform global recombination for all individuals, but instead applies a single smoothing update only to the current global best individual.
Let be the global best individual, defined as the solution with the minimum objective value at iteration t. The anchor point is constructed as follows:
Then, only the position of the current best individual is replaced by , while the other remains unchanged.
The median-based elite set is used to reduce sensitivity to extreme fitness values, which improves robustness compared to mean-based selection. By updating only the global best individual, EAS avoids disrupting population diversity while providing a smoothed guidance direction. The averaging factor provides a conservative correction, preventing overly aggressive shifts toward the elite center.
2.2.3. Selective Small-Scale Fixing (S3-F)
MALA adopts Selective Small-Scale Fixing (S3-F) as a refinement of ALA’s direct component-wise projection, aiming to alleviate constraint-violation clustering and late-stage feasibility oscillations.
The candidate position of individual i is denoted by . For each dimension j, the violation indicator is defined as follows:
The set of violated dimensions is defined as follows:
If , the anchor point defined in Equation (8) is used as the reference to repair the violated dimensions:
Here, controls the repair scale (default ). The annealing factor is inherited from the original ALA energy factors and is reused to ensure that the repair scale follows the same temporal decay pattern as the exploration or exploitation transition. By centering the repair around the adaptive anchor point , the correction is biased toward a representative and feasible region of the population, instead of forcing violated components directly onto the boundary. Subsequently, component-wise projection in Equation (3) is applied to ensure feasibility.
In this way, S3-F selectively repairs only the dimensions that violate the constraints relative to the adaptive anchor point, rather than enforcing strict feasibility for all dimensions in each iteration. This strategy suppresses excessive oscillations at the boundaries and ensures the continuity of the search process. Furthermore, decoupling boundary repair from full-dimensional updates enhances convergence stability. The effectiveness of each mechanism will be validated in subsequent experiments.
2.3. Integrated Algorithm Flow
The workflow of MALA is illustrated in Figure 1, and its pseudocode is described in Algorithm 1. At each iteration, the LCEF module first updates the logistic variable and individual energy coefficients. It then determines the behavioral branch of each individual based on the energy threshold and generates candidate solutions according to four behavioral actions. Subsequently, the EAS and S3-F modules are executed sequentially to perform elite anchor smoothing and selective small-scale boundary correction. Finally, a greedy selection strategy is used to update the individual solutions and the global best solution. Algorithm 1: Multi-Mechanism Artificial Lemming Algorithm (MALA).
3. Experiments in the CEC2017 Benchmark Test Suite
The CEC2017 test suite is a widely used benchmark for single-objective continuous optimization and has been adopted in recent meta-heuristic studies [26,27,28]. It comprises 30 test functions. Following common practice, F2 was excluded and results were reported on the remaining 29 functions under 30-dimensional settings.
For the systematic validation of MALA’s effectiveness on general numerical optimization tasks, two experiments were conducted on CEC2017:
- Ablation Experiments: Under the same experimental settings, the main mechanisms of MALA are activated one by one to analyze the individual contributions and synergistic effects of each module.
- Comparative Experiments with Other Algorithms: Under a unified computational budget, MALA is compared with multiple classical and representative algorithms to evaluate its overall advantages across various function types.
3.1. Experimental Setup and Evaluation Metrics
All experiments followed the CEC2017 standardized protocol with and population size . Each function was run independently 30 times with iterations per run. The same set of seeds was used for all algorithms on each function to ensure fair stochastic comparisons.
All algorithms initialized their populations uniformly within the same search space .
For each function, the final fitness values from 30 independent runs were collected and summarized by the mean (Mean) and standard deviation (Std), with the best mean results highlighted in bold. Algorithms were ranked on each function according to Mean to obtain per-function ranks, and the average rank (AvgRank) over all 29 functions was reported.
3.2. Ablation Experiment
To analyze the independent contributions of each improved mechanism, an ablation study was performed with the original ALA as the baseline. This includes the following five variants:
- ALA: The original Artificial Lemming Algorithm;
- MALA_LCEF: Adds only LCEF to ALA, with other improvements disabled;
- MALA_EAS: Adds only EAS to ALA, with other improvements disabled;
- MALA_S3F: Adds only S3-F to ALA, with other improvements disabled;
- MALA: Multi-Mechanism Artificial Lemming Algorithm.
Figure 2a shows a Friedman ranking radar chart for the five variants on the 30-dimensional CEC2017 test suite. The MALA curve is generally closest to the center and ranks first on 19 out of 29 functions.
Figure 2b shows the average rank over the 29 functions. MALA achieves an AvgRank of 1.41, outperforming all single-module variants and the baseline ALA. Among the single-module variants, MALA_S3F (2.03) and MALA_EAS (2.72) outperform ALA in AvgRank. In contrast, MALA_LCEF (4.34) is close to ALA (4.48). However, when integrated with the other two mechanisms, it contributes to better overall performance in MALA.
Overall, MALA demonstrates consistent performance advantages over both its ablation variants and the original ALA across the evaluation metrics. Therefore, MALA is adopted in subsequent experiments and application studies.
3.3. Overall Comparison with Classical and Representative Algorithms
In this subsection, MALA is systematically compared with seven reference algorithms on the CEC2017 test suite, focusing on overall search capability, convergence efficiency, and result stability on complex functions. The comparison algorithms are selected to ensure relevance and fair coverage of different swarm intelligence paradigms. The closely related variants ALA and EALA [29] are included to verify the effectiveness of the proposed enhancement strategies. In addition, a wide range of swarm intelligence algorithms is considered, ranging from traditional swarm intelligence algorithms such as the Sine–Cosine Algorithm (SCA) [30] and Grey Wolf Optimizer (GWO) [31] to more recent and widely used approaches such as the Whale Optimization Algorithm (WOA) [32], Harris Hawks Optimization (HHO) [33], and Manta Ray Foraging Optimization (MRFO) [34].
MALA and ALA/EALA use the parameters recommended in the original studies. In contrast, SCA, GWO, WOA, HHO, and MRFO use typical parameter settings reported in their original publications. Where parameter ranges are suggested, the recommended default values are used.
3.3.1. Comprehensive Results and Ranking Analysis
As shown in Table 1, MALA yields lower mean objective values for most of the tested functions, with its advantage particularly marked for mixed and composite functions, and achieves better expected solution quality under an identical evaluation budget. For instance, on the mixed function F22, the mean objective value obtained by MALA is 2.91 , which is substantially lower than those of ALA (6.27 ) and EALA (5.61 ), indicating superior solution quality under identical evaluation conditions.
Moreover, on composite functions such as F30, MALA exhibits a smaller standard deviation than ALA and EALA. The reduced performance dispersion across independent runs indicates enhanced robustness under constraints.
Table A1 presents the p-values obtained from the Wilcoxon rank-sum test between MALA and each comparison algorithm across the 29 test functions. For most functions, the difference between MALA and the comparison algorithms is statistically significant ( ).
Figure 3 shows the average ranking of the eight algorithms in the 30-dimensional CEC2017 test suite. MALA (1.55) achieves the lowest average rank overall. Thus, the proposed modifications provide a consistent advantage rather than isolated improvements on a few functions.
Figure 4 shows that a smaller radius corresponds to a better rank for the corresponding function. This figure illustrates the ranking distribution across the 29 test functions. MALA achieves first place in 19 of them, and its curve is highly concentrated, indicating consistently stable performance across all functions. Furthermore, MALA remains competitive even on the few functions where it does not attain the best mean fitness.
3.3.2. Convergence Behavior and Stability of MALA on Selected Functions
To analyze the convergence characteristics of each algorithm on different types of problems, this section selects six functions from the CEC2017 test suite. These functions cover basic multimodal functions ( ), mixed functions ( ), and composite functions ( ).
Figure 5 shows the average convergence behavior of each algorithm on the selected CEC2017 functions, where the best-so-far objective value is recorded at every iteration in each run and the curves are averaged over 30 independent runs under the same population size and maximum iterations. For and , MALA shows a convergence trend similar to several comparison algorithms in the early stage, while its objective value continues to decrease in later iterations. In contrast, the convergence curves of some algorithms show limited improvement in later iterations. Distinct differences in convergence behavior are observed among algorithms for , , , and , where MALA demonstrates a relatively stable convergence process.
Moreover, the box plot in Figure 6 statistically characterizes the stability differences between algorithms across 30 independent runs. MALA presents a compact distribution across the six functions, with some competing algorithms exhibit longer boxes, indicating more outliers and a higher degree of uncertainty. This difference is particularly pronounced for hybrid and composite functions.
The average convergence curves and box plots demonstrate that MALA exhibits lower statistical dispersion and higher overall robustness than competing algorithms across all categories of the CEC2017 test suite.
3.4. Additional Benchmark Verification on CEC2014
To further examine the generalization performance of MALA, additional experiments were conducted on the CEC2014 benchmark suite [35]. The computational budget and experimental settings are consistent with those used for the CEC2017 test suite.
Table 2 presents the overall results for the CEC2014 test suite, including and average rank (AvgRank). The results show that MALA maintains competitive performance and achieves the lowest AvgRank (2.23), supporting the effectiveness of the proposed mechanisms on a different benchmark suite. The Wilcoxon rank-sum test results are provided in Table A2.
The above results indicate that MALA achieves overall performance better than ALA, EALA, and other representative swarm intelligence algorithms on the 30-dimensional CEC2017 and CEC2014 test suites. These findings further support the applicability to multi-threshold image segmentation.
4. Application in Color Multi-Threshold Image Segmentation
In this section, MALA is applied to color multi-threshold image segmentation. The stability and overall performance of the proposed algorithm are further examined in image segmentation tasks with increasing decision dimensionality.
Six color images were selected as the test set from the USC-SIPI image database (https://sipi.usc.edu/database/database.php?volume=misc, accessed on 1 January 2026). The selected images include Airplane, Baboon, House, Lake, Car, andBeans. The test images and their RGB histograms are shown in Figure 7.
4.1. Experimental Setup and Evaluation Metrics
Commonly used objective functions in multi-threshold segmentation include Otsu’s criterion [36] and Kapur’s entropy [37]. Otsu’s criterion is adopted due to its simplicity and ease of implementation [38,39], aiming to assess the stability and convergence behavior of optimizers under increasing decision dimensionality. The objective is defined as
Here, the threshold vector for each channel satisfies the condition . When is given, is the probability weight of the k-th segmented region, is the average intensity value of that region, and represents the average intensity value of the entire channel. These values are calculated using the normalized histogram, where denotes the probability of gray level i and .
For color images, a channel-wise segmentation strategy is adopted. For each of the three RGB channels, m thresholds will be considered. This results in a decision dimension of . Then, the objective functions of the three channels are combined to form an overall objective function. That is, . Additionally, to evaluate the results more comprehensively, the following three indicators are calculated:
- The Peak Signal-to-Noise Ratio (PSNR) is an indicator that measures the pixel-level distortion between the segmentation result and the original image [40]. It is defined as follows:
Here, represents the Mean Squared Error. The higher the PSNR, the smaller the pixel distortion.
- The Structural Similarity Index (SSIM) is an indicator used to measure the retention of structural information [41], such as contours and textures, and is defined as follows:
Here, and represent the mean values, and represent the standard deviations, represents the covariance, and and are stabilization constants. The higher the SSIM value, the greater the structural similarity.
- The Feature Similarity Index (FSIM) describes the feature-level similarities between the segmented image and the original image and measures the perceptual consistency at the visual feature level [42]. It is based on two core visual features: phase consistency (PC) and gradient magnitude (GM). Let x and y denote the original image and the segmented image, respectively. denotes the local feature similarity at pixel i, which is jointly determined by the phase consistency and gradient magnitude similarities. The FSIM index evaluates feature similarity at the pixel level, and its mathematical expression is
During implementation, the key metrics PSNR and SSIM are directly calculated on color images, while FSIM is calculated by converting RGB color images to grayscale images to ensure consistency and reproducibility of the results.
In the experiment, the number of thresholds m was set to . The corresponding decision dimension thus took values in . For each image and threshold number m, each algorithm was run independently 20 times. The population size and the maximum number of iterations were and , respectively, and the total number of fitness evaluations was set to 3000. Eight swarm intelligence algorithms, consistent with those used in the CEC2017 experiments, were compared under the same objective function, threshold constraint repair strategy, and stopping criteria.
4.2. Experimental Results and Discussion
4.2.1. OBJ: Statistical Results and Convergence Curves
Figure 8 shows that the average OBJ ranking varies with the number of thresholds m. Moreover, MALA attains the optimal value across all images and all threshold settings, demonstrating its considerable advantage in segmentation quality.
Table 3 presents the average objective function values ( ) for different threshold numbers m. Across different images and dimensions, MALA achieves the best objective values. For instance, in the Airplane image with , MALA attains an objective value of , which is higher than those achieved by ALA ( ) and EALA ( ). Similarly, for the House image under , MALA reaches , while the corresponding results of ALA and EALA are and , respectively.
Furthermore, Figure 9 shows the average convergence curves of the best-so-far OBJ for six test images under . Across different images and threshold numbers, MALA quickly achieves high objective values in the early stages of iteration, showing a continuous upward trend without significant fluctuations. As m increases, the search space and problem complexity grow, leading to more pronounced performance differences among algorithms, which reflects the robustness of MALA in multi-threshold image segmentation tasks.
4.2.2. PSNR, SSIM, FSIM: Ranking Trends and Statistical Results
Figure 10 shows the changes in the average PSNR ranking as the threshold number m increases. MALA consistently maintains the top ranking across all values of m, indicating relatively low pixel distortion in the segmented results.
This trend is consistent with the statistical results reported in Table A3. For instance, on the Airplane image with , MALA achieves a PSNR value of , which is higher than that of the second-ranked GWO ( ), with a smaller standard deviation. GWO ranks second overall, while MRFO consistently holds the third position. In contrast, SCA and HHO occupy lower rankings for most m values, suggesting larger pixel-level distortion.
Figure 11 compares the average SSIM rankings. Moreover, MALA consistently maintains the top ranking with minimal fluctuation as m increases, suggesting stable ability to preserve image structural information. However, GWO ranks second overall, while SCA occupies the last position, indicating weaker preservation of structural and textural information. These patterns appear consistent, and the observation is further supported by the quantitative results summarized in Table A4. On the Car image with , MALA achieves an SSIM value of , which is higher than those obtained by ALA ( ) and EALA ( ).
As shown in Figure 12, the overall trend of FSIM rankings is consistent with the observed trends of PSNR and SSIM. MALA achieves the top ranking for each value of m. The results confirm that MALA effectively preserves perceptually relevant features, such as local contrast and salient structural cues, in multi-threshold segmentation.Thus, the detailed FSIM statistics in Table A5 support this trend.
4.3. Additional Application Verification on Remote Sensing Images
To further validate the applicability and extensibility of MALA in practical scenarios, two remote sensing color images, Baseball Diamondand Beach, are adopted from the UCM Dataset for additional verification in remote sensing image segmentation tasks. Remote sensing images contain multiple land-cover types, such as vegetation, roads, and water, with distinct color distributions, making these data particularly suitable for color multi-threshold segmentation [43]. Furthermore, the selected remote sensing images and their corresponding RGB histograms exhibit characteristics consistent with those illustrated in Figure 13. The experimental settings and evaluation metrics are consistent with those used in previous sections.
Figure 14 shows the segmentation results under different threshold numbers. As the number of thresholds increases, the main land-cover regions are further subdivided, the color layers become richer, and the region boundaries remain clear.
For these remote sensing images, the OBJ statistics shown in Table 4 indicate that MALA is still able to obtain competitive objective function values under different threshold settings, with relatively small standard deviations, indicating stable overall performance. In addition, the detailed results of PSNR, SSIM, and FSIM are listed in Table A6, Table A7 and Table A8. MALA achieves the lowest average ranking across these metrics. This suggests that the improvement in objective value is achieved without sacrificing structural preservation or visual similarity.
In summary, MALA also exhibits good adaptability and stable performance in remote sensing color multi-threshold segmentation tasks.
5. Discussion and Conclusions
In this paper, MALA is designed to enhance the stability and adaptability of ALA, particularly for color multi-threshold image segmentation problems where the search space expands as the number of thresholds increases. The effectiveness of MALA has been validated through a series of numerical experiments. From the algorithm design perspective, MALA improves the convergence behavior and constraint-handling capability of ALA in complex search spaces by introducing complementary stabilization mechanisms, without altering the original behavioral structure of ALA.
Based on the results of the numerical experiments, MALA demonstrates relatively smoother convergence behavior on various functions in the CEC2017 test suite compared to the original ALA and several representative algorithms. Moreover, the average fitness values and average rankings are improved in most cases, with reduced performance variability across different function categories. In the application of color multi-threshold image segmentation, MALA is able to obtain higher Otsu objective values under a limited computational budget, even as the number of thresholds and the dimensionality of the search space increase. MALA performs well in PSNR, SSIM, and FSIM, and the results remain stable across different settings. These results show that combining the three mechanisms improves the robustness of MALA in constrained optimization problems.
This study has some limitations that warrant additional investigation. Moreover, the MALA approach still demonstrates reliance on a limited number of manually defined control parameters, and furthermore, more systematic analysis is necessary to evaluate their generalization across different segmentation scenarios. Given that the experiments on multi-threshold image segmentation are primarily based on traditional natural image datasets and the Otsu objective function, the experiments have not been extended to more complex application scenarios. Nevertheless, such scenarios involve stronger noise interference and more complex semantic structures.
To address these shortcomings, future research may explore several directions. Firstly, it would be meaningful to evaluate the applicability of MALA to other image analysis tasks such as medical image segmentation and multi-objective image optimization. Another direction is to combine MALA with other optimization strategies, which may improve its performance.
Overall, MALA extends the original ALA framework and shows suitability for color multi-threshold image segmentation.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1La Londe R. Xu Z. Irmakci I. Jain S. Bagci U. Capsules for biomedical image segmentation Med. Image Anal.20206810188910.1016/j.media.2020.10188933246227 PMC 7944580 · doi ↗ · pubmed ↗
- 2Bai Y. Zhao D. Yu H. Heidari A.A. Cai Z. Liu L. Chen H. Enhanced sine cosine algorithm for multi-threshold image segmentation: A breast cancer pathology image study Clust. Comput.20252896210.1007/s 10586-025-05643-z · doi ↗
- 3Ding L. Zhu K. Peng D. Tang H. Yang K. Bruzzone L. Adapting Segment Anything Model for Change Detection in VHR Remote Sensing Images IEEE Trans. Geosci. Remote Sens.202462561171110.1109/TGRS.2024.3368168 · doi ↗
- 4Hoeser T. Kuenzer C. Object Detection and Image Segmentation with Deep Learning on Earth Observation Data: A Review—Part I: Evolution and Recent Trends Remote Sens.202012166710.3390/rs 12101667 · doi ↗
- 5Li W. Gu J. Dong Y. Dong Y. Han J. Indoor scene understanding via RGB-D image segmentation employing depth-based CNN and CR Fs Multimed. Tools Appl.202079354753548910.1007/s 11042-019-07882-w · doi ↗
- 6Xu Y. Quan R. Xu W. Huang Y. Chen X. Liu F. Advances in Medical Image Segmentation: A Comprehensive Review of Traditional, Deep Learning and Hybrid Approaches Bioengineering 202411103410.3390/bioengineering 1110103439451409 PMC 11505408 · doi ↗ · pubmed ↗
- 7Houssein E.H. El-din Helmy B. Oliva D. Elngar A.A. Shaban H. Multi-level Thresholding Image Segmentation Based on Nature-Inspired Optimization Algorithms: A Comprehensive Review Metaheuristics in Machine Learning: Theory and Applications; Studies in Computational Intelligence Springer Cham, Switzerland 2021 Volume 96723926510.1007/978-3-030-70542-8_11 · doi ↗
- 8Jardim S. António J. Mora C. Image thresholding approaches for medical image segmentation—Short literature review Procedia Comput. Sci.20232191485149210.1016/j.procs.2023.01.439 · doi ↗
