Variable Bayesian-Based Maximum Correntropy Criterion Cubature Kalman Filter with Application to Target Tracking
Yu Ma, Guanghua Zhang, Songtao Ye, Dou An

TL;DR
This paper introduces a new filtering method for tracking targets in complex environments with non-Gaussian noise and sensor outliers.
Contribution
The novel VBMCC-CKF framework adaptively estimates state and kernel size using variational Bayesian inference without manual tuning.
Findings
VBMCC-CKF reduces trajectory average root mean square error by at least 14.33% in single-target tracking.
The method achieves 40% lower OSPA distance in multi-target tracking under covariance mutations.
It maintains real-time efficiency and superior hit rates in cluttered environments.
Abstract
Target tracking in typical radar applications faces critical challenges in complex environments, including nonlinear dynamics, non-Gaussian noise, and sensor outliers. Current robustness-enhanced approaches remain constrained by empirical kernel tuning and computational trade-offs, failing to achieve balanced noise suppression and real-time efficiency. To address these limitations, this paper proposes the variational Bayesian-based maximum correntropy criterion cubature Kalman filter (VBMCC-CKF), which integrates variational Bayesian inference with CKF to establish a fully adaptive robust filtering framework for nonlinear systems. The core innovation lies in constructing a joint estimation framework of state and kernel size, where the kernel size is modeled as an inverse-gamma distributed random variable. Leveraging the conjugate properties of Gaussian-inverse gamma distributions, the…
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Click any figure to enlarge with its caption.
Figure 1
Figure 2
Figure 3
Figure 4
Figure 5
Figure 6
Figure 7- —National Natural Science Foundation of China
- —Natural Science Basic Research Program of Shaanxi
- —Fundamental Research Funds for the Central Universities
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTarget Tracking and Data Fusion in Sensor Networks · Inertial Sensor and Navigation · Fault Detection and Control Systems
1. Introduction
High-precision state estimation serves as the fundamental cornerstone for ensuring system reliability of dynamic perception and autonomous systems. As intelligent devices are increasingly deployed in complex environments, sensor data are subject to multifaceted effects, including nonlinear propagation characteristics, multi-source heterogeneous noise, and dynamic target interactions [1,2]. Particularly in target tracking scenarios, radar may produce outliers or noise spikes due to environmental interference or hardware malfunctions [3], while adversarial electronic warfare tactics in military applications can introduce extreme noise through deliberate jamming signals [4]. Such scenarios often exhibit heavy-tailed noise distributions, deviating significantly from Gaussian assumptions [5]. Consequently, nonlinear filtering algorithms capable of mitigating these adverse factors constitute the core technology for target tracking, with their performance directly influencing the accuracy and reliability of tracking outcomes.
In recent years, nonlinear filtering techniques have evolved significantly through derivatives of the Kalman filter framework, particularly in addressing complex dynamic systems and high-dimensional nonlinear problems. The extended Kalman filter (EKF) [6,7] linearizes nonlinear models via first-order Taylor expansions within the Kalman framework. However, its reliance on local linearization introduces truncation errors that accumulate in strongly nonlinear scenarios, often leading to filter divergence. Apart from linearization errors, the vulnerability of EKF to environmental disturbances further limits its robustness. Recent innovations address this by integrating reliable measurement selection mechanisms with EKF frameworks [8]. In addition, to address these limitations, probabilistic approximation-based methods such as the unscented Kalman filter (UKF) [9,10,11] and cubature Kalman filter (CKF) [12,13] have emerged. The UKF transmits statistical moment information via unscented transform sampling of Sigma points, thereby enhancing adaptability for strongly nonlinear systems. Nevertheless, high-dimensional sampling can lead to issues of non-positive definite covariance matrices, resulting in numerical instability and degraded precision. In contrast, CKF employs Gaussian-weighted integration rules and approximates nonlinear distributions using the spherical–radial cubature criterion. By ensuring computational efficiency and designing symmetric sampling points, it enhances filtering robustness. In scenarios with higher state dimensions or greater nonlinear intensity, its estimation accuracy and robustness surpass those of the UKF, making it an essential theoretical tool for state estimation in complex dynamic systems. However, these nonlinear variants of the Kalman filter are predicated on the assumption of Gaussian noise. In heavy-tailed noise environments, where higher-order statistical moments dominate, such methods often fail to mitigate error accumulation, ultimately leading to estimation divergence.
State estimation under heavy-tailed noise conditions remains a critical challenge in dynamic systems, with existing approaches primarily focusing on noise statistical modeling and distribution approximation. Heavy-tailed distribution modeling explicitly constructs non-Gaussian noise profiles to enhance robustness [2,5,14], yet its non-conjugacy leads to the lack of closed-form solutions, necessitating moment-matching approximations that introduce additional errors. Multi-model approaches, such as Gaussian mixture models [15] and conditioned Markov switching models [16], describe noise mutation characteristics through weighted multimodal distributions. However, their practical utility is constrained by model complexity and sensitivity to predefined weights. Monte Carlo methods, exemplified by particle filters [17,18], theoretically achieve arbitrary distribution approximation via stochastic sampling. Despite asymptotic optimality, their computational complexity in high-dimensional scenarios renders them impractical for real-time applications. Beyond conventional heavy-tailed modeling, recent advances have developed dual-path adaptive frameworks to simultaneously address non-Gaussian noise in system dynamics and measurement models. A representative example is the Mahalanobis distance-based CKF with both adaptability and robustness for tightly coupled GNSS/INS integration [19]. Collectively, the trade-offs among dynamic noise adaptation, computational efficiency, and theoretical closed-form solutions remain significant challenges for these methods. Recent advancements in information-theoretic learning (ITL) have introduced novel paradigms for enhancing filter robustness by reconstructing optimization objectives through higher-order statistics and nonlinear similarity measures (such as correntropy) [20]. Unlike minimum mean square error (MMSE)-based methods that rely solely on second-order moments, the maximum correntropy criterion (MCC) embeds local similarity into the cost function through kernel function mapping, using second-order and above moments of the innovation to effectively suppress outliers while capturing higher-order data characteristics [21,22,23]. Beyond traditional filtering, the MCC framework has been extended to specialized domains requiring robust non-Gaussian noise suppression. A state-of-the-art application is spectral redshift navigation [24] leveraging MCC to dynamically estimate stellar spectral shifts under cosmic noise. Such advancements underscore MCC’s potential in enhancing robustness for complex nonlinear systems beyond filtering designs. Although MCC sacrifices closed-form solutions for recursive estimation [20], its dynamic adaptability to heavy-tailed non-Gaussian noise has demonstrated unique advantages in complex scenarios, driving its adoption in robust adaptive filtering designs [20,25,26,27,28,29]. Notably, a critical parameter in MCC-based filters is the kernel size for adjusting the gain matrix, which governs outlier suppression efficacy and estimation stability [20,25]. Current kernel size selection strategies predominantly rely on predefined static values (e.g., empirically fixed via offline experiments [20,26,27]) or online empirical adjustments (e.g., heuristic calculations based on innovation covariance [25,28,29]). However, static methods exhibit strong scenario dependency, limiting generalizability across diverse noise characteristics, while online strategies fail to adapt to abrupt interference intensity variations due to fixed adaptation mechanisms. Although a kernel size optimization mechanism based on sliding-window heuristics [30] has been proposed, its noise adaptability remains constrained by predefined window parameters, exhibiting inherent latency in responding to abrupt noise disturbances. Fundamentally, existing approaches lack an adaptive association mechanism between kernel size and dynamic noise statistics, hindering real-time adaptability in scenarios with time-varying noise distributions or sudden changes in interference intensity. Furthermore, current research predominantly targets single-objective scenarios, lacking extended modeling for multi-target complex environments. These limitations subject existing methodologies to dual challenges of degraded estimation precision and computational efficiency imbalance in scenarios involving abrupt noise characteristic transitions or multi-target interactive environments.
To address these challenges, variational Bayesian (VB) methods have emerged as a promising framework for handling complex posterior distributions and joint estimation problems [31,32,33,34]. By probabilistically modeling latent variables (e.g., kernel size) and leveraging conjugate prior properties, VB enables joint Bayesian inference of states and parameters. Building on this foundation, this paper innovatively proposes a dynamic joint estimation mechanism for kernel size and state by integrating variational Bayesian inference with the theory of MCC. Additionally, it incorporates cubature transformation techniques to minimize approximation errors in nonlinear systems. Consequently, the variational Bayesian-based maximum correntropy criterion cubature Kalman filter (VBMCC-CKF) is proposed, showcasing enhanced optimization capabilities. The key innovations are reflected in the following three aspects.
(1)Integration architecture of VB and maximum correntropy criterion cubature Kalman filter (MCC-CKF):
For the first time, this method combines VB inference with MCC-CKF to construct a joint estimation framework of state and kernel size, breaking the dependence of traditional filtering algorithms on Gaussian noise assumptions and fixed kernel parameters. By synchronously optimizing the posterior distribution of the state and the kernel size parameters, it achieves fully adaptive robust estimation under nonlinear and non-Gaussian noise.
(2)Adaptive kernel size dynamic adjustment mechanism based on conjugate distribution:
A probabilistic modeling paradigm is introduced by treating the kernel size as an inverse gamma-distributed random variable [35]. Leveraging the conjugate relationship between Gaussian and inverse gamma distributions, a closed-form analytical solution is derived through a variational Bayesian online alternating optimization. This mechanism significantly enhances robustness against heavy-tailed noise and outliers while maintaining linear computational complexity for parameter updates, ensuring real-time applicability.
(3)Scalability in single/multi-target tracking scenarios:
VBMCC-CKF demonstrates exceptional generalizability across tracking scenarios. In single-target benchmarks, it delivers superior robustness against complex disturbances. When extended to multi-target tracking via a multi-Bernoulli filtering framework [36,37,38], the method dynamically balances multi-dimensional noise suppression and computational efficiency. This enables precise continuous tracking in cluttered environments with time-varying target interactions, providing a modular solution for advanced perception systems.
The structure of this paper is arranged as follows: Section 2 systematically expounds the theoretical framework of CKF based on MCC, highlighting the limitations of conventional approaches and motivating this research. Section 3 details the proposed adaptive kernel optimization methodology, focusing on the variational Bayesian iterative framework and its mathematical derivations. Section 4 theoretically analyzes robustness mechanisms and evaluates computational complexity. Section 5 validates the algorithm through comparative simulations: single-target benchmark evaluations followed by multi-target simulations under dynamic clutter. Section 6 concludes with research summaries and future directions in nonlinear non-Gaussian systems.
2. Problem Formulation
2.1. Principle of CKF
Consider a nonlinear dynamic system [29]:
where denotes the state vector at time , denotes the measurement vector at time , and are the nonlinear state transition and measurement functions, respectively. The process noise and measurement noise are mutually uncorrelated white Gaussian noise with covariance matrices and , respectively. CKF employs a set of deterministic cubature points to approximate the posterior mean and error covariance of the state under nonlinear transformations with additive Gaussian noise, thereby enabling robust state estimation through effective exploitation of noise-contaminated measurement information.
The CKF algorithm initializes with the prior state estimate and covariance , followed by recursive execution of two steps: prediction and update.
Prediction Step: At time , given the posterior state estimate and covariance , generate symmetric cubature points:
where is the Cholesky decomposition of . Propagating these points through yields the transformed cubature points , . The predicted state mean and covariance are computed as follows:
Update Step: Propagate the predicted cubature points through to generate the measurement cubature points , . The predicted measurement mean, measurement covariance, and cross-covariance are calculated as follows:
Calculate the Kalman gain as follows:
Finally, the estimate and covariance of the posterior state are updated via the following:
2.2. Principle of MCC
For the random variables , , the correntropy quantifies their generalized similarity and is defined as follows [29]:
where denotes the expectation operator, and is the joint probability density function. Because the Gaussian kernel function can approach the nonlinear model infinitely, the kernel function satisfying Mercer’s theorem is designed, with the kernel size adjusted dynamically and adaptively:
By expanding the Gaussian kernel function in Equation (12) into a Taylor series and substituting it into (11), we have the following:
It is evident that correntropy constitutes a weighted sum of even-order moments of , thereby incorporating higher-order statistical information beyond the second moment. When is sufficiently large, correntropy is determined by the second moment. Furthermore, due to the general unavailability of and the finite samples , , correntropy is empirically estimated using the sample mean.
Let denote the model output that depends on the parameter vector , and is the desired response. The residual term depends on . MCC is defined as the optimization framework that identifies the optimal parameter vector from a feasible set by maximizing the empirical correntropy in (14), formally expressed as follows:
In practical engineering applications, systems are inevitably subjected to various interference signals characterized by significant uncertainties in temporal distributions and amplitude intensities. A representative example includes outlier measurements generated by low-reliability sensors. Under such complex operating conditions, the conventional assumption of Gaussian-distributed process and measurement noise often fails, as system noise typically exhibits non-Gaussian heavy-tailed statistical properties (e.g., Gaussian mixture or Student’s t-distributions). These characteristics disrupt the optimality conditions of CKF, leading to substantial degradation in state estimation accuracy and filter stability, thereby severely limiting CKF practical applicability in non-Gaussian noise environments.
To address the challenges posed by unknown disturbances and non-Gaussian heavy-tailed noise in dynamic systems, numerous robust filtering algorithms have been proposed in recent years. Among these, Kalman filter variants based on MCC have emerged as a research focus due to their exceptional disturbance rejection capabilities. In contrast to the traditional MMSE criterion, which relies solely on second-order moments of the innovation, the MCC framework enhances robustness against the abnormal disturbances by jointly exploiting both second- and higher-order moment information of the innovation. Consequently, integrating MCC into the CKF framework yields a robust hybrid algorithm—MCC-CKF. This algorithm establishes a cooperative optimization mechanism between the error covariance matrix and the kernel function, effectively improving state estimation accuracy and filtering stability.
2.3. Principle of MCC-CKF
In the framework of a nonlinear dynamic system model, the optimization objective function based on MCC is formulated as follows:
where denotes the squared Mahalanobis distance of the covariance matrix , representing the standardized discrepancy under the covariance structure.
To address the limitation that a traditional Gaussian kernel function is not sensitive to capturing feature correlations, a Gaussian kernel function based on a square Mahalanobis distance is reconstructed as follows:
The optimal estimate is obtained by maximizing ,
Consequently, the state estimate of the original CKF in (9) is rewritten as
where is defined as follows:
Here, serves as an additional adjustment factor to suppress the influence of anomalous disturbances and iteratively refine the estimation accuracy of .
The modified measurement covariance matrix is
Accordingly, the covariance update is modified to
The complete MCC-CKF (Algorithm 1) is summarized as follows.
Algorithm 1: MCC-CKFInput: (1) Prediction:Generate symmetric cubature points in (2), and calculate the predicted state in (3) and the corresponding covariance in (4).(2) Update:Predict measurements in (5);Initialize , and set , and as the iteration threshold.Iterate:The adjustment factor in (21), the measurement covariance in (22), the cross-covariance in (7), and the Kalman gain in (20) are calculated successively.Refine the state estimate in (19) and the covariance in (23).Until: ;Return and , where is iterations.Output: , .
2.4. Limitation and Improvement of MCC-CKF
As illustrated in Algorithm 1, MCC-CKF extends CKF to non-Gaussian environments by adaptively weighting the innovation covariance through a kernel function and iteratively optimizing the Kalman gain and covariance matrices using the adjustment factor . This mechanism effectively suppresses the influence of large estimation errors, significantly enhancing robustness under non-Gaussian noise. However, this improvement incurs higher computational complexity due to kernel operations, iterative optimization, and parameter adjustment—particularly, the selection of the kernel size . As a critical hyperparameter, directly governs the computation of and propagates its sensitivity to external disturbances to the gain matrix by means of . In (21), a smaller reduces , leading to diminished gain magnitudes in (20) and potential performance degradation or even divergence under Gaussian noise. Conversely, if , , then and , and MCC-CKF degenerates into the standard CKF. Thus, the rational selection of kernel size is pivotal to balancing estimation accuracy and robustness in MCC-CKF.
Given the non-stationary nature of external disturbances in temporal distribution and amplitude, the optimal kernel size varies with time-varying disturbance environments. Therefore, it is necessary to adopt the kernel scale adaptive adjustment strategy based on real-time data to make the adjustment factor dynamically respond to disturbance variations. This mechanism endows the filter with real-time responsiveness to unknown time-varying disturbances, achieving a balanced optimization between disturbance rejection and estimation accuracy through parameter coupling. Existing approaches, such as the adaptive method in [25], dynamically adjust the kernel size based on the weighted norm of measurement residuals at each iteration ( ). While this method improves robustness against non-Gaussian noise compared to filters relying on default or heuristic kernel selections [20,26,27], integrating the adjustment factor with a residual-dependent adaptation of the kernel size constrains to a static value, thereby preventing real-time responsiveness to varying disturbance intensities. Consequently, this adaptive strategy remains insufficient to counteract unknown disturbances of varying intensities. Therefore, kernel size adjustment transitions from static or heuristic approaches to data-driven probabilistic inference, substantially enhancing the filters’ robustness in non-stationary non-Gaussian environments.
To eliminate reliance on heuristic parameter tuning, this paper proposes the VBMCC-CKF framework, which recursively estimates the system state while dynamically adapting the kernel size via VB theory [33]. Leveraging VB theory, the joint posterior distribution of the state and kernel size is approximated using factorized distributions (Gaussian and inverse gamma), ensuring computational tractability and efficiency. Integrated with the state estimation framework of MCC-CKF, the proposed method achieves real-time adaptability to unknown time-varying disturbances and significantly improves robustness in non-stationary non-Gaussian environments.
3. VBMCC-CKF with Adaptive Kernel Size Adjustment Mechanism
3.1. Kernel Size Modeling and Joint Estimation of State and Kernel Size
Let the Gaussian kernel size and the state be modeled as random variables and define . Leveraging the conjugacy of the Gaussian-inverse gamma distribution, which ensures closed-form posterior updates and computational efficiency, the kernel size is modeled as a random variable following an inverse gamma distribution:
The state variables follow a Gaussian distribution:
To jointly estimate the state variables and kernel size, the joint posterior distribution is formulated as
where denotes the joint posterior probability density function, and represents the measurement sequence of the sensor from the start to the current time . The objective is to estimate the state and kernel size conditioned on .
3.2. Variational Bayesian Iterative Optimization
Because of the coupling of and in joint posterior distribution, the analytic solution of joint posterior distribution cannot be derived directly by using Bayes’ rule. To decouple the joint posterior, the variational Bayesian approximation [33] is employed to decompose the joint posteriori distribution into two factorized independent distributions and .
The Kullback–Leibler (KL) divergence quantifies the discrepancy between two probability distributions. Within the variational Bayesian framework, the goal is to minimize the KL divergence between the factorized approximation and the true joint posterior , expressed as follows:
Optimizing towards minimal KL divergence inherently ensures the evidence lower bound (ELBO) reaches its theoretical maximum [39], thereby identifying optimal and . The variational Bayesian iterative optimization realizes the online adaptive selection of kernel size by alternating between updating and and progressively approximating the true posterior distribution. Its core value is to transform the complex joint estimation problem into a subproblem that can be solved analytically, that is, the alternating optimization calculation of two independent distributions, while ensuring the real-time and anti-interference ability of the algorithm. The alternating optimization rule is as follows:
(1)Fix the kernel size and update the state distribution : the following update rule is derived analytically by minimizing .
(2)Fix the state and update the kernel size distribution : the following update rule is derived analytically by minimizing .
Remark 1. The conjugacy between the Gaussian distribution (state) and the inverse gamma distribution (kernel size) guarantees the closed-form solution of the posterior distribution. The updated formulas of state and kernel size are derived by minimizing the KL divergence. All updated formulas are directly derived by analytic expectation calculation, requiring only 2–3 iterations for convergence. This avoids Monte Carlo integration or numerical optimization, satisfying real-time computational demands.
3.3. Closed-Form Derivation
The closed-form derivation hinges on leveraging the conjugacy between the Gaussian and inverse gamma distributions within the variational Bayesian framework, enabling analytical updates of the joint posterior distribution for the state and kernel size . The detailed derivation proceeds as follows:
(i) Initialization: To facilitate recursive computation, it is assumed that the prior distributions of the state and the kernel size at time also satisfy the conjugation condition, i.e., and . The joint posterior probability density function in (31) is approximated as the product of the Gaussian distribution and the inverse gamma distribution.
(ii) Prediction Step: To compute the joint predicted probability density function at time , a dynamic evolution model is formulated for the kernel size . The kernel size parameters propagate temporally through a heuristic model, which embodies an exponentially decaying memory mechanism. This mechanism ensures that historical information diminishes gradually over time, thereby enhancing adaptability to dynamically disturbed environments.
where governs the time-varying behavior of the kernel size, referred to as the decay factor. Smaller values of accelerate the temporal adaptation of the kernel size, enabling rapid response to dynamic disturbed conditions.
Given the absence of physical correlation between the state variables and kernel size, their dynamic evolution models are postulated to be mutually independent:
Further, leveraging the Chapman–Kolmogorov equation [33], the independence of the predicted probability density functions and is established, leading to the following:
Given the propagation characteristics of Gaussian noise, the initial state undergoes deterministic sampling (e.g., cubature points in (2)) for the nonlinear state transition function in (1). By integrating the predicted state mean in (3), predicted covariance in (4), and Equations (31) and (34), the state prediction distribution remains approximately Gaussian: . Similarly, leveraging the dynamic evolution model for the kernel size in (32), the predicted distribution of kernel size retains its inverse gamma form: . Therefore, substituting and into Equation (34), the joint predictive probability density function is calculated as follows:
(iii) Update: The joint posterior distribution of the state and kernel size is analytically updated via the variational Bayesian iterative optimization outlined in Section 3.2, resolving the coupling between and . By minimizing the KL divergence in (28), the factorized distributions and are alternately optimized.
(1) Update state distribution (fix ): Integrating the predictive probability density function in (34), the joint probability can be decomposed as follows:
where represents the joint likelihood function. This likelihood is defined by the Gaussian kernel in (37), which incorporates second- and higher-order moment information of the innovation to mitigate the influence of disturbances on the dynamic system:
where denotes the measurement dimension. The variational update proceeds by integrating Equations (35) and (37) and computing the expectation of with respect to , yielding the following:
Consequently, retains a Gaussian form as the product of two Gaussian distributions, with the measurement noise covariance scaled by the following:
For , the state estimate and covariance are updated as follows:
where is defined as follows:
The measurement covariance matrix is
The cross-covariance matrix is consistent with Equation (7):
(2) Update kernel size distribution (fix ):
The variational update proceeds by computing the expectation of with respect to and yielding the following:
Therefore, retains its inverse gamma distribution. By applying the linearization approximation , the residual yields the following closed-form update for the kernel size parameter:
The expectation of the kernel size is .
Remark 2. For online adaptability of the kernel size, dynamically adjusts based on the residual . When increases (indicating the presence of outliers), increases, thereby enlarging to reduce the weighting of current measurements (suppressing disturbance). Conversely, when decreases, diminishes, enhancing the utilization of measurement information. This mechanism ensures adaptive adjustment of the kernel size parameter in response to varying disturbance intensities.
Through the aforementioned derivations, the joint posterior probability density function at time can be approximated as the product of a Gaussian distribution and an inverse gamma distribution after alternating optimization. The parameters of the Gaussian and inverse gamma distributions are iteratively determined via Equations (40)–(44) and (46)–(47), respectively.
The comprehensive process of the proposed VBMCC-CKF method is outlined below (Algorithm 2), integrating the alternating optimization framework and closed-form parameter updates derived above.
Algorithm 2: VBMCC-CKFInput:
- (1)Prediction:
a. Generate symmetric cubature points (Equation (2)) for state propagation (Equation (3)) and covariance matrix computation (Equation (4)).b. Obtain and through the dynamic evolution model defined in (32).c. Compute the joint predictive probability density function using Equation (35).
- (2)Update:
a. Propagate the predicted cubature points through the measurement function to generate measurement cubature points: , . Predict the measurement mean via Equation (5).b. Initialize with , , , and . Subsequently, execute the variational Bayesian iterative optimization as follows:(1) Compute the kernel size .(2) Calculate the measurement covariance matrix using Equation (43) .(3) Compute the cross-covariance matrix using Equation (44) .(4) Compute the Kalman gain matrix using Equation (42): .(5) Update the state estimate and covariance matrix via Equations (40) and (41): .(6) Update the kernel size parameters: , where .(7) Iteration termination criterion: If , return to Step (1) to continue iteration. Otherwise, set , , , and , where denotes the total number of iterations. Finally, compute the joint posterior probability density function and return .Output: .
4. Analysis of VBMCC-CKF
Section 3.3 rigorously derives the closed-form solution for the joint posterior distribution of the state and kernel size in VBMCC-CKF using variational Bayesian methods. The derived joint posterior retains an approximate factorization into Gaussian and inverse gamma distributions after iterative optimization. This section first supplements the mathematical validation with a focus on the closed-form kernel size update. Subsequently, anti-disturbance mechanism analyses are conducted to evaluate the robustness of the kernel size adaptation. Finally, the computational complexity of VBMCC-CKF is analyzed to assess its practical feasibility.
4.1. Mathematical Verification Using Kernel Size Closed-Form Update as an Example
Assuming the prior distribution of the kernel size as , the predictive distribution is obtained by the dynamic evolution model propagation in (32):
The measurement likelihood function is defined as follows:
According to Bayes’ theorem, the posterior distribution of the kernel size is proportional to the product of the likelihood and prior:
Substituting the specific forms of Equations (49) and (50) into Equation (51) yields the following:
The posterior distribution retains the inverse gamma form , thereby validating the correctness of the closed-form update mechanism.
4.2. Anti-Disturbance Mechanism Analysis
A steady state implies that the system state and parameters no longer undergo significant changes over time, meaning the parameters converge to either a fixed value or exhibit periodic variation. For kernel size parameters, steady state analysis requires determining the long-term behavior of the kernel scale parameters and whether the expected value converges with it. The kernel size parameters are dynamically updated through an adaptive adjustment mechanism governed by Equations (46) and (47) and the model in (32). Under the condition that the residual exhibits bounded expectation (indicating system stability), if the covariance of tends to stabilize, then the expected value of satisfies , where is the steady-state covariance matrix of . Furthermore, based on the model in (32) and the update formulas in (46) and (47) of and , the parameters and demonstrate a linear growth trend over time, asymptotically approaching stable values, which are denoted as and . Consequently, the expectation of the kernel size converges to a steady-state value , thereby ensuring the stability of the gain matrix. In scenarios where outliers cause abrupt increases in , rapidly increases, triggering an adaptive adjustment of to reduce the gain matrix and suppress outlier interference effectively. Subsequently, due to the effect of the decay factor in smoothing historical information, it avoids over-suppressing normal data and gradually returns to the steady state.
Steady-state analysis validates the long-term stability and reliability of VBMCC-CKF, ensuring robust estimation performance in diverse adversarial environments. Notably, compared to the MCC-CKF in Algorithm 1, the gain matrix is reformulated by integrating Equations (7), (20) and (22), as follows:
Similarly, by integrating Equations (42)–(44), the gain matrix of VBMCC-CKF is reformulated as follows:
A comparative analysis of and reveals that VBMCC-CKF and MCC-CKF are structurally equivalent. Both the reciprocal of the adjustment factor ( ) and the kernel size ( ) serve to modulate in response to disturbances. Nevertheless, regardless of whether the kernel size is selected based on default values, empirical rules, or adaptive adjustment, MCC-CKF exhibits limited efficacy in mitigating unknown disturbances of varying intensities. Consequently, VBMCC-CKF’s dynamic adaptation of in response to disturbance intensity confers superior robustness against unknown time-varying disturbances.
4.3. Computational Complexity
The proposed VBMCC-CKF integrates VB methods with CKF while adaptively adjusting the kernel size parameters. Compared to the conventional CKF, VBMCC-CKF incurs a modest increase in computational complexity. Therefore, a concise analysis of its computational complexity is conducted from both time and space complexity perspectives.
In terms of time complexity, the computational complexity of CKF is dominated by the calculation of the Kalman gain, with a time complexity of . VBMCC-CKF introduces variational Bayesian iteration on this basis, requiring multiple iterations in the update step to optimize the state estimation and kernel size parameters. During each iteration, the Kalman gain, related covariance matrices, and kernel size parameters are recalculated. While the scalar operations for updating kernel size parameters introduce minor computational overhead, their impact on the overall complexity is negligible. Consequently, the time complexity per iteration remains equivalent to the CKF update step. Given that the variational iterations typically converge within 2–3 cycles (denoted as ), the total time complexity of VBMCC-CKF is . Although the total computational load increases linearly with , its complexity is still of the same order as , and the overall time complexity is controllable.
In terms of space complexity, CKF mainly stores core data, such as cubature points, covariance matrices, and Kalman gain matrices, resulting in a space complexity of . For VBMCC-CKF, additional intermediate variables, such as iteratively updated state estimates and kernel size parameters, must be stored during variational iterations. However, since these variables share the same dimensionality as the core matrices in CKF, no high-dimensional storage overhead is introduced. Thus, the space complexity remains .
This comprehensive analysis demonstrates that VBMCC-CKF achieves significant improvements in robustness and estimation accuracy with only a modest computational overhead, rendering it highly advantageous for addressing challenges in non-Gaussian noise environments. While its time complexity exceeds that of the conventional CKF, the algorithm attains convergence within merely 2–3 iterations through efficient computational steps, thereby satisfying real-time operational requirements and proving well-suited for practical applications demanding real-time performance.
5. Simulation Setup and Performance Validation
This section evaluates the estimation performance of the proposed filtering algorithms under two representative scenarios: single-target tracking and multi-target tracking in non-Gaussian noise environments. All simulations were conducted on a hardware platform equipped with AMD Ryzen 7 7840H CPU and 32 GB RAM to ensure computational reproducibility. The bearing-range measurement models and bearing-only model emulate typical radar sensing principles. The noise parameters are configured to reflect real-world non-Gaussian disturbances.
5.1. Single-Target Tracking
5.1.1. Benchmark System
The single-target tracking model and sensor model, including the bearing and range, are established. Set a vector that represents the target position and velocity and measurement vector at time .
where , and .
The sampling period is , and the turn rate is . The process noise and measurement noise are and and , and . is heavy-tailed, , and , where and are given. Assuming that the initial state and its estimates are identical, , and the covariance matrix is .
5.1.2. Baselines and Metrics for Single-Target Tracking
The comparative methods employed in this simulation comprise five distinct Kalman filter variants: (1) standard CKF; (2) Mahalanobis distance-based adaptive robust CKF (MD-ARCKF); (3) MCC-CKF, utilizing a fixed kernel size (FKSMCC-CKF); (4) empirical MCC-CKF, incorporating experience-based kernel size selection (EKSMCC-CKF); and (5) the novel VBMCC-CKF featuring a kernel size adjustment mechanism based on the variational Bayesian framework. This systematic comparison framework enables comprehensive evaluation of both established techniques and our proposed kernel size adjustment strategy. The convergence threshold is established at to govern iteration termination. Within the VBMCC-CKF framework, the hyperparameters governing the kernel size are initialized to , representing the prior shape and scale factors, respectively. To ensure rigorous performance assessment, we implemented Monte Carlo simulations comprising independent realizations, with the root mean square error (RMSE) serving as the statistical evaluation metric. The RMSE formulation is mathematically expressed as
where denotes the coordinate of the target at discrete time step during the -th Monte Carlo realization ( ), while represents the corresponding posterior state estimate generated by the filtering algorithm. The average of RMSE (Avg-RMSE) serves as an additional evaluation metric to evaluate the performance over the entire tracking duration, where , and is the entire tracking duration.
Two complementary metrics are employed for the performance assessment: (1) the average iterations per time step, the reflecting convergence rate, and (2) the average CPU time, measured via Monte Carlo trials. The former quantifies optimization stability, while the latter benchmarks computational efficiency.
5.1.3. Results and Discussion for Single-Target Tracking
The proposed VBMCC-CKF demonstrates remarkable advantages in nonlinear target tracking scenarios. As illustrated in Figure 1, the estimated trajectories of VBMCC-CKF exhibit superior alignment with the true motion paths and enhanced smoothness compared to baseline methods such as FKSMCC-CKF, EKSMCC-CKF, MD-ARCKF, and CKF. Quantitative analyses in Figure 2 further validate this superiority: under strong non-Gaussian noise interference, CKF exhibits a peak RMSE fluctuation of 137.0961 m, indicating the worst estimation performance. In contrast, the MCC adopted by the other three methods can more accurately capture the heavy-tail characteristics of large Gaussian noise interference by integrating the higher-order moments of innovation, significantly improving the robustness of tracking estimation. However, the fixed kernel size of FKSMCC-CKF is difficult to adapt to the coupling effects of multi-dimensional measurement variables, resulting in the continuous growth of RMSE. Although the kernel size of EKSMCC-CKF can be adjusted dynamically, it is easy for the adjustment factor to be constant, and the anti-interference effect is limited. The proposed VBMCC-CKF in this paper achieves robust tracking with the lowest RMSE, consistently below 45 m. Specifically, as shown in Table 1, the Avg-RMSE of VBMCC-CKF (34.5919 m) is reduced by 50.87%, 31.65%, and 14.33% compared to FKSMCC-CKF (70.4105 m), EKSMCC-CKF (50.6090 m), and MD-ARCKF (40.3779 m), respectively. This performance enhancement stems from its innovative dynamic kernel size adaptation mechanism, which rapidly responds to varying noise intensities, and the dynamic balance of the kernel size in different dimensions, which realizes the collaborative suppression of noise in different dimensions.
In terms of computational efficiency, Table 1 demonstrates that the proposed VBMCC-CKF achieves an average single-step computation time of 0.0092 s, which aligns with the time complexity of baseline methods, thereby satisfying real-time processing requirements. Notably, while maintaining tracking precision, VBMCC-CKF exhibits an average number of iterations (2.13) comparable to FKSMCC-CKF (1.96) and EKSMCC-CKF (1.99). This efficiency is attributed to its convergence criterion, which terminates 86.43% of iterations within 2.3 steps, ensuring minimal computational overhead without compromising robustness.
Furthermore, a sensitivity analysis of two critical parameters—the initial kernel size ( ) and the decay factor ( )—was conducted. As depicted in Figure 3a, the Avg-RMSE exhibits a marginal variation of approximately 0.4 when is within the interval [1.5, 4.0], demonstrating that variations in within this range exert negligible influence on algorithmic performance, thereby underscoring the robustness of the proposed method. Based on this analysis, was ultimately set to 1.5 to optimize computational stability while maintaining high tracking accuracy. As shown in Figure 3b, the increase in the decay factor significantly reduces the average iteration count, with its impact exhibiting interval-dependent characteristics. Specifically, when , the average iteration count decreases rapidly (a reduction of 9%), whereas a smaller reduction (approximately 5%) is observed for . Correspondingly, although the Avg-RMSE decreases synchronously with the increase in , the overall change range is limited. Notably, within , the Avg-RMSE fluctuates only by approximately 0.4, indicating negligible influence on tracking accuracy. Therefore, selecting optimally balances computational efficiency and estimation precision, thereby maximizing algorithmic performance without compromising robustness.
The simulation results demonstrate that under strong non-Gaussian noise interference, the proposed VBMCC-CKF not only enhances tracking estimation accuracy and robustness but also maintains real-time processing capabilities. Furthermore, it provides a novel framework for state estimation in multi-dimensional nonlinear systems by dynamically optimizing kernel parameters to balance measurement credibility across diverse dimensions. This capability holds significant engineering value for applications such as radar systems and multi-sensor fusion architectures.
5.2. Multi-Target Tracking
5.2.1. Scenario Construction
In complex dynamic environments, multi-target tracking faces multiple challenges, including bearing-angle measurements, target maneuverability, and non-Gaussian noise characteristics. This simulation simulates the motion of seven targets within a 2000 m × 2000 m surveillance area, observed via a dual-sensor system. The targets follow a variable turn rate model, where in the original target model (55) is time-varying:
where denotes angular acceleration, the sampling interval is , and the augmented state is defined as . The state transition model is formulated as follows:
where the system noise components and follow independent t-distributions: and . Here, represent the degrees of freedom, with scale matrices defined as and , where and govern noise intensity, and is the identity matrix. The dual-sensor system employs a bearing-angle measurement mode. Let denote the coordinates of the -th sensor at time . The measurement equation is defined as follows:
where the measurement noise follows a mixture t-distribution: Under nominal conditions, , where , and . At an anomalous time , the system introduces outliers with equal probability, where the scale matrix switches between and .
The motion trajectories of two sensors, and , are configured to follow a constant-velocity curvilinear pattern, with their kinematic parameters defined as follows:
The system is configured with a target detection probability , and the simulation duration is set to 100 s, the clutter model employs a spatial Poisson process with a density of , corresponding to an average of two clutter points per scan.
To address the challenges of multi-target tracking in non-Gaussian noise environments, a validation framework is designed based on Random Finite Set (RFS) theory [40]. The framework constructs a bearing-only measurement scenario to rigorously evaluate complex conditions, such as stochastic target birth/death, clutter interference, and missed detections/false alarms. To circumvent the limitations of traditional data association methods, a multi-Bernoulli filtering framework [36] is employed, enabling unified target set estimation within a probabilistic space. In light of the multi-sensor configuration inherent to this application, a sequential filtering architecture is strategically adopted to ensure scalable sensor data integration and real-time processing feasibility. Three recursive algorithms are comparatively assessed: CKF, EKSMCC-CKF, and VBMCC-CKF. This design effectively isolates the impact of association uncertainties on algorithmic evaluation, highlighting the pivotal role of non-Gaussian noise mitigation mechanisms.
The birth target distributions for EKSMCC-CKF and VBMCC-CKF are defined as a four-component multi-Bernoulli process:
with the following parameter configurations:
(1)Existence probabilities: .(2)Mean vectors:
(3)Covariance matrix: .
The process noise is modeled as follows:
where the measurement noise is assumed to follow . To maintain baseline consistency across comparative simulations, CKF excludes the turn rate auxiliary variable , while all other parameters remain identical to those of EKSMCC-CKF and VBMCC-CKF. Specifically, the turn rate in the state transition matrix is fixed at rad/s, and the target survival probability is set to .
5.2.2. Multi-Target Tracking Evaluation Metrics
To quantitatively assess the comprehensive performance of multi-target tracking algorithms, the Optimal Subpattern Assignment (OSPA) distance is adopted as the core metric [36]. This metric provides a multi-dimensional evaluation by jointly quantifying target cardinality estimation errors and state estimation deviations. For a ground-truth target set and an estimated set , the OSPA distance is computed as follows:
where defines the truncated Euclidean distance (with a truncation distance of ), amplifies the penalty for large localization errors, and denotes the set of permutations of elements. The truncation mechanism distinguishes localization errors from missed detections/false alarms via cardinality mismatches ( term), offering a unified benchmark for tracking performance evaluation.
The average OSPA (Avg-OSPA) metric is utilized to quantify multi-target tracking performance over the full observation period, defined as , where represents the total tracking duration.
Furthermore, two supplementary metrics are introduced for multi-target tracking performance evaluation: the Hit Rate (HR), defined as the percentage of estimated positions deviating less than 40 m from their true positions, and the RMSE of tracked points (RMSE-TP), which quantifies localization accuracy exclusively for trajectory segments where tracking is sustained. These metrics collectively enhance the granularity of performance assessment by distinguishing between tracking continuity (via HRs) and precision in successfully maintained tracks (via RMSE-TP).
5.2.3. Results and Discussion for Multi-Target Tracking
The motion trajectories of multiple targets and sensor configurations are depicted in Figure 4, while Figure 5 presents representative simulation results for multi-target trajectory estimation under dynamic scenarios. In terms of estimation accuracy, CKF exhibits significant cumulative errors for Target 1, 2, and 3 due to its assumption of a fixed turn rate, which neglects target maneuverability.
EKSMCC-CKF reduces modeling errors by incorporating a time-varying turn rate auxiliary variable and improves tracking performance on the basis of empirical MCC, yet it suffers from target loss under anomalous noise interference. In contrast, by adaptively adjusting the kernel size via variational Bayesian methods in response to disturbance intensity, VBMCC-CKF achieves complete tracking of all targets.
Statistical results for target cardinality estimation robustness are shown in Figure 6. VBMCC-CKF demonstrates superior performance in false alarm control: during anomalous noise intervals, VBMCC-CKF achieves the lowest false alarm rate, whereas CKF exhibits the highest rate due to its susceptibility to accumulated estimation errors, which frequently induce erroneous judgments. Notably, CKF’s false alarm rate even exceeds that of EKSMCC-CKF. In terms of missed detection mitigation, VBMCC-CKF demonstrates a significantly lower mean error in target cardinality estimation compared to the other two methods. This superiority stems from its integration of higher-order innovation terms and an adaptive kernel size adjustment mechanism that dynamically suppresses outliers. Consequently, VBMCC-CKF rapidly converges to stable estimates when targets abruptly appear or disappear, resulting in the lowest missed detection rate. In contrast, CKF, which is highly sensitive to abrupt state transitions, accumulates substantial errors under rapid target population changes, leading to the highest missed detection rate.
The comprehensive performance is evaluated via the OSPA distance and the supplementary metrics of HRs and RMSE-TP, as illustrated in Figure 7 and Table 2. VBMCC-CKF demonstrates the smallest fluctuation in OSPA distance, consistently maintaining values below 33 m. Furthermore, it achieves an average OSPA (Avg-OSPA) of 11.897 m, representing a significant reduction of 40% compared to EKSMCC-CKF (19.8291 m) and 59.43% relative to CKF (29.3081 m). During the final maneuver phase, the OSPA distance for CKF surges due to a model mismatch, while VBMCC-CKF, leveraging its time-varying turn rate model and integrating higher-order innovation terms with adaptive kernel scaling, maintains higher estimation accuracy for both target states and cardinality, thus ensuring stable errors. The statistical data for HR and RMSE-TP in Table 2 demonstrate that VBMCC-CKF achieves an RMSE-TP of 11.897 m with 97.39% valid tracking frames and a minimal loss rate of 2.61%. Notably, VBMCC-CKF outperforms the other two methods by attaining the highest HR value, indicating its superior capability to maintain robust tracking under non-Gaussian noise disturbances. Furthermore, its RMSE-TP is the lowest among all compared methods, confirming significantly enhanced estimation accuracy during sustained target tracking. These results collectively validate VBMCC-CKF’s exceptional precision and adaptability in dynamic multi-target scenarios.
The performance discrepancies among the three methods originate from distinct core design philosophies. First, explicit modeling of time-varying turn rates via auxiliary variables ensures motion model completeness, thereby reducing turn rate estimation errors. Second, regarding abnormal disturbance suppression mechanisms, MCC suppresses the influence of measurement outliers through online adaptive kernel size adjustment. Under noise disturbances with a 10-fold covariance (abrupt changes), the state estimation variance is drastically reduced. Finally, in balancing computational efficiency, as shown in Table 2, VBMCC-CKF incurs a marginally higher per-step computational time (0.324 s) compared to EKSMCC-CKF (0.3064 s) due to the added variational Bayesian iterations for kernel size optimization. This demonstrates that robustness enhancements do not impose significant computational overhead.
6. Conclusions
In this paper, a novel VBMCC-CKF framework is proposed by integrating higher-order moment information from MCC with a variational Bayesian online alternating optimization framework. By adaptively adjusting the kernel size in response to disturbance intensity, the proposed method addresses the critical challenge of robust state estimation in dynamic systems under non-Gaussian noise environments. On the one hand, VBMCC-CKF enhances robustness against heavy-tailed noise and outliers by implicitly extracting higher-order statistical moments of innovation vectors through the Taylor expansion properties of the MCC Gaussian kernel, surpassing the limitations of traditional MMSE-based approaches. On the other hand, VBMCC-CKF models the kernel size as an inverse-gamma distributed random variable and leverages its conjugate relationship with Gaussian states to formulate a state-kernel size alternating optimization framework, which features a concise structure and rapid convergence within 2–3 iterations. This mechanism realizes the dynamic adjustment of the kernel size by minimizing the KL divergence and approximating the joint posterior distribution, which not only resolves the mismatch issue of the traditional MCC filter’s kernel size selection strategy under abrupt disturbance intensity changes but also incurs only a marginal increase in per-step computational overhead. Finally, the modular architecture of VBMCC-CKF enables seamless integration into multi-Bernoulli filtering frameworks, resolving the multi-target estimation problem in the unified probability space. Extensive validation across single-target and multi-target tracking scenarios highlights its generalized applicability. Thus, a theoretically rigorous adaptive kernel size optimization paradigm is established for non-Gaussian estimation by deeply integrating the variational Bayesian framework with MCC-CKF. Future work will explore autonomous kernel parameter optimization via deep learning to enhance adaptability in dynamic environments and will continue to expand investigations into conjugate distribution generalization, particularly exploring the modeling potential of Gamma and Beta distributions for kernel size adaptation and will then extend the novel filtering architectures based on variational inference to critical application domains, such as GNSS/INS integration.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Mallick M. Vo B.N. Kirubarajan T. Arulampalam S. Introduction to the Issue on Multitarget Tracking IEEE J. Sel. Top. Signal Process.2013737337510.1109/JSTSP.2013.2254034 · doi ↗
- 2Roth M. Özkan E. Gustafsson F. A Student’s t Filter for Heavy Tailed Process and Measurement Noise Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing Vancouver, BC, Canada 26–31 May 201357705774
- 3Zhang G. Zhang X. Zeng L. Dai S. Zhang M. Lian F. Filtering in Triplet Markov Chain Model in the Presence of Non-Gaussian Noise with Application to Target Tracking Remote Sens.202315554310.3390/rs 15235543 · doi ↗
- 4Wang H. Li H. Fang J. Wang H. Robust Gaussian Kalman Filter with Outlier Detection IEEE Signal Process. Lett.2018251236124010.1109/LSP.2018.2851156 · doi ↗
- 5Huang Y. Zhang Y. Li N. Wu Z. Chambers J.A. A Novel Robust Student’s t-Based Kalman Filter IEEE Trans. Aerosp. Electron. Syst.2017531545155410.1109/TAES.2017.2651684 · doi ↗
- 6Ribeiro M.I. Kalman and Extended Kalman Filters: Concept, Derivation and Properties Inst. Syst. Robot.20044337363741
- 7Kulikov G.Y. Kulikova M.V. The Accurate Continuous-Discrete Extended Kalman Filter for Radar Tracking IEEE Trans. Signal Process.20156494895810.1109/TSP.2015.2493985 · doi ↗
- 8Gao B. Yin H. Hu G. Zhong Y. Reliable Measurement Selection Mechanism-Based Tightly Coupled Inertial/Bionic Polarization Integration with Position Correction Aerosp. Sci. Technol.202415410952310.1016/j.ast.2024.109523 · doi ↗
