Adaptive Multi-Sensor Fusion Localization with Eigenvalue-Based Degradation Detection for Mobile Robots
Weizu Huang, Long Xiang, Ruohao Chen, Sheng Xu, Qing Wang

TL;DR
This paper introduces a robust localization system for mobile robots by combining multiple sensors and detecting sensor degradation in real time.
Contribution
The novel method uses eigenvalue-based degradation detection and smooth sensor fusion to improve localization accuracy and robustness.
Findings
The proposed system achieves stable centimeter-level localization accuracy.
It outperforms pure LiDAR-based approaches in terms of robustness and real-time performance.
The method effectively suppresses long-term drift using loop detection and factor graph optimization.
Abstract
Autonomous mobile robots require robust localization in complex and dynamic environments, where single-sensor solutions often fail due to accumulated drift or signal degradation. LiDAR–inertial odometry provides accurate short-term motion estimation, but suffers from long-term error accumulation, whereas RTK-GNSS offers absolute positioning that becomes unreliable under occlusion or multipath effects. To solve the above problems, this paper proposes an adaptive multi-sensor fusion positioning framework that dynamically fuses LiDAR, IMU, and RTK-GNSS data based on the real-time quality evaluation of sensors. The system uses the front-end tightly coupled LiDAR–IMU iterative extension Kalman filter (IEKF) as the core estimator and combines loop detection with incremental factor graph optimization to suppress long-term drift. In addition, a degradation detection method based on the minimum…
Click any figure to enlarge with its caption.
Figure 1
Figure 2
Figure 3
Figure 4
Figure 5
Figure 6
Figure 7
Figure 8
Figure 9- —National Natural Science Foundation of China
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobotics and Sensor-Based Localization · Inertial Sensor and Navigation · Indoor and Outdoor Localization Technologies
1. Introduction
LiDAR-based SLAM has become a mainstream solution for mobile robot localization and mapping [1], with tightly coupled LiDAR–inertial odometry frameworks achieving notable progress in recent years [2]. FAST-LIO2 [3], for example, achieves real-time operation by employing an iterative extended Kalman filter that directly registers points to the map, whereas Point-LIO [4] distinguishes itself by optimizing computational efficiency via a planar feature representation scheme. FAST-LIVO2 [5] leverages photometric alignment to integrate visual data for superior robustness in texture-rich environments, an enhancement in local estimation that nevertheless remains constrained by the fundamental lack of absolute position references in odometry systems and thus accumulates inevitable drift over long-term operation [6].
GNSS-based positioning, particularly RTK, provides centimeter-level absolute accuracy in open environments [7]. Tightly coupled GNSS–vision–inertial systems have demonstrated improved robustness under short-term signal interruptions [8]. However, the reliability of RTK degrades when satellite signals are blocked or affected by multipath effects. In urban canyons, dense vegetation, and underground spaces, satellite visibility is often limited, causing positioning accuracy to deteriorate from centimeter-level to meter-level estimates [9].
Therefore, researchers generally adopt the method of multi-sensor fusion to improve system robustness by using the complementary characteristics between different sensors. However, many existing methods default to fixed sensor reliability, using static weights for fusion. This assumption often does not hold in real-world contexts: for example, in a scenario with sparse geometry, the LiDAR constraint is significantly weaker; when the GNSS signal is blocked, the uncertainty of the RTK rises sharply.
Multi-sensor fusion is usually used to improve the stability of positioning systems, and its core idea is to take advantage of the complementarity of different sensors in perception characteristics [10]. Around this goal, the research field proposes a variety of convergence frameworks. For example, LIO-SAM [11] maintains overall consistency by introducing GPS constraints directly into factor graph optimization; LIO-GC [12] introduces adaptive ground constraints to solve the altitude drift, which is prone to occur in rough terrain. D-LIO [13] simplifies the system process by starting from the odometer structure itself so that it no longer relies on explicit feature selection. Although implementations vary, most approaches are designed with sensor reliability as constant, so a fixed weight allocation strategy is employed. However, this assumption rarely holds true in real-world environments, such as a significant decrease in LiDAR information in scenarios with sparse geometry or a significant increase in RTK uncertainty under signal occlusion conditions, which can lead to changes in sensor performance over time. Consequently, static weighting strategies struggle to cope with the dynamic degradation of sensor quality [14].
In order to solve the above problems, this paper proposes an adaptive multi-sensor fusion positioning method with degradation perception ability, which can dynamically adjust the fusion strategy according to the sensor state so as to maintain stable and high-precision positioning performance in complex environments.
2. Methodology
2.1. System Architecture and Sensor Configuration
The system architecture is shown in Figure 1, and the overall hierarchical organization is adopted to improve the efficiency of data transmission between different types of sensors. The system divides the odometer calculation of high-frequency operation from the global correction process performed at low frequency, so that the optimization module with a large amount of calculation will not affect the stable operation of real-time state estimation. In this framework, the system incorporates three sensor modes:
LiDAR: We employ a Livox Mid-360 hybrid solid-state LiDAR (Livox Technology, Shenzhen, China). It scans 360° horizontally and ±15° vertically at 10 Hz. Thanks to its non-repetitive scanning pattern, it achieves a ranging accuracy of 1 cm with a maximum range of 150 m. Compared to mechanical spinning LiDARs, this solid-state design offers better reliability [15].
IMU: An ICM40609 six-axis MEMS IMU (TDK InvenSense, San Jose, CA, USA) provides angular velocity and linear acceleration data at 200 Hz. The angle random walk is 0.02°/ , and the velocity random walk is m/s/ . This high-frequency data is crucial for state prediction and compensating for motion distortion in the point cloud.
RTK-GNSS: Our dual-antenna RTK module tracks BeiDou, GPS, and GLONASS satellites, updating positions at 10 Hz. In open areas with good satellite coverage, it achieves a horizontal accuracy of 1 cm + 1 ppm using carrier-phase ambiguity resolution.
To synchronize time, we use the RTK’s 1PPS (Pulse Per Second) signal as the master clock, keeping synchronization errors under 5 μs. All algorithms run on an NVIDIA Jetson Orin NX (NVIDIA Corporation, Santa Clara, CA, USA) embedded computer with Ubuntu 20.04 and ROS 2 Galactic (Open Robotics, Mountain View, CA, USA).
Sensor Mounting and Calibration: All sensor modules are mounted at the center of the robot platform. The Livox Mid-360 LiDAR has an integrated IMU sensor within its housing. For RTK-GNSS positioning, we employ a dual-antenna configuration with antennas mounted at the front and rear of the robot (baseline approximately 40 cm) to provide heading information. The GNSS receiver module is co-located with the Jetson computing unit in a central electronics enclosure. The extrinsic calibration between LiDAR and IMU was performed using the method proposed in [3], achieving a rotation error of <0.3° and translation error of <5 mm. The transformation from the LiDAR-IMU frame to the GNSS antenna baseline center was measured mechanically with an accuracy of mm. The gravity vector alignment was initialized using static IMU measurements over 30 s, yielding a roll/pitch accuracy of <0.1°.
2.2. State Definition and Problem Formulation
We represent the robot state vector as
where denotes position expressed in the world frame (ENU convention), represents linear velocity, encodes orientation as a rotation matrix, and are gyroscope and accelerometer bias terms requiring online estimation.
The characteristics of all datasets used in the experiments are summarized in Table 1. Our design objectives for the final fused positioning output (after integrating LiDAR-inertial odometry with RTK-GNSS) encompass: (1) positioning RMSE below 5 cm over multi-kilometer trajectories; (2) degradation detection response within 100 ms; (3) positioning availability exceeding 95% across diverse environments; and (4) real-time operation with end-to-end latency below 100 ms. Note that the odometry-only results presented in Table 2 and Table 3 represent intermediate outputs before fusion, serving as baselines to demonstrate the effectiveness of our adaptive fusion strategy.
2.3. IEKF-Based LiDAR-Inertial Odometry
Our front-end odometer module is based on the IEKF framework [3] proposed in existing work, which has been adopted for its good performance in terms of computational efficiency and robustness. The IEKF operates on manifolds to properly handle the non-Euclidean structure of rotation representations.
Prediction Step: Upon receiving each IMU measurement (where and denote the measured angular velocity and linear acceleration), we propagate the state estimate forward in time:
where ⊞ denotes the manifold addition operator (a generalization of standard addition for differentiable manifolds, ensuring the result remains on the manifold), and is the continuous-time state evolution function derived from IMU kinematics. The corresponding covariance propagation follows:
where is the state transition Jacobian matrix, is the noise Jacobian matrix, and is the IMU noise covariance matrix.
Motion Compensation: During each LiDAR scan acquisition (about 100 ms), vehicle motion causes distortion in the point cloud. To correct this, we compensate the original point (measured at time ) to the reference coordinate system at the scan end time :
where denotes the compensated point, is the rotation from IMU frame at time to time , and represent the extrinsic calibration (rotation and translation) between the LiDAR and IMU frames, and is the relative position between the two IMU poses.
Update Step (Iterative Correction): For each motion-compensated point, we establish correspondence to the local plane structure in the accumulated map. The point-to-plane residual is constructed as
where is the unit normal vector of the corresponding plane in the world frame, and denote the predicted IMU orientation and position at time , and is the closest point on the corresponding plane.
The iterative state correction follows maximum a posteriori (MAP) estimation principles:
where represents the state correction on the tangent space, m is the total number of point-to-plane correspondences, is the observation Jacobian (computed by linearizing Equation (5) with respect to the state), and is the measurement noise covariance for the j-th correspondence. This minimization is solved iteratively until convergence, typically requiring 2–4 iterations.
2.4. Loop Closure Detection and Factor Graph Optimization
Laser-inertial odometers inevitably produce cumulative drift over long runs and long trajectories. To this end, the Stable Triangle Descriptor (STD) [16] is used for loop detection, which has the main advantage of strong invariance to geometric changes.
The STD method first extracts the key points based on the planar voxels, calculates the local covariance matrix, and selects the point with the largest fitting plane distance as the candidate key point. Subsequently, a triangle descriptor is constructed from key points adjacent to spatial locations:
where , , and denote the Euclidean distances between the triangular feature point pairs (1, 2), (2, 3), and (1, 3) respectively; , , and are the unit normal vectors of the local planes corresponding to the three triangular feature points; and represents the dot product of the normal vectors (with value range ), which characterizes the angular relationship between the planes.
Factor Graph Optimization: We implement backend optimization using GTSAM [17], incorporating three distinct factor types:
- IMU Pre-integration Factor: Constrains consecutive keyframes through pre-integrated inertial measurements accumulated between poses.
- LiDAR Odometry Factor: Encodes relative pose estimates from scan matching with associated covariance uncertainty.
- Loop Closure Factor: Connects revisited locations with STD-verified geometric transformations.
The resulting optimization problem,
is solved incrementally using the iSAM2 algorithm, enabling efficient updates as new factors arrive.
2.5. Eigenvalue-Based Degradation Detection
The quality of point cloud registration is inherently dependent on the strength of the constraints, and this property can be directly reflected in the conditionality of the Jacobian matrix. Based on this observation, this paper proposes a theoretical method for detecting degradation.
Theoretical basis: Considering the registration problem after linearization, it can be expressed as
Introducing an artificial constraint perturbs the solution by approximately
where represents the minimum eigenvalue. When is small, it means that the problem conditions are poor, and the solution results are highly sensitive to the measurement disturbance, and the stability of the solution is significantly reduced.
Dynamic threshold adaptive: Given the expected accuracy tolerance of , the corresponding adaptive threshold can be derived:
where is an empirically determined parameter that characterizes motion characteristics. This range was established through extensive testing across various motion patterns and environments, balancing between sensitivity to degradation and robustness against false positives.
2.6. Adaptive Fusion Strategy
The final localization estimate fuses IEKF odometry with RTK measurements through covariance-weighted integration.
Covariance Estimation: For IEKF odometry output, we estimate covariance from the registration residuals:
For RTK measurements, uncertainty derives from the horizontal dilution of precision:
Minimum Variance Fusion: The optimal combined estimate is as follows:
Mode-Based Adaptation: Based on the degradation index :
- Normal ( ): Weight ratio 0.7:0.3.
- Mild Degradation ( ): Adjust ratio to 0.4:0.6.
- Severe Degradation ( ): RTK-dominant mode or trigger relocalization.
Weight transitions employ exponential moving average filtering with a time constant ms to prevent positional discontinuities.
2.7. Global Relocalization
When both LiDAR severely degrades and RTK becomes unavailable, we employ hierarchical 2D–3D relocalization:
Stage 1—2D Coarse Localization: Project accumulated 3D point clouds to 2D occupancy representation, then apply correlative scan matching with branch-and-bound search acceleration.
Stage 2—3D Fine Alignment: Execute ICP refinement initialized by the coarse 2D pose estimate.
Stage 3—Validation: Accept candidate poses exhibiting ICP fitness scores exceeding a 0.6 threshold.
3. Results
3.1. Experimental Setup
Datasets: We conduct comprehensive experiments on two dataset categories:
- 1.KITTI Odometry Dataset [18]: Sequences 00, 05, 07, 09 featuring loop closures and ground truth trajectories, covering 65 km total distance across urban, residential, and highway scenarios.
- 2.Custom Field Datasets: Two sequences collected at Southeast University Suzhou campus: Campus (2.3 km), traversing mixed environments, including 5–8 story buildings and vegetation; and Outdoor (2.8 km), featuring dense tree canopy causing intermittent RTK signal outages. The satellite map projections of the custom dataset trajectories are shown in Figure 2.
Baseline Methods: A-LOAM [19], LIO-SAM [11], FAST-LIO2 [3], FASTER-LIO [20].
Hardware: NVIDIA Jetson Orin NX. The experimental robot platform with sensor configuration is shown in Figure 3.
3.2. Odometry Accuracy on KITTI Benchmarks
The performance gains observed in Figure 4 stem primarily from the interaction between the front-end and back-end modules. The IEKF tight coupling provides the necessary short-term accuracy, maintaining precise uncertainty propagation for the subsequent fusion steps. Moreover, the STD loop detection proved more effective than intensity-based methods in outdoor settings, boosting recall rates to 94% and supplying robust geometric constraints. The GTSAM backend then utilizes these constraints to distribute accumulated errors across the trajectory, correcting the global path.
3.3. Performance on Custom Datasets
Figure 5 presents the absolute pose error (APE) over time on both custom datasets, demonstrating that the proposed method maintains consistently low error levels throughout the trajectory. Figure 6 illustrates the point cloud map before and after loop closure optimization, confirming the effectiveness of the STD-based loop detection in eliminating accumulated drift.
3.4. Mapping Quality Assessment
Figure 7 and Figure 8 compare the point cloud mapping quality and vertical drift characteristics of the proposed method against baseline approaches on the campus dataset.
3.5. Multi-Sensor Fusion Performance
The fusion performance under three operating modes is summarized in Table 4 and Table 5. A deviation of 0.7–1.4 cm exists between experimental and theoretical results, attributable to time synchronization errors, calibration inaccuracies, and environmental factors.
3.6. Degradation Detection Evaluation
Table 6 and Figure 9 present the degradation detection performance and the corresponding real-time eigenvalue monitoring results, respectively.
3.7. Global Relocalization Performance
Table 7 presents the measured relocalization performance metrics against the design goals across multiple test scenarios.
3.8. System Latency Analysis
Table 8 provides a per-module breakdown of computational latency and CPU usage measured on the NVIDIA Jetson Orin NX embedded platform, confirming that the total end-to-end latency meets the real-time design requirement of below 100 ms.
4. Discussion
4.1. Key Findings
The experimental evaluation results show that the proposed system mainly reflects the following three characteristics: Firstly, the tightly coupled fusion strategy shows obvious performance advantages compared with the loose coupling scheme. By introducing more rigorous uncertainty modeling under the IEKF framework, including IMU pre-integration and online bias estimation, the system achieves significant improvements in odometer drift control, reducing drift error by approximately 50% compared to the FAST-LIO2 baseline method.
In addition to odometer accuracy, experimental results show that loop closure optimization plays a key role in overall performance even when absolute RTK constraints are introduced. The test found that the STD descriptor based on geometric information is more robust to lighting changes in outdoor scenes and is more conducive to maintaining the consistency of global trajectories than methods that rely on intensity information. In terms of system robustness, the proposed eigenvalue-based degradation detection method can accurately identify all degradation events in the dataset, with a false positive rate of less than 5%. At the same time, the combination of the detection mechanism and the gradual weight adjustment strategy enables the system to smoothly adapt to environmental changes, effectively inhibits the positioning mutation, and avoids the adverse effects on the control stability of the robot.
4.2. Comparison with Related Work
The proposed framework is clearly different from existing advanced methods in terms of adaptive capabilities and theoretical foundations. Unlike traditional methods such as LIO-SAM [11] that rely on fixed GPS factors, the proposed method does not perform fusion based on empirical rules, but dynamically adjusts the fusion weights based on the results of degradation analysis, and combines it with a closer iterative optimization process to achieve a localization error reduction of about 56% on the custom dataset.
In addition, the system offers significant advantages in terms of deployability. Different from learning methods that require a large amount of data collection and training for specific environments [21,22], the analytical framework proposed in this article does not rely on any prior training data and can be directly applied to a variety of heterogeneous scenarios. At the same time, the degradation evaluation indicators used by the system have clear mathematical explanations, which are helpful for problem location and system debugging and provide strong support for the strict requirements of autonomous driving systems in safety verification and engineering certification.
4.3. Ablation Study
Table 9 presents the ablation study results, demonstrating the individual contribution of each system component to overall localization performance.
4.4. Limitations
Although the experimental results verify the robustness of the system in various scenarios, there are still some limiting factors that need further discussion in practical applications. First, the system has a temporal validity dependence on the prior map used for relocation. Field tests show that when the map data update cycle exceeds three months, the relocation failure rate increases to about 5%. The main reason is that structural changes in the environment interfere with the matching process.
Second, computational scalability remains a concern in long-running tasks. Since the factor graph scale grows linearly with the trajectory length, the current system architecture can still maintain high efficiency within an operating range of about 10 km. However, at longer distances, it is necessary to introduce strategies such as sliding window optimization or map segmentation to control computational complexity. In terms of sensor fusion, the system can effectively mitigate the impact of short-term GNSS signal interruptions, but when the signal is lost for more than five minutes, the accumulated drift will eventually exceed the allowable error range of the design. In addition, existing frameworks model based on static environment assumptions, which may produce incorrect loop closure candidates in highly dynamic scenarios. This problem will be the focus of subsequent work and will be solved by introducing a dynamic target filtering mechanism.
5. Conclusions
This paper proposes and verifies an adaptive multi-sensor fusion framework and systematically designs the common reliability bottleneck problems of mobile robots in indoor and outdoor mixed environments. By combining an eigenvalue degradation detection method with a clear theoretical basis and a dynamic covariance weighted fusion strategy, this system can effectively alleviate the inevitable cumulative drift problem in pure LiDAR methods while reducing positioning instability caused by intermittent RTK signal loss. Experimental results on the KITTI benchmark data set covering 65 km and self-collected field data of 4.8 km show that the proposed IEKF-based tight coupling scheme and back-end optimization strategy can reduce the drift error by about 50% compared to the baseline method. More importantly, even under complex working conditions such as sensor performance degradation, the system can still maintain centimeter-level positioning accuracy and real-time operation performance on the embedded hardware platform.
In future work, we will further expand the research content from three aspects: First, to address the problem of limited timeliness of pre-built maps, an online map update mechanism will be introduced to support the long-term autonomous operation of the system; second, by integrating more advanced dynamic target filtering methods, the system’s robustness will be improved in scenarios with dense personnel and frequent environmental changes; finally, it is planned to integrate alternative absolute positioning methods such as UWB and visual landmarks to enhance the system’s reliable operation capabilities under long-term GNSS unavailability conditions.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Chen K. Nemiroff R. Lopez B.T. Direct Li DAR-Inertial Odometry: Lightweight LIO with Continuous-Time Motion Correction Proceedings of the 2023 IEEE International Conference on Robotics and Automation (ICRA)London, UK 29 May–2 June 202339833989
- 2Wen W. Bai X. Kan Y.C. Hsu L.T. Tightly Coupled GNSS/INS Integration via Factor Graph and Aided by Fish-Eye Camera IEEE Trans. Veh. Technol.201968106511066210.1109/TVT.2019.2944680 · doi ↗
- 3Xu W. Cai Y. He D. Lin J. Zhang F. FAST-LIO 2: Fast Direct Li DAR-Inertial Odometry IEEE Trans. Robot.2022382053207310.1109/TRO.2022.3141876 · doi ↗
- 4He D. Xu W. Chen N. Kong F. Yuan C. Zhang F. Point-LIO: Robust High-Bandwidth Light Detection and Ranging Inertial Odometry Adv. Intell. Syst.20235220045910.1002/aisy.202200459 · doi ↗
- 5Zheng C. Xu W. Zou Z. Hua T. Yuan C. He D. Zhou B. Liu Z. Lin J. Zhu F. FAST-LIVO 2: Fast, Direct Li DAR-Inertial-Visual Odometry IEEE Trans. Robot.20254132634610.1109/TRO.2024.3502198 · doi ↗
- 6Zhang J. Kaess M. Singh S. On Degeneracy of Optimization-Based State Estimation Problems Proceedings of the 2016 IEEE International Conference on Robotics and Automation (ICRA)Stockholm, Sweden 16–21 May 2016809816
- 7Li Z. Huang J. GPS Surveying and Data Processing Wuhan University Press Wuhan, China 2016
- 8Cao S. Lu X. Shen S. GVINS: Tightly Coupled GNSS-Visual-Inertial Fusion for Smooth and Consistent State Estimation IEEE Trans. Robot.2022382004202110.1109/TRO.2021.3133730 · doi ↗
