Adaptive Kalman Filter-Based SLAM in LiDAR-Degenerated Environments
Ran Ma, Tao Zhou, Liang Chen

TL;DR
This paper introduces a new SLAM method using adaptive Kalman filters and particle filters to improve robot positioning accuracy in environments where LiDAR data is unreliable.
Contribution
The novel SLAM method combines adaptive Kalman filtering with particle filter optimization to enhance robustness in LiDAR-degenerated environments.
Findings
The proposed method improved positioning precision by 61.3–97.9% compared to Karto SLAM.
Positioning accuracy was enhanced by 35.7–99.0% compared to Hector SLAM.
The method achieved 43.8–93.0% better performance than Cartographer.
Abstract
Owing to the low cost, small size, and convenience for installation, 2D LiDAR has been widely used in mobile robots for simultaneous positioning and mapping (SLAM). However, traditional 2D LiDAR SLAM methods have low robustness and accuracy in LiDAR-degenerated environments. To improve the robustness of the SLAM method in such environments, an innovative SLAM method is developed, which mainly includes two parts, i.e., the front-end positioning and the back-end optimization. Specifically, in the front-end part, the AKF (adaptive Kalman filter) method is applied to estimate the pose of the mobile robot, zero bias of acceleration and gyroscope, lever arm length, and the mounting angle. The adaptive factor of the AKF can dynamically adjust the variance of the process and measurement noises based on the residual. In the back-end part, a particle filter (PF) is employed to optimize the pose…
Click any figure to enlarge with its caption.
Figure 1
Figure 2
Figure 3
Figure 4
Figure 5
Figure 6
Figure 7
Figure 8
Figure 9- —Coal-Major Project
- —Major Science and Technology Project of Jiangsu Province
- —Key Research and Development Program of Hubei Province
- —Guangdong Science and Technology Department
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobotics and Sensor-Based Localization · Inertial Sensor and Navigation · Advanced Vision and Imaging
1. Introduction
Simultaneous localization and mapping (SLAM) technology was first proposed by Smith in 1986 [1]. There is an extensive range of applications of SLAM technology, especially in robotics [2,3]. The SLAM methods mainly include visual-SLAM and LiDAR-SLAM. Conventional vision-based approaches often rely on RGB or depth imagery; they can be susceptible to varying lighting conditions, occlusions, and limited depth resolution. Recent advancements have explored the integration of three-dimensional data representation and deep learning to overcome these constraints. Notably, Zhang et al. [4] proposed a methodology to combine the 3D graph deep learning and laser point cloud for intelligent rehabilitation. By fusing laser point cloud data with 3D graph convolutional networks, this method achieves robust spatial feature extraction and segmentation accuracy in complex environments. However, as noted in comparative studies, while such visual-based methods are cost-effective and rich in texture information, they fundamentally depend on ambient lighting conditions and distinct environmental features. They are susceptible to challenges such as photometric calibration errors and rolling shutter effects, and can experience significant performance degradation or even tracking failure in low-light, low-texture, or highly dynamic scenes [5,6,7,8]. In contrast, LiDAR-based SLAM provides direct, high-precision geometric measurements of the environment, which are largely invariant to lighting changes and yield accurate metric maps, though they may lack semantic richness and face challenges in feature-sparse environments.
The LiDAR sensors are classified into 2D and 3D LiDARs. Two-dimensional LiDAR captures data on a signal plane, making it ideal for autonomous mobile robots (AMRs). The application benefits from 2D LiDAR’s lower cost, reduced complexity, and lower data volume, thus simplifying data processing and increasing response times for basic navigation. Three-dimensional LiDAR captures data in three dimensions, providing a comprehensive view essential for applications such as autonomous vehicles. Three-dimensional LiDAR generates millions of data points per second, offering highly detailed environmental mapping crucial for precise navigation and complex tasks. The huge data volume of 3D LiDAR means complex data processing and slower response times for navigation. Thus, compared with 3D LiDAR, 2D LiDAR has been widely used on the AMRs for position and navigation.
The main existing methods for LiDAR SLAM include LiDAR-only SLAM and multi-sensor fusion SLAM. LiDAR-only SLAM methods include direct matching methods and feature matching methods. Direct matching methods directly utilize LiDAR scan points for pose estimation, and direct matching methods include matching based on geometric models and maps. For the matching methods based on geometric models, iterative closest point (ICP) and its improvements are the most popular methods [9]. The improved aspects of the ICP’s improvements include subset sampling (e.g., voxel grid filtering [10], histogram sampling, etc.), distance metrics (e.g., point-to-line [11], etc.), and computational efficiency (e.g., Fast ICP [12], Anderson Acceleration ICP [13], etc.). Although the improvements in the ICP make it faster and more accurate than the original algorithm, these methods still have a high dependence on the initial value, which makes it easy to fall into local extrema. The methods based on matching maps match the LiDAR scan points with the grid map or the submaps. The grid map and submap are constructed based on LiDAR scan points. Popular methods based on matching maps have included the Gmapping method [14] and the Cartographer method [15] until now. The map offers richer structural information than sparse individual scans. Thus, compared with the methods based on matching LiDAR scan points, methods based on matching maps have a higher matching accuracy. However, there is still the issue of it being easy to fall into the local extrema if the features of the environment are similar or the initial value is not accurate enough. Feature-based methods utilize the extracted fundamental geometric properties, such as lines, corners, curves, etc., to match and reduce the time consumed. The LiDAR odometry and mapping (LOAM) algorithm [16] is a typical LiDAR SLAM framework that extracts 3D planar and edge points. The optimization metric of the LOAM is to minimize the distance between point-to-line and point-to-plane. However, the feature-based methods also have a high dependence on the initial value, and the feature extraction also influences the optimization results. The introduction of the multi-sensor fusion SLAM is to improve the accuracy and reliability of the initial value and reduce the probability of falling into local extrema. The mainstream multi-sensor fusion SLAM methods are LiDAR-inertial methods. The LiDAR-inertial fusion methods include inertial measurement unit (IMU) odometry and IMU-wheel odometry-aided methods. IMU-aided SLAM methods utilize the IMU measurements to correct the non-linear motion distortions of the LiDAR scans and use the IMU pre-integration results as the initialization of the scan matching to improve the robustness [17]. However, there are still some problems with IMU-aided SLAM methods in the mobile robots’ application. The mobile robots move at slow speeds with frequent starts and stops. Under such conditions, an accurate zero velocity update (ZUPT) is necessary for the IMU odometer, but it is difficult to judge the zero velocity status and estimate linear velocity accurately only based on a single IMU [18]. Thus, it still has low robustness when the IMU-aided SLAM methods work in LiDAR-degenerated environments, such as corridor environments. To solve this problem, the wheel odometer is introduced into the IMU-aided SLAM methods. For example, G.P.C. Júnior et al. [19] proposed the EKF-LOAM (extended Kalman filter-LiDAR odometry and mapping) method to solve the issues that the traditional LiDAR SLAM algorithms have difficulty mapping in the common industrial confined spaces, such as ducts and galleries, which have long and homogeneous structures. The experiments show that the EKF-LOAM method improves the positioning accuracy by more than 50% compared with the original LeGO-LOAM (lightWeight and ground optimized-LOAM) algorithm. However, this work utilized the 3D LiDAR in the proposed methods, and it did not explore the multi-sensor fusion methods based on 2D LiDAR. Thus, we proposed a method fusing IMU, wheel odometer, and 2D LiDAR to improve the robustness of 2D LiDAR SLAM in LiDAR-degenerated environments and address the challenge of high dependence on external environmental features for LiDAR SLAM methods. The main contributions of this work are as follows:
- A new adaptive Kalman filter (AKF) method is proposed to fuse IMU and the wheel odometer, which can estimate IMU’s acceleration and gyroscope zero biases, the mounting angle, and the lever arm length. The adaptive factor of the AKF can dynamically adjust the variance of the process noise and measurement noise based on the residual.
- In the back-end, the pose from AKF is introduced as constraints into the particle filter (PF) to overcome the mismatch, which commonly occurs in scan-map matching, especially under LiDAR-degenerated environments.
- The field tests show that the proposed method can provide a reliable and robust positioning and mapping service in LiDAR-degenerated environments, compared with the traditional 2D LiDAR SLAM methods (Karto SLAM, Hector SLAM, and Cartographer).
The organization of the paper is as follows: Section 2 mainly introduces the related works. Section 3 mainly introduces the system framework of the proposed method. Section 4 discusses the context of the adaptive Kalman filter, and Section 5 discusses the context of the back-end SLAM part. Section 6 introduces experimental results, and Section 7 discusses the results of the experiments. Section 8 concludes the paper and provides an outlook on future research directions.
2. Related Works
The prior works on LiDAR SLAM in LiDAR-degenerated environments are extensive. The works mainly include multi-sensor fused SLAM and utilizing artificial intelligence (AI) SLAM for point cloud registration or multi-sensor fusion. In this section, we briefly reviewed the work on these two aspects.
2.1. Multi-Sensors Fusion SLAM
The current LiDAR SLAM methods have proven to be accurate and robust enough in many environments, and here we focus on the performance of the LiDAR SLAM method in LiDAR-degenerated environments. Regarding these environments, researchers have found that adding auxiliary sensors or assisting geometric landmarks can improve the accuracy and robustness of LiDAR SLAM methods in LiDAR-degenerated environments.
Adding auxiliary sensors means introducing additional constraints. Cameras have proven to be good auxiliary sensors in some LiDAR degenerated environments or feature-sparse environments [20,21] but are still not accurate enough in LiDAR-degenerated environments, and the methods fusing cameras need more time than the traditional LiDAR SLAM methods. Ultra-wideband (UWB) technology is an environment-insensitive sensor, and the positioning error of the UWB does not accumulate with the increase in mileage. Thus, LiDAR SLAM fusing the UWB has attracted more and more interest in recent years [22]. However, for the LiDAR SLAM methods fusing the UWB, it is necessary to place the UWB stations in advance, and it needs large enough scenarios to build a good enough geometric configuration for the positioning methods based on UWB. LiDAR SLAM methods fusing inertial sensors are another mainstream method that can improve the accuracy and robustness of the LiDAR SLAM methods in LiDAR-degenerated environments. Qing et al. [23] utilized IMU to aid laser scan matching and used the positioning results of the IMU-aided scan matching methods to transform the laser scan, finally, matching the laser scan with the orthogonal weighted occupancy likelihood map to obtain the final positioning results. This method has a high accuracy in the library, but it does not discuss the performance of the proposed method in LiDAR-degenerated environments, such as corridors. Chen et al. [24] utilized factor graph optimization to fuse the IMU and wheel odometer and used the positioning results to transform the laser scan to map coordinate systems. Reference [24] also utilized the visual landmarks to improve the accuracy. This method has high accuracy in LiDAR-degenerated environments but is highly dependent on visual landmarks.
Beyond these specific implementations, the factor graph optimization framework has become the standard method for achieving tightly-coupled, high-precision multi-sensor fusion in modern SLAM systems [25,26]. This framework elegantly unifies heterogeneous measurements—including IMU pre-integration, LiDAR odometry, loop closures, and absolute measurements from sources like GPS or landmarks as probabilistic constraints within a unified graph. Representative works, such as LIO-SAM [25] and reference [26], demonstrate the strength of this approach by deeply integrating LiDAR features with IMU data in a tightly-coupled manner, achieving state-of-the-art accuracy and robustness. The primary advantage of factor graph-based fusion lies in its flexibility and ability to perform global batch or sliding-window optimization, which optimally balances information from all sensors over time. However, this comes at the cost of growing computational complexity with the trajectory length, posing challenges for resource-constrained platforms or lifelong operation, where real-time performance is critical.
2.2. AI-Based SLAM
With the development of AI, using artificial intelligence models or data-driven approaches to solve the LiDAR-based SLAM problem has become a new trend. For AI-based SLAM, C. Li et al. [27] utilized the recurrent convolutional neural network (RCNN) to fuse IMU and 2D LiDAR and used an ICP-based scan-to-submap method to optimize the pose. This method has a higher accuracy compared with the traditional Hector SLAM. Nicolai et al. [28] utilized convolution neural networks (CNNs) to reduce the state space of laser scans and address the challenge of real-time positioning in large environments. Deng et al. [29] introduced the point pair feature network (PPNET), which leverages deep learning to process global 3D point clouds to solve the issue of finding correspondences in unorganized point clouds. Vongkubihisal et al. [30] presented the inverse composition discriminative optimization for 3D point cloud matching to solve the problem that traditional local point cloud registration methods are often affected by the presence of noise, outliers, and poor initialization. These all prove that AI technology has injected new vitality into the SLAM methods, but a large enough dataset is needed to train the model. Moreover, until now, AI-based methods still cannot reach the accuracy and reliability of traditional methods in some conditions.
3. System Overview
The proposed system includes data pre-processing, front-end positioning, and back-end optimization. The proposed method’s input includes an IMU, wheel odometer, and 2D LiDAR, and its output is the mobile robot’s pose and a global map. Figure 1 shows the system overview. The data pre-process step filters the raw sensor data, which includes three parts: (1) filtering the wheel speed based on the bilateral filter [31]; (2) filtering the IMU based on the low-pass filter [32,33]; (3) downsampling the laser point cloud using the voxel filter [34].
Regarding the front-end positioning, we utilized AKF to fuse the IMU and wheel odometer and estimate the mobile robot’s pose. The positioning results from the front-end are not accurate enough to enable map construction. To improve the positioning accuracy and build the map, we utilized the scan-map matching method to estimate the accurate yaw angle and position and used the particle filter to fuse these results with the front-end positioning data. To improve the system’s robustness in complex environments, we introduced the pose domain constraint to the back-end SLAM.
4. Adaptive Kalman Filter
4.1. System Model
The system model in this work can be expressed as
where is the state vector, and is the measurement vector, including the linear velocity observation from the wheel odometer. is the transforming function, and is the observation function. and are the process and measurement noise, respectively, and they are assumed to be zero-mean Gaussian white with covariance matrix , , i.e., and .
The state vector is defined as
where n represents the navigation coordinate system, and b denotes the body coordinate system. is the three-axis position in the n system, and is three-axis velocity in the n system. represents the attitude angle. represents zero bias of the IMU gyroscope. is zero bias of IMU acceleration. here represents the IMU mounting lever arm, and is the IMU mounting angle in the body coordinate system. Based on the IMU preintegration, is expressed as follows:
where is the position at the kth epoch, and is the position at the th epoch. denotes the velocity at the th epoch, and denotes the velocity at the kth epoch. and are rotation matrices representing the transformation from the body coordinate system to the navigation coordinate system and from the IMU coordinate system to the body coordinate system, respectively. can be transformed from the Euler angle , and can be transformed according the Euler angle . denotes the acceleration measurements at the kth epoch, and denotes the zero bias of the acceleration at the th epoch. M represents the scale matrix for tangential and centripetal acceleration due to rotation, which can be estimated by the measurement of the IMU gyroscope, and the estimation process can be expressed as (12). represents the mounting arm of the IMU at the th epoch, and represents the earth gravity vector. is the attitude angle at the kth epoch, and is the attitude angle at the th epoch. denotes the gyroscope measurements at the kth epoch, and denotes the zero bias of the gyroscope at the th epoch.
where is the scale factor of tangential acceleration due to rotation motion, and denotes the scale factor of centripetal acceleration due to rotation motion. represents the time derivatives of the angular velocities. The measurement vector can be modeled as
where is the linear velocity measurement provided by the wheel odometer at the kth epoch. denotes the quaternion transforming from the body frame to the navigation frame at the kth epoch. Thus, the observation function can be expressed as matrix H, which is as follows:
To sum up, the problem of estimating the position and velocity of the mobile robot is to infer the current mobile state from the observation sequence . Within the framework of Bayesian inference, the problem corresponds to computing the marginal posterior . It has been demonstrated that the Kalman filter (KF) can achieve optimal estimation in a linear state model with Gaussian white noise. However, in (5), the transforming process of the state components of the gyroscope zero bias and the IMU mounting angle is nonlinear, which limits the application of KF. Hence, in our state estimation problem, we resorted to the EKF method, which can effectively solve the non-linear optimization problem [35]. In addition, the uncertainty for the covariance of the processing noise and measurement noise will influence the robustness and accuracy of the filter, so the adaptive factors are introduced into the EKF model to obtain the adaptive Kalman filter (AKF). In this section, the AKF will be detailed.
4.2. State Estimation with AKF
The AKF includes two steps, and it can be summarized as follows:
4.2.1. State Prediction
Based on the state model, the state prediction can be calculated as
where represents the prior state estimation at the kth epoch, and represents posterior state estimation at the th epoch. denotes the state transformer matrix at the kth epoch, and denotes the adaptive factor that is utilized to adjust the processing noise covariance matrix . is the prior state error covariance matrix at the kth epoch, and is the posterior state error covariance matrix at the th epoch. represents the processing error transform matrix at the th epoch. The adaptive factor is expressed as
where L is the length of the sliding window, and is the weight of the adaptive element . b denotes the Suge forgetting factor, and b is satisfied with . The state transform matrix , processing noise covariance matrix , and the processing error transform matrix are expressed as
where denotes the error sigma of the acceleration measurements, denotes the error sigma of the gyroscope measurements. represents the error sigma of the zero bias of the acceleration, and represents the error sigma of the zero bias of the gyroscope. is the error sigma of the IMU mounting angle, and is the error sigma of the IMU mounting lever arm. M represents the scale matrix for tangential and centripetal acceleration due to rotation, which is expressed as (12). denotes the acceleration measurement, and denotes the gyroscope measurement.
4.2.2. State Update
The state updated is calculated as follows:
where represents the Kalman filter gain, and H represents the measurement transform matrix, which is expressed as . denotes the adaptive factor that is utilized to adjust the error covariance matrix of the measurements R, and is expressed as
where L represents the length of the sliding window, and represents the weight of the adaptive element , and it is expressed as .
5. PF in the Back-End Optimization
In the back-end optimization, we further improved the positioning accuracy from the front-end and built the grid probability map based on PF. The particle state is modeled as
where is the particle j state at kth epoch. is the 2D position of the particle j at kth epoch. is the yaw of the particle j at the kth epoch. The particle sampling is modeled as
where denotes the predicted particle j state at the kth epoch, and denotes the particle j state at the th epoch. represents the odometer measurement from the front-end, and represents the odometer measurement noise, which is generated as a zero-mean Gaussian distribution. is expressed as
where represents the move distance from the front-end, and it can be expressed as (34). denotes the yaw change from the front-end, and it can be expressed as (35).
where is the mobile position in m system at the kth epoch, and is the mobile position in m system at the th epoch.
where represents the robot’s yaw at the kth epoch, and represents the robot’s yaw at th epoch. and are from and , separately. Normally, to improve the accuracy of the particle pose, the predicted particle pose is to be optimized by a scan-map matcher, and the optimization is based on the ICP method [9,36]. The optimized particle pose is denoted as . The particle weight is calculated as
where represents the standard deviation of the LiDAR data, which is normally equal to the resolution of the grid map. denotes the mean coordinate of the historical LiDAR points in the ith grid. denotes the LiDAR point at the current frame, which falls within the ith grid. is obtained by transforming the original laser point from b system to m system based on the particle j state , and the transforming process can be expressed as
where is the original laser point, and is the transformed laser point. The particle quantity is calculated as
where J is the particles’ number, and is the normalized weight of the particle j. We resampled the particle when the drops below the threshold . The optimized results of the PF are as
To improve the robustness, the pose from AKF is introduced as the constraint, and the process is as
where denotes the move distance from PF, and denotes the yaw change from PF. and are thresholds. After (40) and (41), the PF results will be updated according to (39), and the grid map will be extended based on .
6. Experimental Results
6.1. Experiments Settings
6.1.1. Experiment Platform
To evaluate the proposed method’s positioning performance, a six-wheel robot has been utilized for data recording, which is equipped with an IMU, wheel odometer, and 2D LiDAR. The sensor configurations are shown in Table 1. A GS-100G laser scanner, which is from the GEOSUN company in China, provides the reference trajectory. According to [37], the horizontal position accuracy is 0.02 m, and the elevation position accuracy is 0.03 m, which illustrates that the accuracy of the reference trajectory is satisfactory with the experimental evaluation. The overall experiment platform is shown in Figure 2.
6.1.2. Experiment Scenarios
The experiments were carried out in four scenarios, which include a corridor, hall, park, and playground, and the experiment scenarios are shown in Figure 3. As shown in Figure 3a, the corridor is approximately 3 m wide and 50 m long, and the valid measurement range of the 2D LiDAR is less than the length of the corridor, which might be a LiDAR degenerated environment. As shown in Figure 3b, the hall is approximately 14 m wide and 18 m long, and there are rich geometric features and a good closure. As shown in Figure 3c, the parking lot belongs to a semi-open environment. The playground is approximately 32 m wide and 50 m long, and the scenario is relatively open, with sparse geometric objects, as shown in Figure 3d. The selected playground also belongs to the LiDAR-degenerated environment. Recording for the quantitative standards for LiDAR degradation environments, this section is based on the following [38]:
where represents the coordinate of each point in the current laser scan, and N represents the number of points in the current laser scan. is the coordinate of each point in the centroid coordinates of the entire point cloud. In actuality, the range of k is , and it is mapped to the interval to obtain the degeneration degree . is the scale value, which is set as 5 in this work. When the value of is close to 1, it demonstrates a higher degree of degeneration. The mean values of in the selected scenarios are counted in Table 2.
To make the degree of degeneration more obvious, the values of Table 2 are normalized to range among Scenarios 1–4. According to Table 2, it is found that Scenario 1 and Scenario 4 have a higher degree of degeneration compared with that of Scenario 2 and Scenario 3.
6.1.3. Evaluation Metrics
The evaluation metrics include the maximum, mean, and median errors, as well as the root mean square error (RMSE), which is calculated as
where represents the estimated position. is the reference position. K denotes the number of epochs.
6.1.4. System Parameter Settings
In this work, the Q is modeled as
where represents the noise sigma of the zero bias of the gyroscope, and represents the noise sigma of the zero bias of the accelerometer. denotes the noise sigma of the IMU mounting lever arm, and denotes the noise sigma of the IMU mounting angle. R is modeled as
where refers to the noise sigma of the linear velocity measurement from the wheel odometer.
In this work, the parameter settings are as follows: , , , , and .
About the setting of the noise sigma of the accelerometer ( ) and the gyroscope’s zero biases ( ), they are set according to the zero bias stability. According to Table 1, is set as 0.343 (3500 g), and is set as rad/s ( h). In this work, the IMU mounting angle and IMU mounting lever arm are obtained from the external parameter measurements. The accuracy of the distance measurement is about 0.01 m, and the angle measurement’s accuracy is about . Thus, is set as , is set as 0.01 m. is modeled as
where P is the number of pulses rotated in a circle, is the time interval of the sampling. denotes the measurement error of the wheel’s diameter. In this work, P is 360, is 0.01 s, and is set as 0.01 m. Thus, is set as about 0.0017 m/s. Regarding the PF parts, the particle counts are set to 30, the threshold is set to 1 m, and is set to 30°.
6.2. Positioning and Mapping Results
The algorithm comparison is shown in Table 3. The compared algorithms include Karto SLAM, Hector SLAM, and Cartographer. The Karto SLAM, Hector SLAM, and the proposed method utilized the AKF as the front-end, and the Cartographer utilized pre-integration as the front-end.
In this section, Karto SLAM is selected with the melodic-devel version, Hector SLAM is selected with Version 0.4.1, and Cartographer is selected with Cartographer_ros master. These algorithms are running on the computer with an Intel Core i7-13700KF, and random access memory with 32 GB. The environment of the computer is Ubuntu-18.04, and the version of ROS is Melodic.
6.2.1. Mapping Results
From Figure 4, Figure 5, Figure 6 and Figure 7, we observed that the map from the proposed method has the highest accuracy compared with Karto SLAM, Hector SLAM, and Cartographer. In Scenario 1 and Scenario 4, it is observed that Karto SLAM, Hector SLAM, and Cartographer failed to build the map and localization, but the proposed method has built a relatively accurate map and provided a reliable positioning service, which suggests that the proposed method can provide a reliable mapping and positioning service in the LiDAR-degenerated environments. Figure 5a illustrates that the map built by Karto SLAM has many ghosting areas, which might result from the Karto SLAM utilizing window search during iteration. Figure 6b illustrates that Hector SLAM failed to build the map in Scenario 3, and that the yaw from Hector SLAM has obvious errors. Hector SLAM is based on the Gauss–Newton method to match the scan and grid map. When there are not enough features of the external environment, the Gauss–Newton method falls into the local optimal problem.
6.2.2. Positioning Results
Figure 8 illustrates the positioning results, which show that the proposed method is closest to the reference trajectory compared with other algorithms. It is observed that the Karto SLAM fails to position in Scenario 1, and there are many burrs in the trajectory of the Karto SLAM in Scenario 2. It is suggested that Karto SLAM has some limitations in practical application. The back-end optimization of Karto SLAM is based on matching submaps, and the optimization method utilizes a multi-scale grid search approach. Compared with the matching laser scan and grid map, the matching submaps method can enrich the features of the scan to some extent, but the multi-scale grid search approach has some limitations; too large an initial search step size will result in the mis-optimization problem. The attitude estimation of Hector SLAM has obvious errors in Scenario 3. The Hector SLAM is based on matching the scan and grid map, and the optimization method is based on the Gauss–Newton method to estimate the robot position and the yaw . When there are not enough features of the external environment, the Gauss–Newton method easily diverges, especially in the yaw estimation, because yaw estimation is a non-linear problem. In Scenario 4, the proposed method can only provide a reliable positioning service, which suggests that the constraint information from AKF can obviously improve the robustness of the back-end optimization in LiDAR-degenerated environments.
Figure 9 demonstrates the positioning errors, and Table 4 illustrates the positioning error static results. It is observed that the proposed method has the highest positioning accuracy in scenarios 1–4. According to Table 4, based on the RMSE, it is found that the positioning accuracy of the proposed method is improved by approximately 61.3–97.9%, 35.7–99.0%, and 43.8–93.0% compared to Karto SLAM, Hector SLAM, and Cartographer, respectively. Especially in Scenario 1, the proposed method improved the positioning accuracy by about 86.6%, 75.4%, and 77.2% compared to Karto SLAM, Hector SLAM, and Cartographer, respectively. In Scenario 4, the positioning accuracy of the proposed method is improved by about 97.9%, 99.0%, and 93.0% compared to Karto SLAM, Hector SLAM, and Cartographer, respectively. It is suggested that the proposed method has obviously improved the robustness of the system in the LiDAR-degenerated environments.
7. Discussion
The experiment results indicate that the proposed method has a relatively high robustness in LiDAR-degenerated environments. In the selected environments, the compared algorithms, i.e., Karto SLAM, Hector SLAM, and Cartographer, still have some problems. Scenario 1 is a long corridor, which belongs to a typical LiDAR-degenerated environment. In this scenario, the compared methods all failed to provide a reliable positioning service. The typical problem is that the LiDAR-odometry is less than the actual distance, which is due to the features of the external environment being highly similar, resulting in the error state estimation. Scenario 2 is an indoor hall, which has rich environmental features. Under this case, Hector SLAM and Cartographer have very high performance, but Karto SLAM is not ideal. Karto SLAM optimizes the pose based on the multi-scale search window method. An unreasonable step setting easily results in the local optimization problem. Scenario 3 is a parking lot, which includes many moving objects. Field tests illustrate that Karto SLAM and Cartographer have good performance, but Hector SLAM fails. Hector SLAM is based on the Gauss–Newton method to optimize the pose. Normally, this optimization method has relatively high accuracy, but when the environmental features are not rich enough, or there are too many moving obstacles, it can easily result in mis-optimization, especially in angular estimation. Scenario 4 is a playground, which also belongs to a LiDAR-degenerated environment, but compared with Scenario 1, it has more sparse features. It means that it is more difficult to obtain a reliable positioning estimation according to the LiDAR-odometry. In this section, the parameters of the proposed method are evaluated, including the length of the sliding window for the adaptive factors L, and the comparison between the PF-only and AKF+PF.
7.1. Influence of the Sliding Window Length L
In this section, we analyze the impact of the sliding window length L on positioning accuracy, with the root mean square error (RMSE) employed as the static evaluation metric. In this work, the sliding window length L is set to 1000 by default, while values of 50, 100, 500, 1000, 1500, and 2000 are also evaluated. The corresponding RMSE results are summarized in Table 5. It can be observed that the sliding window length L has a relatively limited influence on positioning accuracy, indicating low error sensitivity to this parameter.
7.2. Comparison Between the PF-Only and AKF+PF
In this section, a comparison is made between the PF-only and AKF+PF, with the performance evaluation metric primarily based on the positioning RMSE. The comparison results are tabulated, and the findings are presented in Table 6.
It is found that the AKF+PF has a higher positioning than that of PF-only in the selected scenarios, and the positioning accuracy is improved by about 90.4%, 52.6%, 60.9%, and 97.5%, respectively. It suggests that the fusion of the AKF can improve the positioning accuracy and robustness of the system. The reason for that is that the AKF can provide a reliable estimation of the robot’s pose, and this estimation can avoid mis-optimization during PF optimization, which is obviously after the LiDAR-degenration environments, such as Scenario 1 and Scenario 4.
8. Conclusions
In this paper, an innovative 2D-LiDAR SLAM is proposed to solve the issue that the traditional 2D-LiDAR SLAM method has difficulty in providing a reliable positioning service in LiDAR-degenerated scenarios. This method estimates the rough position by fusing the IMU and wheel odometer, and the fusing method is based on the AKF. The rough position will be utilized to predict the particle pose and constrain the back-end optimization. Compared to Karto SLAM, Hector SLAM, and Cartographer, the proposed method improves positioning accuracy by approximately 61.3–97.9%, 35.7–99.0%, and 43.8–93.0%, respectively. It is suggested that, in complex environments, such as LiDAR-degenerated environments like Scenario 1 and Scenario 4, the proposed method is still effective to provide a reliable positioning service. However, there are still some limitations for the proposed method, such as the fact that the wheel odometer can provide accurate linear velocity measurements, but the accuracy of the linear velocity measurement will decrease when the vehicle skids. In addition, the robustness of the proposed method is still low due to the limitation of the single 2D LiDAR’s performance. In the future, a new multi-sensor fusion positioning method will be researched to overcome the problem of vehicle skidding, and the fusion of the multi-2D LiDARs or 3D LiDAR will also be explored to further improve the system’s robustness and build the 3D map.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Smith R.C. Cheeseman P. On the representation and estimation of spatial uncertainty Int. J. Robot. Res.19865566810.1177/027836498600500404 · doi ↗
- 2Jung J. Lee S.-M. Myung H. Indoor Mobile Robot Localization and Mapping Based on Ambient Magnetic Fields and Aiding Radio Sources IEEE Trans. Instrum. Meas.2015641922193410.1109/TIM.2014.2366273 · doi ↗
- 3Yin J. Luo D. Yan F. Zhuang Y. A Novel Lidar-Assisted Monocular Visual SLAM Framework for Mobile Robots in Outdoor Environments IEEE Trans. Instrum. Meas.202271850391110.1109/TIM.2022.3190031 · doi ↗
- 4Xing Z. Ma G. Wang L. Yang L. Guo X. Chen S. Toward Visual Interaction: Hand Segmentation by Combining 3-D Graph Deep Learning and Laser Point Cloud for Intelligent Rehabilitation IEEE Internet Things J.202512213282133810.1109/JIOT.2025.3546874 · doi ↗
- 5Messbah H. Emharraf M. Saber M. Robot indoor navigation: Comparative analysis of Li DAR 2D and visual SLAM IAES Int. J. Robot. Autom.2024132089485610.11591/ijra.v 13i 1.pp 41-49 · doi ↗
- 6Zheng Z. Su K. Lin S. Fu Z. Yang C. Development of vision–based SLAM: From traditional methods to multimodal fusion Robot. Intell. Autom.20244452954810.1108/RIA-10-2023-0142 · doi ↗
- 7Yang N. Wang R. Gao X. Cremers D. Challenges in Monocular Visual Odometry: Photometric Calibration, Motion Bias and Rolling Shutter Effectar Xiv 20181705.0430010.1109/LRA.2018.2846813 · doi ↗
- 8Chen P. Zhao X. Zeng L. Liu L. Liu S. Sun L. Li Z. Chen H. Liu G. Qiao Z. A Review of Research on SLAM Technology Based on the Fusion of Li DAR and Vision Sensors 202525144710.3390/s 2505144740096278 PMC 11902412 · doi ↗ · pubmed ↗
