A Coarse-to-Fine Framework with Curvature Feature Learning for Robust Point Cloud Registration in Spinal Surgical Navigation
Lijing Zhang, Wei Wang, Tianbao Liu, Jiahui Guo, Bo Wu, Nan Zhang

TL;DR
This paper introduces a new framework for aligning 3D scans in spinal surgery, improving accuracy and robustness despite noise and poor overlap.
Contribution
A coarse-to-fine registration framework with curvature feature learning for robust point cloud alignment in spinal navigation.
Findings
The proposed method achieves average rotation and translation errors of 0.34° and 0.27 mm in noise-free conditions.
Under noisy conditions, the method shows average rotation and translation errors of 7.28° and 9.08 mm.
The algorithm outperforms existing methods in accuracy, speed, and robustness on pre- and intra-operative datasets.
Abstract
In surgical navigation-assisted pedicle screw fixation, cross-source pre- and intra-operative point clouds registration faces challenges like significant initial pose differences and low overlapping ratio. Classical algorithms based on feature descriptor have high computational complexity and are less robust to noise, leading to a decrease in accuracy and navigation performance. To address these problems, this paper proposes a coarse-to-fine registration framework. In the coarse registration stage, a Point Matching algorithm based on Curvature Feature Learning (CFL-PM) is proposed. Through CFL-PM and Farthest Point Sampling (FPS), the coarse registration of overlapping regions between the two point clouds is achieved. In the fine registration stage, the Iterative Closest Point (ICP) is used for further optimization. The proposed method effectively addresses the challenges of noise,…
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Click any figure to enlarge with its caption.
Figure 1
Figure 2
Figure 3
Figure 4
Figure 5
Figure 6
Figure 7
Figure 8
Figure 9- —Beijing Natural Science Foundation
- —Beijing Municipal Education Commission’s Research Program
- —Chinese Institutes for Medical Research, Beijing
- —National Natural Science Foundation of China
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMedical Imaging and Analysis · Robotics and Sensor-Based Localization · Medical Image Segmentation Techniques
1. Introduction
Pedicle screw fixation is a common spinal surgical method for treating spinal fractures, lumbar spondylolisthesis and other diseases [1]. However, due to the spine’s complex structure and high precision requirements, the procedure is challenging for surgeons. Surgical navigation systems improve accuracy and safety by providing precise positioning, and are widely adopted in neuro-navigation [2,3,4], spinal [5,6], laparoscopic [7,8], knee replacement [9,10], and mandibular surgeries [11,12], playing an increasingly important role in various fields [13]. Registration is one of the core technologies in surgical navigation systems, and its precision is an important factor affecting the accuracy of the navigation system [14]. Therefore, many scholars are actively engaged in research to improve the accuracy of registration in navigation systems with a view to further optimizing the performance of the system [15,16,17]. In the study of registration applied to neuro-surgical navigation systems [15], surface registration based on automatic machine learning is proposed, which can improve the registration accuracy through experimental verification. In total knee arthroplasty assisted by a surgical navigation system [16], a point-to-surface iterative closest point registration algorithm is used, and the RMSE is 0.3 mm.
Surgical navigation-assisted pedicle screw fixation still faces challenges in achieving accurate the pre- and intra-operative registration. First, pre- and intra-operative point clouds from different imaging devices vary significantly in density and position. Second, intra-operative scanning limitations—such as limited anatomical exposure, soft tissue occlusion, and noise—reduce the low overlapping ratio between pre- and intra-operative point clouds. The point cloud registration algorithms in the above application will fail in this navigation-assisted context.
In terms of the existing classical feature extraction registration algorithms, Liu [18] proposed an algorithm integrating feature regions and Super4PCS for coarse registration under low overlap, which improves accuracy, success rate and reduces computation time compared to Super4PCS. Wang [19] developed a coarse registration algorithm using grid normal deviation angle statistics, improving alignment accuracy for partially overlapping point clouds. To address noise and low overlap, Zhang [20] proposed an SSR feature descriptor and design SSR-Net. The results demonstrated the effectiveness of the proposed method. Yan [21] proposed a hybrid approach combining SHOT with RANSAC for initial alignment, followed by symmetric ICP refinement, achieving clinically acceptable registration precision. Zhang [22] optimized registration by selecting FPS-based local regions for SAC-IA before ICP refinement, outperforming other methods in accuracy and success rate.
However, classic feature extraction-based point cloud registration algorithms have higher computational complexity, long execution times, and weaker noise robustness compared to deep learning algorithms [23,24,25,26,27]. To address this issue, Zhang [28] proposed an end-to-end network adapting focus on overlapping regions, and experiments proved it outperformed existing methods above 30% overlap and detects these regions accurately. Zhou [29] introduced SCANet, a spatial and channel attention network, which outperforms existing algorithms on partially overlapping point clouds with Gaussian noise. Li [30] proposed a transformer-based algorithm to address low overlap challenges. This algorithm employed a learnable geometric position update module and a deeper cross-attention module. Results demonstrate the proposed algorithm exhibits improvements over existing similar methods.
These methods do not consider both cross-source and a low overlapping ratio simultaneously, and cannot be directly applied to the pre- and intra-operative point clouds registration of pedicle screw fixation. Therefore, inspired by [31], this paper proposes a novel coarse-to-fine registration framework. The main contributions of this paper are as follows,
A novel and robust coarse-to-fine registration framework has been proposed to address significant initial pose differences, low overlapping ratio, and noise interference issues.A novel Curvature Feature Learning-based Point Matching (CFL-PM) algorithm based on a curvature feature coder and graph attention network is proposed. The algorithm effectively generates more reliable correspondences for coarse registration and shows strong anti-interference ability against noise.A challenging dataset consisting of cross-source, low-overlapping pre- and intra-operative point cloud pairs to simulate real surgical environments. The noise-free conditions simulate an ideal surgical scenario, while noisy conditions simulate various noises present in the surgical field, such as soft tissues and blood. The results verified the feasibility and robustness of the proposed algorithm in the surgical navigation system.
The structure of this paper is as follows, Section 2 introduces the proposed algorithm. Section 3 describes the experimental setup, the results and the analysis. Section 4 concludes the paper.
2. Materials and Methods
In surgical navigation system-assisted pedicle screw fixation, an important prerequisite for accurate intra-operative positioning is the alignment of the patient’s pre-operative point cloud with the actual intra-operative point cloud. The algorithmic framework proposed in this paper is shown in Figure 1.
The proposed algorithm identifies and registers the corresponding relationship by learning point features. During the training process, firstly, the intra-and the pre- with corresponding spines operative point clouds containing curvature information are input into the point-wise feature encoder, to obtain the key point subset , and corresponding feature vectors , .
Secondly, through the graph attention network, the updated point features , are obtained. And the generated features are used to determine the corresponding relationship between the points in the pre- and intra-operative point clouds. Thirdly, the Random Sample Consensus (RANSAC) method is used to delete the wrong matches from the matching relationship and perform coarse registration.
During testing, the pre-operative point cloud is sampled by FPS, and divided into multiple local regions. Based on the CFL-PM algorithm, registration errors between the intra-operative point cloud and each local region are compared to identify the highest-overlap region. The local region with the minimum error provides the coarse registration transformation. Finally, ICP is used for further optimization to obtain the optimal rotation matrix and translation vector .
2.1. Curvature Feature Encoder
The curvature feature encoder consists of a point-wise feature encoder and a graph attention network [31]. The network structure of a point-wise feature encoder is shown in Figure 2, including three Set Abstraction (SA) layers SA1–SA3 and one Feature Propagation (FP) layer [32].
The input to the curvature feature encoder consists of 3D point coordinates and 1D corresponding features. The curvature value of the point is taken as the feature input, and its calculation is shown in Equation (1) [33,34],
where , , are the eigenvalues of the covariance matrix of the point-fitting surface ( ). The curvature estimation neighborhood radius in this study is set to 5 mm.
The pre- and intra-operative point clouds are input to the encoder separately, and after the SA and FP layers, the coordinates of the sampled subsets (key-points) of the pre- and intra-operative point clouds are output, denoted by X and Y, respectively, as well as their respective feature vectors , . Each SA layer does the FPS method for the points from the previous layer, which reduces the number of key points and improves the computational time without losing the information of the points on the overlapping region. The hyper-parameters n, r, and L in the SA layer are selected through several experiments to select the values corresponding to the time when the training effect is the best.
However, the output feature vectors , represent the local information, are not related to the global context. And the features of the points will change with the change in the neighborhood point density, which increases the difficulty of correctly identifying the correspondences. In order to reduce the impact of the above problems on registration, the graph attention network is added. Through the self-attention and cross-attention of the source and target point clouds [35], the updated point features , are obtained.
2.2. Correspondence Identification and Transformation Matrix Estimation
The correspondence is obtained by comparing the features of the key points X and Y of the source and target point clouds. Through the feature vectors , , the similarity function (Equation (2)) of the key point matching represents the matching probability between X and Y,
where N is the number of key points, T is a hyper-parameter, and the SoftMax function is a normalized exponential function. Each represents the probability that the i-th source key point matches the j-th target key point, while each row represents its matching distribution across all target key points.
Next, considering that the two point clouds partially overlap, only some points have corresponding relationships. The noise of the scanner sensor and the difference in point density will also cause encoding errors and matching relationship errors. To reduce the influence of outliers, the RANSAC method is used and then estimates the transformation matrix.
The training process optimizes the encoder by minimizing the loss function [31] to maximize the match probability of the corresponding points and minimize the match probability of the non-corresponding points under the ground-truth transformation matrix to obtain the transformation matrix between the source and target point clouds and achieve coarse registration. The initial values of the hyper-parameters T in Equation (2) and in the loss function are set to 1 × 10^−2^ and 10, respectively.
2.3. Pre- and Intra-Operative Registration of Cross-Source and Low Overlapping Ratio Point Clouds
In practical applications, the low overlapping ratio and large density difference present a significant challenge to correctly identifying the correspondence and achieve successful alignment. To address this issue, the CFL-PM method is combined with the FPS to achieve pre- and intra-operative registration.
Firstly, the intra-operative point cloud is down-sampled to achieve a density that is similar to the pre-operative point cloud. For the pre-operative point cloud Q, the FPS method is employed in order to obtain a set of sampling points Q′ (Figure 3a). A neighborhood search is then performed to divide the pre-operative point cloud into multiple local regions, with the local regions forming a candidate set (Figure 3b). The search radius is set to r, and for each point p_i_ in the sampled points, its corresponding 3D region can be expressed as Equation (3).
Next, traverse the candidate set, based on the trained model, calculate the depth features of the local regions and the intra-operative point cloud, perform feature matching and pose estimation, and then calculate the rough alignment’s rotation error and translation error [36], as shown in Equations (4) and (5),
Comparing the rotation error of the matching between the intra-operative point cloud and the local regions, as shown in Equation (6). The local region corresponding to the smallest error is the optimal local region, and its corresponding transformation is the transformation of the intra- and the pre-operative point clouds for coarse registration.
The FPS and CFL-PM methods determine point correspondences to complete the coarse registration of the optimal local region with the intra-operative point cloud. Finally, ICP fine registration ensures better alignment despite large initial positional differences and low overlapping ratio, achieving cross-source, low-overlapping ratio pre- and intra-operative point cloud registration.
3. Results and Discussion
3.1. Evaluation Metrics
To assess the accuracy and robustness of the proposed algorithm for pre- and intra-operative point clouds registration, it is compared with three algorithms: ICP based on FPFH features (FPFH + ICP) [37], ICP based on SHOT features (SHOT + ICP) [38], and ICP based on FPS and FPFH (FPS + FPFH + ICP) [22]. The registration performance is gauged by rotation error and translation error . Additionally, the runtime considered is the time taken for registering two point clouds, excluding the loading time for the point cloud data.
3.2. Model Training and Testing
The patients’ CT data used come from the SpineWeb Dataset [39,40] and the affiliated hospital, including lumbar vertebrae and cervical vertebrae. Pre-operative point cloud data is obtained through three-dimensional reconstruction of the patient’s pre-operative CT images. The reconstructed pre-operative CT are 3D printed. A COMET6 structured light scanner scanned the surgically exposed areas of the patient from the 3D printed phantom to simulate an actual intra-operative procedure to acquire intra-operative point cloud data. The intra-operative point cloud contained only the single spinous process. Figure 4 shows the cumulative distribution of point cloud pairs in the dataset in terms of rotation angle , , , and translation distance . The rotation angle and distance are obtained through ground-truth matrix. The ground-truth transformation matrix for the initial cross-source point cloud samples was established by performing a manual alignment. Then, the ground-truth transformation matrix was further refined by the ICP algorithm [41]. After data augmentation, we obtained 142 points cloud samples in total.
The dataset for training and validation comprised 142 samples. The dataset was divided into a training set and a validation set at a 3:1 ratio. Each sample consists of a pair of pre- and intra-operative point clouds and a ground-truth transformation matrix for registration. The dataset includes vertebrae with varying degrees of degeneration and anatomical morphology to ensure diversity. The test set comprises additional data, as shown in Table 1. Since the pre-operative point cloud is sampled at a frame rate of 25 local regions (as shown in Figure 3b), the intra-operative exposed spinous process point cloud must be sequentially matched with each segmented local region. Therefore, each test data point contains 25 sets of pre-intra operative point cloud pairs.
During the training process, the Adam optimizer is used, with the learning rate set to 1 × 10^−4^, and the learning rate is halved every five Epochs. The batch size and Epoch are set to 4 and 100, respectively.
To verify the trained model and the proposed algorithm, the results obtained are shown in Figure 5. It can be seen that the coarse registration achieves rotation error < 15°, translation error < 10 mm, and the registration time is approximately 0.1 s.
The operating system used in the experiment was Ubuntu 18.04, conducted on a NVIDIA Tesla V100 GPU with 32 GB memory and an Intel (R) Xeon (R) Silver 4116 CPU @ 2.10 GHz. The software environment includes CUDA 11.4, Python 3.9.12, and PyTorch 1.7.1.
3.3. Experiment on Pre- and Intra-Operative Registration of Cross-Source and Low Overlapping Ratio Point Clouds
In this experiment, the trained model registers intra- and pre-operative point clouds on a noise-free dataset, and evaluates the registration performance. The test data includes three cervical vertebrae and eight lumbar vertebrae cases. The pre-operative point clouds cover complete vertebrae, while intra-operative ones focus on surgically exposed areas. Table 1 details exposed sites, overlapping ratios, and initial pose differences between the pre- and intra-operative point clouds, providing insights into the registration effectiveness and challenges.
Figure 6 illustrates the registration results of point cloud data from Table 1 under various initial poses. The first column displays the initial positions of the pre- and intra-operative point clouds. The second column shows the initial pose of the intra-operative point cloud and pre-operative spinous point cloud selected through FPS. The third column displays the coarse registration results obtained from the trained model. The fourth column reveals the fine registration results after ICP refinement. The fifth column visualizes the ultimate registration result.
Table 2 compares the accuracy and efficiency of the proposed algorithm and the other three algorithms. The results show that the proposed algorithm outperforms the others. Under the FPFH + ICP and SHOT + ICP, Data 1, 3, 6, and 8 cannot be successfully registered. Data 9 cannot be successfully registered under the FPFH + ICP. Data 11 and Data 4 fail to register under the SHOT + ICP algorithm. When the C1 spinous process is exposed in data 1, FPS + FPFH + ICP selected the best local area, but struggled with accurate alignment during the ICP stage. In contrast, the proposed algorithm successfully registered all datasets. For point cloud data that all can be successfully registered, at the coarse registration stage, the average rotation error and translation error of FPFH + ICP are 18.891° and 24.785 mm, respectively. With SHOT + ICP, the average rotation error is 20.392° and the translation error is 21.957 mm. The average rotation and translation errors of FPS + FPFH + ICP are 9.317° and 9.326 mm, respectively. Under the proposed algorithm, the average rotation error is 5.679°, and the translation error is 7.420 mm. The coarse registration error of the algorithm proposed is the smallest, which can provide a better initial pose for fine registration and accelerate convergence. The final average rotation error and translation error of the proposed algorithm are 0.342° and 0.268 mm, respectively, meeting clinical requirements. In terms of speed, proposed algorithm is the fastest, taking only 9.862 s on average, compared to much longer times for the other algorithms.
3.4. Robustness Evaluation
To evaluate the robustness of the proposed algorithm to noise, Gaussian noise with different standard deviations is added to the intra-operative point cloud, and compared with the registration algorithms based on FPFH and SHOT. The intra-operative point cloud used in the experiment still consists of scans from the exposed surgical site. The pre-operative point cloud data is manually extracted from the complete lumbar and cervical vertebrae point clouds, specifically targeting the spinous processes of the respective vertebrae. The initial positioning and orientation obtained from the previous experiment are maintained. The experimental data remains the same as the point cloud data presented in Table 1. Figure 7 visualizes the effects of adding noise with standard deviations of 0.5 mm, 0.75 mm, and 1.0 mm to intra-operative point cloud of dataset 1.
The experiment obtained the rotation error and translation error of pre- and intra-operative point clouds registration under the interference of different levels of standard deviation noise δ, as shown in Table 3. It can be seen from the results in the table that the registration errors of the three feature matching algorithms all increase with the increase in noise. Among them, both FPFH-based and SHOT-based coarse registration have results with errors greater than 60° or 60 mm. When the coarse registration error is less than 60, the mean coarse registration error under different δ is calculated as the smallest error of CFL-PM algorithm. The mean is 7.28°, and the mean of coarse registration is 9.08 mm. CFL-PM algorithm is stable for registration with or without noise, and has stronger robustness than the other two feature matching algorithms.
Figure 8 visualizes the pre- and intra-operation point clouds coarse registration results for Data 9 with Gaussian noise standard deviation of 1, which can be more intuitively seen that the proposed algorithm can improve the robustness of registration.
4. Conclusions
This study presented a novel coarse-to-fine registration framework to address the critical challenges of large initial pose differences, low overlap, and noise in surgeon-navigated pedicle screw placement. The proposed CFL-PM algorithm, rooted in curvature feature learning, demonstrated its efficacy as a robust solution for the critical coarse registration stage. The experimental results confirmed that the proposed method achieves superior performance in terms of accuracy (e.g., 0.34° rotation error and 0.27 mm translation error under ideal conditions), computational efficiency, and robustness to noise. Besides the technical metrics of accuracy and speed, the proposed registration framework holds potential impact on enhancing clinical workflows in spinal surgery. For patient counseling, surgeons can more effectively explain the surgical plans, thereby helping to alleviate patient anxiety. For intra-operative surgical judgment, the proposed method showed the ability to achieve robust and precise registration between pre- and intra-operative point clouds, even under challenging conditions of noise and low overlap. The surgeons can more confidently make decisions based on the pre-determine the optimal pedicle screw trajectory. Daly [42] proposed a markerless tracking system based on a clinical RGB-D camera, which is capable of capturing the point cloud of the intra-operative spine. The proposed CFL-PM algorithm can be utilized in this scenario to extract patient-specific spinal features, to achieve precise registration between pre-operative and intra-operative point clouds. This enables the augmented visualization of the patient’s surgical field during surgery with surgical navigation-assisted pedicle screw fixation, allowing the procedure to be executed according to the screw trajectory planned pre-operatively.
Despite these encouraging results, several limitations of the current work should be acknowledged. The dataset used although is sufficient for the algorithm, further validation through prospective clinical trials and larger datasets is necessary to establish its generalization in different patient groups and surgical environments. Future work may focus on integrating this algorithm into a clinical navigation platform and validating its benefits through clinical studies. At the same time, its applicability in other orthopedic navigation procedures beyond the placement of pedicle screws will also be explored.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Yan K. Zhang Q. Liu B. He D. Liu Y.J. Tian W. The Clinical Application of Tianji II Robot-assisted Pedicle Screw Internal Fixation for Thoracolumbar Spine Beijing Biomed. Eng.202241297301
- 2Smith A.D. Teague A.J. Naik A. Janbahan M. Smith E.J. Krist D.T. Parupalli S. Teal K. Hassaneen W. Robotic External Ventricular Drain Placement for Acute Neurosurgical Care in Low-Resource Settings: Feasibility Considerations and a Prototype Design Neurosurg. Focus 202252 E 1410.3171/2021.10.FOCUS 2154434973667 · doi ↗ · pubmed ↗
- 3Wu B. Liu P. Xiong C. Li C. Zhang F. Shen S. Shao P. Yao P. Niu C. Xu R. Stereotactic Co-Axial Projection Imaging for Augmented Reality Neuro-navigation: A Proof-Of-Concept Study Quant. Imaging Med. Surg.2022123792380210.21037/qims-21-114435782260 PMC 9246757 · doi ↗ · pubmed ↗
- 4Li Y.H. Jiang S. Yang Z.Y. Yang S. Zhou Z.Y. Microscopic augmented reality calibration with contactless line-structured light registration for surgical navigation Med. Biol. Eng. Comput.2025631463147910.1007/s 11517-025-03288-z 39806119 · doi ↗ · pubmed ↗
- 5Tu P.X. Qin C.X. Guo Y. Li D.Y. Lungu A.J. Wang H.X. Ultrasound Image Guided and Mixed Reality-Based Surgical System with Real-Time Soft Tissue Deformation Computing for Robotic Cervical Pedicle Screw Placement IEEE Trans. Biomed. Eng.2022692593260310.1109/TBME.2022.315095235157575 · doi ↗ · pubmed ↗
- 6Liu Z. Hsieh C. Hsu W. Tseng C. Chang C. Two-Dimensional C-arm Robotic Navigation System (i-Navi) in Spine Surgery: A Pilot Study Int. J. Comput. Assist. Radiol. Surg.2022172281229010.1007/s 11548-022-02751-836100733 · doi ↗ · pubmed ↗
- 7Li D. Wang M. A 3D Image Registration Method for Laparoscopic Liver Surgery Navigation Electronics 202211167010.3390/electronics 11111670 · doi ↗
- 8Zhang X.H. Otoo E.M. Fan Y.B. Tao C.J. Wang T.M. Rhode K. Autostereoscopic 3D Augmented Reality Navigation for Laparoscopic Surgery: A Preliminary Assessment IEEE Trans. Biomed. Eng.2022701413142110.1109/TBME.2022.321748136301782 · doi ↗ · pubmed ↗
