Geometry OR Tracker: Universal Geometric Operating Room Tracking
Yihua Shao, Kang Chen, Feng Xue, Siyu Chen, Long Bai, Hongyuan Yu, Hao Tang, Jinlin Wu, Nassir Navab

TL;DR
This paper introduces Geometry OR Tracker, a two-stage system that improves multi-view 3D tracking in operating rooms by rectifying calibration errors and robustly tracking in a unified world frame, enhancing accuracy in clinical settings.
Contribution
The novel two-stage pipeline combines geometric rectification with occlusion-robust tracking, addressing calibration issues common in real clinical environments.
Findings
Reduces cross-view depth disagreement by over 30×
Improves tracking accuracy with better geometric consistency
Demonstrates the importance of calibration quality for tracking performance
Abstract
In operating rooms (OR), world-scale multi-view 3D tracking supports downstream applications such as surgeon behavior recognition, where physically meaningful quantities such as distances and motion statistics must be measured in meters. However, real clinical deployments rarely satisfy the geometric prerequisites for stable multi-view fusion and tracking: camera calibration and RGB-D registration are always unreliable, leading to cross-view geometric inconsistency that produces "ghosting" during fusion and degrades 3D trajectories in a shared OR coordinate frame. To address this, we introduce Geometry OR Tracker, a two-stage pipeline that first rectifies imprecise calibration into a scaleconsistent and geometrically consistent camera setup with a single global scale via a Multi-view Metric Geometry Rectification module, and then performs Occlusion-Robust 3D Point Tracking directly in…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobotics and Sensor-Based Localization · Augmented Reality Applications · Advanced Vision and Imaging
