TAToo: Vision-based Joint Tracking of Anatomy and Tool for Skull-base Surgery
Zhaoshuo Li, Hongchao Shu, Ruixing Liang, Anna Goodridge, Manish Sahu,, Francis X. Creighton, Russell H. Taylor, Mathias Unberath

TL;DR
TAToo is a novel vision-based system that jointly tracks the 3D motion of both the surgical tool and patient anatomy in skull-base surgery using stereo videos, enabling markerless intra-operative guidance.
Contribution
TAToo introduces an end-to-end differentiable, probabilistic approach for simultaneous 3D tracking of anatomy and tools directly from surgical videos, without markers.
Findings
Achieves sub-millimeter accuracy for skull tracking
Attains rotation errors below 1 degree
Performs favorably compared to existing methods
Abstract
Purpose: Tracking the 3D motion of the surgical tool and the patient anatomy is a fundamental requirement for computer-assisted skull-base surgery. The estimated motion can be used both for intra-operative guidance and for downstream skill analysis. Recovering such motion solely from surgical videos is desirable, as it is compliant with current clinical workflows and instrumentation. Methods: We present Tracker of Anatomy and Tool (TAToo). TAToo jointly tracks the rigid 3D motion of patient skull and surgical drill from stereo microscopic videos. TAToo estimates motion via an iterative optimization process in an end-to-end differentiable form. For robust tracking performance, TAToo adopts a probabilistic formulation and enforces geometric constraints on the object level. Results: We validate TAToo on both simulation data, where ground truth motion is available, as well as on…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSurgical Simulation and Training · Augmented Reality Applications · Robotics and Sensor-Based Localization
