4D Spatio-Temporal Convolutional Networks for Object Position Estimation   in OCT Volumes

Marcel Bengs; Nils Gessert; Alexander Schlaefer

arXiv:2007.01044·eess.IV·July 3, 2020

4D Spatio-Temporal Convolutional Networks for Object Position Estimation in OCT Volumes

Marcel Bengs, Nils Gessert, Alexander Schlaefer

PDF

TL;DR

This paper introduces 4D spatio-temporal CNNs for improved object position estimation in OCT volumes, leveraging temporal data to enhance tracking accuracy in surgical applications.

Contribution

It systematically extends 3D CNNs to 4D CNNs for OCT-based object tracking, demonstrating significant accuracy improvements by incorporating temporal information.

Findings

01

4D CNNs reduce mean absolute error by 30%

02

Temporal streams improve marker object tracking accuracy

03

4D approach outperforms single volume 3D CNNs

Abstract

Tracking and localizing objects is a central problem in computer-assisted surgery. Optical coherence tomography (OCT) can be employed as an optical tracking system, due to its high spatial and temporal resolution. Recently, 3D convolutional neural networks (CNNs) have shown promising performance for pose estimation of a marker object using single volumetric OCT images. While this approach relied on spatial information only, OCT allows for a temporal stream of OCT image volumes capturing the motion of an object at high volumes rates. In this work, we systematically extend 3D CNNs to 4D spatio-temporal CNNs to evaluate the impact of additional temporal information for marker object tracking. Across various architectures, our results demonstrate that using a stream of OCT volumes and employing 4D spatio-temporal convolutions leads to a 30% lower mean absolute error compared to single…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.