Long-Term Temporally Consistent Unpaired Video Translation from   Simulated Surgical 3D Data

Dominik Rivoir; Micha Pfeiffer; Reuben Docea; Fiona Kolbinger; Carina; Riediger; J\"urgen Weitz; Stefanie Speidel

arXiv:2103.17204·cs.CV·August 20, 2021

Long-Term Temporally Consistent Unpaired Video Translation from Simulated Surgical 3D Data

Dominik Rivoir, Micha Pfeiffer, Reuben Docea, Fiona Kolbinger, Carina, Riediger, J\"urgen Weitz, Stefanie Speidel

PDF

Open Access 1 Repo

TL;DR

This paper introduces a novel method combining unpaired image translation and neural rendering to achieve long-term, view-consistent video translation from simulated to photorealistic surgical scenes, enhancing training and evaluation in medical applications.

Contribution

It proposes a new approach that integrates global textures and view-consistency loss to produce globally consistent, long-term videos from simulated surgical data.

Findings

01

Achieves long-term temporal consistency in unpaired video translation.

02

Produces view-consistent, photorealistic surgical videos from simulated data.

03

Enables better training and evaluation for surgical applications.

Abstract

Research in unpaired video translation has mainly focused on short-term temporal consistency by conditioning on neighboring frames. However for transfer from simulated to photorealistic sequences, available information on the underlying geometry offers potential for achieving global consistency across views. We propose a novel approach which combines unpaired image translation with neural rendering to transfer simulated to photorealistic surgical abdominal scenes. By introducing global learnable textures and a lighting-invariant view-consistency loss, our method produces consistent translations of arbitrary views and thus enables long-term consistent video synthesis. We design and test our model to generate video sequences from minimally-invasive surgical abdominal scenes. Because labeled data is often limited in this domain, photorealistic data where ground truth information from the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

https://gitlab.com/nct_tso_public/surgical-video-sim2real
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · Advanced Vision and Imaging · Computer Graphics and Visualization Techniques