TL;DR
This paper introduces a novel image translation model that enhances synthetic endoscopic images with realistic textures while preserving key structural details, enabling effective training for endoscopic segmentation tasks without real labeled data.
Contribution
A new structure-aware image translation method that maintains scene layout in synthetic endoscopy images, improving training for segmentation tasks without real annotations.
Findings
Generated images improve fold segmentation accuracy.
Method outperforms existing image translation techniques.
Effective training on synthetic data generalizes to real data.
Abstract
Automatic segmentation of anatomical landmarks in endoscopic images can provide assistance to doctors and surgeons for diagnosis, treatments or medical training. However, obtaining the annotations required to train commonly used supervised learning methods is a tedious and difficult task, in particular for real images. While ground truth annotations are easier to obtain for synthetic data, models trained on such data often do not generalize well to real data. Generative approaches can add realistic texture to it, but face difficulties to maintain the structure of the original scene. The main contribution in this work is a novel image translation model that adds realistic texture to simulated endoscopic images while keeping the key scene layout information. Our approach produces realistic images in different endoscopy scenarios. We demonstrate these images can effectively be used to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
