Seeing Through Smoke: Surgical Desmoking for Improved Visual Perception

Jingpei Lu; Fengyi Jiang; Xiaorui Zhang; Lingbo Jin; Omid Mohareri

arXiv:2603.25867·cs.CV·March 30, 2026

Seeing Through Smoke: Surgical Desmoking for Improved Visual Perception

Jingpei Lu, Fengyi Jiang, Xiaorui Zhang, Lingbo Jin, Omid Mohareri

PDF

1 Datasets

TL;DR

This paper introduces a transformer-based model for surgical desmoking that enhances endoscopic image clarity, utilizing a large synthetic dataset and real surgical images to improve visualization and downstream tasks.

Contribution

The authors develop a novel physics-inspired desmoking model, create the largest paired surgical smoke dataset, and demonstrate state-of-the-art performance in image reconstruction and downstream surgical tasks.

Findings

01

State-of-the-art image reconstruction performance.

02

Synthetic data pipeline yields over 80,000 training samples.

03

Desmoking improves stereo depth estimation and instrument segmentation.

Abstract

Minimally invasive and robot-assisted surgery relies heavily on endoscopic imaging, yet surgical smoke produced by electrocautery and vessel-sealing instruments can severely degrade visual perception and hinder vision-based functionalities. We present a transformer-based surgical desmoking model with a physics-inspired desmoking head that jointly predicts smoke-free image and corresponding smoke map. To address the scarcity of paired smoky-to-smoke-free training data, we develop a synthetic data generation pipeline that blends artificial smoke patterns with real endoscopic images, yielding over 80,000 paired samples for supervised training. We further curate, to our knowledge, the largest paired surgical smoke dataset to date, comprising 5,817 image pairs captured with the da Vinci robotic surgical system, enabling benchmarking on high-resolution endoscopic images. Extensive experiments…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Datasets

artJiang20/SeeThroughSmoke
dataset· 134 dl
134 dl

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.