PhySe-RPO: Physics and Semantics Guided Relative Policy Optimization for Diffusion-Based Surgical Smoke Removal
Zining Fang, Cheng Xue, Chunhui Liu, Bin Xu, Ming Chen, and Xiaowei Hu

TL;DR
PhySe-RPO introduces a physics and semantics-guided diffusion framework for surgical smoke removal, enabling robust, interpretable restoration with limited supervision in surgical videos.
Contribution
It transforms deterministic restoration into a stochastic policy with physics and semantic rewards, improving robustness and interpretability in surgical smoke removal.
Findings
Produces physically consistent and semantically faithful restorations.
Effective on both synthetic and real surgical datasets.
Operates without requiring paired supervision.
Abstract
Surgical smoke severely degrades intraoperative video quality, obscuring anatomical structures and limiting surgical perception. Existing learning-based desmoking approaches rely on scarce paired supervision and deterministic restoration pipelines, making it difficult to perform exploration or reinforcement-driven refinement under real surgical conditions. We propose PhySe-RPO, a diffusion restoration framework optimized through Physics- and Semantics-Guided Relative Policy Optimization. The core idea is to transform deterministic restoration into a stochastic policy, enabling trajectory-level exploration and critic-free updates via group-relative optimization. A physics-guided reward imposes illumination and color consistency, while a visual-concept semantic reward learned from CLIP-based surgical concepts promotes smoke-free and anatomically coherent restoration. Together with a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
